Lesson 3 of 5·10 min read

Transparency and Explainability

When an AI system says "No" — loan application denied, resume rejected, insurance claim dismissed — people need to understand why. This isn't just ethically important — under the EU AI Act, it's legally required for high-risk applications.

The Black-Box Problem

Modern deep learning models have billions of parameters. Even their developers often can't explain exactly why a model makes a specific decision. This creates a fundamental trust problem.

Two dimensions of transparency:

  • Model transparency: How does the system work internally?
  • Decision transparency: Why was this specific decision made?

Explainable AI (XAI) — Methods

Model-Agnostic Methods

  • SHAP (SHapley Additive exPlanations): Shows each feature's contribution to the prediction. Based on game theory.
  • LIME (Local Interpretable Model-agnostic Explanations): Creates a simple explanation model for individual predictions.
  • Counterfactual Explanations: "If you had €5,000 more income, the loan would have been approved."

Model-Specific Methods

  • Attention Maps (Transformers): Which input tokens influence the output?
  • Feature Importance (Gradient Boosting): Ranking of the most important variables
  • Decision Trees: Inherently interpretable but less powerful

Transparency Levels for Companies

LevelMeasureEffort
BasicPublish model cards and datasheetsLow
StandardProvide explanations per decisionMedium
AdvancedInteractive explanations for end usersHigh

Practical Recommendations

  1. Model Cards: Document purpose, training data, limitations, and fairness metrics for every model.
  2. Explainability by Design: Choose the simplest model that accomplishes the task. Not every task needs an LLM.
  3. Stakeholder-appropriate explanations: Engineers need SHAP values; customers need plain language.

Practical tip: Start with Model Cards (Google format). They force teams to think about limitations and risks before a model goes live.