Lesson 2 of 5·10 min read

Detecting and Mitigating Bias

AI bias isn't about bad intentions — it almost always stems from unconscious distortions in data, algorithms, or deployment decisions. If you don't actively look for bias, you'll only find it after damage is done.

The Three Types of Bias

1. Data Bias

Training data reflects historical inequalities. When a credit scoring model is trained on data where certain demographic groups were historically disadvantaged, it reproduces that inequality.

Example: Image recognition models trained primarily on photos of light-skinned people perform significantly worse on darker skin tones (Gender Shades Study, MIT).

2. Algorithm Bias

Model design itself can amplify distortions. Optimizing for a single metric (e.g., accuracy) can disadvantage minority groups that are underrepresented in the data.

3. Deployment Bias

A model works perfectly in the lab, but in practice distortions arise from:

  • Different user groups than expected
  • Changed data distributions (data drift)
  • Feedback loops that amplify bias

Practical Tools for Bias Detection

ToolProviderUse Case
FairlearnMicrosoftFairness metrics for ML models
AI Fairness 360IBMBias detection and mitigation
What-If ToolGoogleInteractive model analysis
Evidently AIOpen SourceProduction monitoring

Mitigation Strategies

  1. Pre-processing: Clean, augment, and re-sample training data
  2. In-processing: Build fairness constraints into training
  3. Post-processing: Output calibration and threshold adjustment
  4. Monitoring: Continuous monitoring with drift alerting

Practical tip: Define fairness metrics per use case. "Fairness" is context-dependent — statistical parity, equal opportunity, and predictive parity cannot all be satisfied simultaneously (Impossibility Theorem).

Bias detection isn't a one-time task but a continuous process. Build bias checks into your CI/CD pipeline.