Fatskills
Practice. Master. Repeat.
Study Guide: AI Trust and Fairness: Bias fairness and discrimination risk
Source: https://www.fatskills.com/ai-for-work/chapter/ai-trust-and-fairness-bias-fairness-and-discrimination-risk

AI Trust and Fairness: Bias fairness and discrimination risk

By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.

⏱️ ~6 min read

Bias, Fairness, and Discrimination Risk in AI

What This Is

Bias, fairness, and discrimination risk refer to systematic errors or prejudices in AI systems that lead to unfair outcomes for certain groups. These risks matter in everyday work because AI tools—from hiring algorithms to loan approval models—can reinforce or amplify societal biases, leading to legal, reputational, and ethical harm. Example: Amazon scrapped an AI recruiting tool after discovering it penalized resumes containing the word "women’s" (e.g., "women’s chess club"), favoring male candidates due to biased training data.


Key Facts & Principles

  • Bias (in AI): Systematic deviation from fairness, often due to flawed data, design, or human assumptions. Example: A facial recognition system trained mostly on light-skinned faces performs poorly on darker-skinned individuals, leading to higher error rates in identification.
  • Fairness: The principle that AI systems should treat all individuals and groups equitably, without favoring or harming any based on protected attributes (e.g., race, gender, age). Example: A mortgage approval AI should not reject applicants from certain ZIP codes at higher rates unless the decision is justified by creditworthiness, not geography.
  • Protected attributes: Characteristics legally or ethically shielded from discrimination (e.g., race, gender, religion, disability). Example: U.S. law prohibits using race as a factor in loan approvals, even if an AI model finds it "predictive."
  • Disparate impact: When an AI system appears neutral but disproportionately harms a protected group. Example: A hiring tool that filters out candidates from historically Black colleges due to lower average test scores in the training data.
  • Proxy variables: Non-protected attributes that correlate with protected ones (e.g., neighborhood-race). Example: Using "commute time" to screen job applicants may indirectly discriminate against low-income groups who live farther from offices.
  • Fairness metrics: Quantitative measures to detect bias (e.g., demographic parity, equalized odds). Example: If a hiring AI selects 30% of male applicants but only 10% of female applicants, it fails demographic parity.
  • Feedback loops: When biased AI outputs reinforce existing inequalities. Example: A predictive policing tool over-policing minority neighborhoods leads to more arrests, which then "justifies" further policing.
  • Explainability vs. fairness: Explainability (understanding how a model works) doesn’t guarantee fairness (ensuring outcomes are equitable). Example: A transparent loan approval model might still reject Black applicants at higher rates if the training data reflects historical discrimination.

Step-by-Step Application

  1. Audit your data
  2. Action: Before training a model, analyze your dataset for imbalances in protected attributes (e.g., gender, race) and proxy variables (e.g., ZIP code, education level).
  3. How: Use tools like IBM’s AI Fairness 360 or Google’s What-If Tool to check for skewed distributions.
  4. Example: If your hiring dataset has 80% male candidates, the model may learn to favor male traits.

  5. Define fairness for your use case

  6. Action: Choose a fairness metric aligned with your goal (e.g., demographic parity for equal selection rates, equalized odds for equal error rates).
  7. How: Consult legal/ethics teams to align with regulations (e.g., EU AI Act, U.S. EEOC guidelines).
  8. Example: For a loan approval model, equalized odds (equal false-positive/false-negative rates across groups) may be more appropriate than demographic parity.

  9. Test for bias in development

  10. Action: Evaluate the model’s performance across subgroups using fairness metrics.
  11. How: Split test data by protected attributes (e.g., race, gender) and compare outcomes.
  12. Example: If a healthcare AI recommends fewer follow-up tests for women than men with the same symptoms, flag it for review.

  13. Mitigate bias (if found)

  14. Action: Apply techniques like:
    • Reweighting: Adjust training data to balance underrepresented groups.
    • Pre-processing: Remove proxy variables (e.g., ZIP code) or use fairness-aware algorithms.
    • Post-processing: Adjust model outputs to meet fairness targets (e.g., calibrate thresholds for different groups).
  15. Example: If a resume screener favors "male-coded" words (e.g., "aggressive"), retrain it on gender-neutral language.

  16. Monitor in production

  17. Action: Continuously track fairness metrics after deployment to catch drift or new biases.
  18. How: Set up dashboards to monitor disparities in outcomes (e.g., approval rates by race) and alert teams if thresholds are breached.
  19. Example: A bank’s loan AI might start rejecting more applicants from a specific demographic after a policy change—monitoring catches this early.

  20. Document and govern

  21. Action: Create a "bias impact statement" for stakeholders, detailing:
    • Data sources and limitations.
    • Fairness metrics used.
    • Mitigation steps taken.
    • Ongoing monitoring plans.
  22. Example: A healthcare AI’s documentation should note that it was tested on U.S. patient data and may not generalize to other populations.

Common Mistakes

  • Mistake: Assuming "neutral" data is unbiased.
  • Correction: Historical data often reflects past discrimination (e.g., fewer women in STEM roles). Audit data for representativeness and consider synthetic data or reweighting to balance underrepresented groups.

  • Mistake: Relying on a single fairness metric.

  • Correction: No metric fits all use cases. Combine metrics (e.g., demographic parity + equalized odds) and align with legal/ethical goals. Example: A hiring tool might pass demographic parity but fail equalized odds if it hires unqualified men at the same rate as qualified women.

  • Mistake: Treating fairness as a one-time fix.

  • Correction: Bias can emerge over time (e.g., feedback loops, data drift). Monitor continuously and retrain models with fresh, audited data.

  • Mistake: Ignoring proxy variables.

  • Correction: Even if you remove protected attributes (e.g., race), proxies (e.g., neighborhood, school) can reintroduce bias. Use statistical tests (e.g., correlation analysis) to identify and mitigate proxies.

  • Mistake: Over-optimizing for accuracy at the expense of fairness.

  • Correction: A model with 95% accuracy may still be unfair if errors disproportionately harm one group. Balance accuracy and fairness using techniques like multi-objective optimization.

Practical Tips

  • Start with high-risk use cases: Prioritize auditing AI systems that impact critical decisions (e.g., hiring, lending, healthcare) or vulnerable groups (e.g., children, minorities).
  • Involve diverse stakeholders: Include representatives from affected groups in design and testing. Example: A team building a voice assistant for elderly users should test it with actual elderly users, not just young engineers.
  • Use "red teaming" for bias: Have a separate team deliberately try to break the model by feeding it edge cases (e.g., resumes with non-Western names, atypical job histories).
  • Automate fairness checks: Integrate bias detection into CI/CD pipelines (e.g., fail a model deployment if fairness metrics drop below thresholds).

Quick Practice Scenario

Scenario: Your team built an AI tool to screen job applicants for a tech role. The model was trained on resumes from your company’s past hires (85% male). During testing, you notice it rejects 40% of female applicants but only 20% of male applicants. What’s the first step to address this?

Answer: Audit the training data for bias and proxy variables. The gender imbalance in the training data likely caused the model to favor male-coded resumes. Check for proxies (e.g., "fraternity" vs. "sorority") and consider reweighting or augmenting the data with more female resumes.


Last-Minute Cram Sheet

  1. Bias-fairness: Bias is a problem; fairness is the goal.
  2. Protected attributes: Race, gender, age, etc.—never use them directly in models. Proxies (e.g., ZIP code) can reintroduce bias.
  3. Disparate impact: A model can be unfair even if it doesn’t use protected attributes.
  4. Fairness metrics: Demographic parity (equal selection rates), equalized odds (equal error rates).
  5. Feedback loops: Biased outputs-more biased data-worse models. Break the cycle.
  6. Explainability-fairness: A transparent model can still be unfair.
  7. Mitigation techniques: Reweighting, pre-processing, post-processing.
  8. Monitor in production: Bias can emerge after deployment (e.g., data drift).
  9. Document everything: Data sources, fairness metrics, mitigation steps.
  10. Trap: "The model is fair because it’s accurate." Accuracy-fairness—check subgroup performance.