Lesson 08 of 10AI Healthcare Quality & Safety

Bias, Equity &
Fairness in Clinical AI

Algorithmic bias in healthcare is not a theoretical concern — it is a documented reality that has caused differential clinical harm to patients based on race, gender, age, socioeconomic status, and disability. Understanding how bias enters AI systems, how it manifests clinically, and how to evaluate and mitigate it is a fundamental professional responsibility.

What you will learn
Define algorithmic bias and explain the mechanisms by which it enters healthcare AI systems
Identify documented examples of AI bias causing differential clinical harm
Describe the multiple dimensions of AI fairness and the trade-offs between them
Apply a structured approach to bias evaluation for a clinical AI system
Explain the governance and organizational requirements for equitable AI deployment

How bias enters
healthcare AI systems

Algorithmic bias in healthcare AI arises through multiple mechanisms, often simultaneously. Training data bias is the most fundamental — when the population represented in training data is not representative of the population in whom the system will be deployed, the model systematically underperforms for underrepresented groups. Historical healthcare data frequently underrepresents racial and ethnic minorities, patients with disabilities, patients from lower socioeconomic backgrounds, and women — because access to care, documentation practices, and research participation have all historically been unequal.

Proxy variable bias occurs when AI systems use variables that appear clinically neutral but function as proxies for protected characteristics. The most documented example in healthcare AI: a widely used commercial algorithm used healthcare spending as a proxy for health need, on the assumption that sicker patients incur higher costs. For Black patients, systemic barriers to care access meant lower spending despite equivalent or greater health need — causing the algorithm to systematically underestimate the health needs of Black patients at similar health status, resulting in fewer referrals for high-need care programs.

Label bias occurs when the clinical outcomes used to train AI models reflect historical inequities in diagnosis and treatment rather than true disease burden. If a diagnostic condition has historically been underdiagnosed in women — as has been documented for cardiovascular disease, autism, and autoimmune conditions — an AI trained to predict that diagnosis will learn the biased historical pattern and perpetuate the underdiagnosis.

Proxy Variable Bias

Variables that appear clinically neutral can function as proxies for race, socioeconomic status, or disability in ways that cause systematic harm. Healthcare cost, zip code, insurance type, and prior healthcare utilization are all frequently used in AI models and all carry potential proxy bias risk that must be evaluated before deployment.

Documented examples
of AI bias causing clinical harm

The 2019 study in Science documenting bias in a commercial risk-stratification algorithm affecting millions of patients is the most cited example — and the most instructive. The algorithm used healthcare cost as a proxy for health need, producing systematic underestimation of Black patients' health needs. The investigators estimated that approximately 46,000 additional high-risk Black patients would have been enrolled in care management programs if the algorithm had not been biased.

Pulse oximetry — a non-AI technology but algorithmically determined — has been documented to systematically overestimate arterial oxygen saturation in patients with darker skin tones, causing delayed recognition of hypoxemia. This example illustrates that algorithmic bias is not a new phenomenon unique to deep learning — it is a design and validation failure that can occur in any algorithmic system when diverse populations are not adequately represented in development and testing.

Dermatology AI systems trained predominantly on images of lighter skin tones have consistently demonstrated degraded performance on patients with darker skin tones — a documented concern that has not prevented widespread deployment of tools with known performance differentials.

Evaluating and mitigating
algorithmic bias in practice

Bias evaluation requires analyzing model performance across demographic subgroups — not just overall accuracy metrics. An AI system that achieves 90% overall accuracy may achieve only 74% accuracy for a specific demographic group. Overall performance metrics mask subgroup disparities, which is why disaggregated performance analysis is a governance requirement for any AI system deployed in a diverse population.

The clinical governance framework for bias evaluation should examine: training data composition — is the training population representative of the deployment population across relevant demographic dimensions? Proxy variable review — do predictor variables function as demographic proxies? Label validation — do training labels reflect true disease burden or historical diagnostic inequity? And subgroup performance analysis — does the model perform differently across demographic groups, and is the differential clinically meaningful?

Bias mitigation strategies include: modifying training data composition to improve representativeness; removing or replacing proxy variables; applying algorithmic fairness constraints during model training; and calibrating model outputs separately for demographic subgroups. No mitigation strategy eliminates bias entirely — the governance obligation is ongoing monitoring, transparent reporting of known performance differentials, and a commitment to continuous improvement.

Disaggregated Performance

Overall model accuracy is an insufficient governance metric. A model that performs at 90% overall may perform at 73% for Black patients, 68% for women, or 61% for elderly patients — and overall accuracy will never reveal this. Disaggregated performance analysis by demographic subgroup is a minimum governance requirement for equitable AI deployment.

Key concepts
from this lesson

Key Concept

Algorithmic Bias

Systematic errors in AI outputs that produce differential outcomes across demographic groups — arising from training data, proxy variables, or label quality.

Key Concept

Training Data Bias

Underrepresentation of specific populations in training data, causing systematic underperformance for those groups in deployment.

Key Concept

Proxy Variable Bias

When AI models use variables that appear neutral but function as proxies for protected characteristics — causing indirect discrimination.

Key Concept

Disaggregated Performance

Analysis of model performance separately for demographic subgroups — the minimum governance standard for equitable AI deployment.

Key Concept

Label Bias

When training labels reflect historical diagnostic inequities rather than true disease burden — causing AI to learn and perpetuate those inequities.

Key Concept

Algorithmic Fairness

A set of technical and governance principles for designing and evaluating AI systems that do not produce unjustified differential outcomes.

Case Study

The algorithm that spent its way to bias

A large US health system uses a commercial risk-stratification algorithm to identify patients for enrollment in a complex care management program. The algorithm analyzes patient demographics, clinical history, and — most significantly — healthcare cost data to generate a risk score. High-scoring patients are referred to care coordinators for intensive support.

An internal equity review, prompted by a national publication identifying the same bias in a similar commercial product, reveals a significant disparity: Black patients enrolled in the care management program are substantially sicker than White patients enrolled at the same algorithm-generated risk score. Black patients require enrollment at a higher actual disease burden to generate the same algorithmic risk score as White patients.

The cause: healthcare cost — the primary proxy for health need in the algorithm — is lower for Black patients not because they are healthier, but because systemic barriers to care access have produced lower historical utilization. The algorithm had learned that lower spending equals lower need, invisibly encoding a history of healthcare inequity into a risk score used to allocate care.

The health system removes healthcare cost from the algorithm's predictor variables, recalibrates the model, and increases the proportion of Black patients identified as high-risk by 46% — without any change in actual disease burden in the population.

What this illustrates

This case illustrates how historical healthcare inequity can be algorithmically encoded and perpetuated at scale. The algorithm was not designed to discriminate. It learned to discriminate from data that reflected decades of unequal access. Preventing this requires explicit bias evaluation — examining not just what variables the model uses, but what those variables represent in the lives of the patients they describe.

Reflection Prompt

What assumptions are embedded in the AI systems you use?

Every AI system embeds assumptions — about which data is available, which outcomes matter, which populations were represented in development. For each AI system in your organization, ask: what are the predictor variables? Could any of them function as proxies for race, socioeconomic status, or disability? Has performance been analyzed separately for different demographic groups? If the answers are not available, they should be. Knowing what you don't know about an AI system is the beginning of responsible governance.

Further Learning

The Commonwealth Fund and the National Academy of Medicine have both published substantive work on health equity and algorithmic bias in healthcare AI. The Robert Wood Johnson Foundation's work on structural racism and health equity provides essential context for understanding why healthcare data reflects the inequities it does. Available through their respective websites.

Knowledge Check — Lesson 08

1. A commercial risk-stratification algorithm uses healthcare cost as a proxy for health need. For a population where access barriers have historically suppressed healthcare utilization for specific demographic groups, this produces:

AAccurate risk stratification because healthcare cost reliably reflects disease burden across all populations
BProxy variable bias — groups with lower utilization due to access barriers are underestimated as high-risk despite equivalent or greater health need
CLabel bias — the outcome variable has been incorrectly defined
DDistribution shift — the model has been deployed in a population different from its training data

2. A dermatology AI system achieves 94% sensitivity for melanoma detection overall, but 71% sensitivity for patients with Fitzpatrick skin types V and VI. The appropriate governance response is:

AThe overall performance is strong and clinical deployment should proceed without modification
BDeploy the system only for patients with Fitzpatrick skin types I–IV where performance meets clinical standards
CConduct a bias assessment, investigate the cause of the performance differential, and determine whether the differential is clinically acceptable before deployment decisions are made
DRequest that the vendor retrain the model on a dataset with higher representation of darker skin tones before making a deployment decision

3. Label bias in a clinical AI training dataset most directly refers to:

AIncorrectly formatted data labels causing model training errors
BTraining outcomes that reflect historical diagnostic inequities rather than true disease burden
CInsufficient label volume for minority class outcomes in the training dataset
DInconsistent labeling practices between different clinical sites contributing to the training dataset

4. Which performance metric is most important for equity governance of a clinical AI system deployed in a diverse population?

AOverall model accuracy across the full deployment population
BAUC-ROC — the area under the receiver operating characteristic curve
CDisaggregated performance metrics analyzed separately for relevant demographic subgroups
DPositive predictive value across the full deployment population

5. Which of the following best describes why eliminating algorithmic bias entirely from healthcare AI is not currently achievable?

AAI technology is not yet sophisticated enough to account for demographic variables
BHealthcare organizations do not collect sufficient demographic data for bias evaluation
CBias mitigation reduces overall model performance to a clinically unacceptable level
DTraining data reflects historical healthcare inequities, and no mitigation strategy can completely eliminate the influence of that history