arxiv_ml 95% Match Research Paper AI Ethicists,Healthcare AI Developers,Medical Researchers,RL Researchers 2 weeks ago

FairGRPO: Fair Reinforcement Learning for Equitable Clinical Reasoning

ai-safety › fairness

📄 Abstract

Abstract: Medical artificial intelligence systems have achieved remarkable diagnostic capabilities, yet they consistently exhibit performance disparities across demographic groups, causing real-world harm to underrepresented populations. While recent multimodal reasoning foundation models have advanced clinical diagnosis through integrated analysis of diverse medical data, reasoning trainings via reinforcement learning inherit and often amplify biases present in training datasets dominated by majority populations. We introduce Fairness-aware Group Relative Policy Optimization (FairGRPO), a hierarchical reinforcement learning approach that promotes equitable learning across heterogeneous clinical populations. FairGRPO employs adaptive importance weighting of advantages based on representation, task difficulty, and data source. To address the common issue of missing demographic labels in the clinical domain, we further employ unsupervised clustering, which automatically discovers latent demographic groups when labels are unavailable. Through comprehensive experiments across 7 clinical diagnostic datasets spanning 5 clinical modalities across X-ray, CT scan, dermoscropy, mammography and ultrasound, we demonstrate that FairGRPO reduces predictive parity by 27.2% against all vanilla and bias mitigated RL baselines, while improving F1 score by 12.49%. Furthermore, training dynamics analysis reveals that FairGRPO progressively improves fairness throughout optimization, while baseline RL methods exhibit deteriorating fairness as training progresses. Based on FairGRPO, we release FairMedGemma-4B, a fairness-aware clinical VLLM that achieves state-of-the-art performance while demonstrating significantly reduced disparities across demographic groups.

Authors (4)

Shiqi Dai

Wei Dai

Jiaee Cheong

Paul Pu Liang

Submitted

October 22, 2025

arXiv Category

cs.LG

arXiv PDF

Key Contributions

Introduces FairGRPO, a hierarchical reinforcement learning approach to promote equitable learning across heterogeneous clinical populations. FairGRPO uses adaptive importance weighting and unsupervised clustering to address performance disparities and bias in medical AI, particularly in scenarios with missing demographic labels.

Business Value

Enhances the trustworthiness and reliability of AI systems in healthcare, ensuring equitable outcomes for all patient populations and mitigating risks associated with biased decision-making.

Paper Metadata

Innovation Type

Algorithmic Improvement

Deployment Feasibility

Medium, requires integration into clinical AI systems and validation with real-world healthcare data.

Limitations Addressed

Addresses performance disparities and bias amplification in medical AI systems, especially in reinforcement learning, and tackles the challenge of missing demographic labels in clinical data.

Performance Gains

Promotes equitable learning and reduces performance disparities across different demographic groups in clinical reasoning tasks.

Technical Tags

FairnessReinforcement LearningClinical ReasoningEquitable LearningBias MitigationHierarchical RLImportance WeightingUnsupervised ClusteringDemographic DisparitiesMultimodal Reasoning

Research Topics

AI FairnessHealthcare AIReinforcement LearningBias in Machine LearningExplainable AI

Methods & Architectures

Fairness-aware Group Relative Policy Optimization (FairGRPO)Hierarchical Reinforcement LearningAdaptive Importance WeightingUnsupervised ClusteringPolicy Optimization Hierarchical Reinforcement Learning Model

Applications & Tasks

Healthcare Clinical Decision Support Medical AI Performance Disparities Across DemographicsAmplification of Bias in RLLack of Demographic LabelsEquitable AI in Healthcare Equitable Clinical ReasoningFair Policy Optimization in Healthcare

Related Fields

Artificial IntelligenceMachine LearningHealthcare InformaticsEthics in AIReinforcement Learning

Keywords

FairnessAI EthicsReinforcement LearningClinical AIBiasHealthcareEquitable AIHierarchical RLImportance WeightingUnsupervised Learning

Academic Context

#AI Fairness#Healthcare AI#Reinforcement Learning#Bias in Machine Learning#Explainable AI

Commercial Potential

Potential Products

Fair clinical decision support systemsBias-auditing tools for healthcare AIEquitable AI training frameworks

Target Industries

HealthcareBiotechnologyMedical DevicesHealth Tech

Use Case Examples

Ensuring fair diagnostic recommendations across different patient groupsDeveloping equitable treatment planning AIMitigating bias in AI-driven medical research

Competitive Edge

Addresses the critical issue of fairness in RL for healthcare, offering a novel hierarchical approach to mitigate demographic performance disparities.

Market Opportunity

Large (healthcare AI market, focus on fairness and equity)

Revenue Models

Software licensingconsulting services for fair AI implementation

Resource Requirements

Compute Needs

Requires significant compute for training hierarchical RL models, especially with large clinical datasets.

Data Requirements

Requires clinical datasets, ideally with demographic information (or suitable for unsupervised clustering).

Deployment Constraints

Integration into clinical workflows, regulatory approval, need for robust validation across diverse populations.

Scalability

Scalability depends on the complexity of the clinical reasoning task and the size of the demographic groups.

Regulatory Considerations

HIPAAFDA regulations for medical devicesAI ethics guidelines in healthcare

Production Readiness

Maturity Level

Research

Time to Market

Long (requires extensive validation and regulatory approval)

Patent Potential

Medium (novel fairness algorithms for RL)

View Full Paper Back to Papers