Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Diagnosis-Related Group (DRG) codes are essential for hospital reimbursement
and operations but require labor-intensive assignment. Large Language Models
(LLMs) struggle with DRG coding due to the out-of-distribution (OOD) nature of
the task: pretraining corpora rarely contain private clinical or billing data.
We introduce DRG-Sapphire, which uses large-scale reinforcement learning (RL)
for automated DRG coding from clinical notes. Built on Qwen2.5-7B and trained
with Group Relative Policy Optimization (GRPO) using rule-based rewards,
DRG-Sapphire introduces a series of RL enhancements to address domain-specific
challenges not seen in previous mathematical tasks. Our model achieves
state-of-the-art accuracy on the MIMIC-IV benchmark and generates
physician-validated reasoning for DRG assignments, significantly enhancing
explainability. Our study further sheds light on broader challenges of applying
RL to knowledge-intensive, OOD tasks. We observe that RL performance scales
approximately linearly with the logarithm of the number of supervised
fine-tuning (SFT) examples, suggesting that RL effectiveness is fundamentally
constrained by the domain knowledge encoded in the base model. For OOD tasks
like DRG coding, strong RL performance requires sufficient knowledge infusion
prior to RL. Consequently, scaling SFT may be more effective and
computationally efficient than scaling RL alone for such tasks.
Authors (7)
Hanyin Wang
Zhenbang Wu
Gururaj Kolar
Hariprasad Korsapati
Brian Bartlett
Bryan Hull
+1 more
Key Contributions
Introduces DRG-Sapphire, a system using large-scale Reinforcement Learning (RL) for automated DRG coding from clinical notes, achieving state-of-the-art accuracy on MIMIC-IV. It enhances explainability by generating physician-validated reasoning and addresses challenges of applying RL to knowledge-intensive, OOD tasks.
Business Value
Automates the complex and labor-intensive DRG coding process, leading to significant cost savings, improved billing accuracy, and better operational efficiency in healthcare institutions.