Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
Proposes a novel confidence estimation method for RAG systems using raw FFN activations, which aligns better with LLM output correctness than traditional methods. It models confidence as a sequence classification task with Huber loss regularization, enabling response abstinence and outperforming baselines in a real-world financial setting under strict latency constraints.
Enhances the safety and reliability of LLMs in critical applications like finance and healthcare by enabling them to abstain from answering when uncertain, preventing potentially harmful incorrect responses. This builds user trust and reduces risk.