Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Large language models (LLMs) increasingly generate natural language
rationales to enhance interpretability, but these often contain logical errors,
label mismatches, and domain-specific misalignments. Directly using such
rationales as supervision risks propagating noise and undermining training
stability. To address this challenge, we introduce Self-Filtered Distillation,
a framework specifically tailored for patent classification, which treats
LLM-generated rationales as trust signals rather than ground-truth supervision.
The framework employs selective distillation guided by three unsupervised trust
metrics: (1) Self-Consistency, which measures the stability of LLM-generated
rationales across multiple generations; (2) Class Entailment Alignment, which
assesses semantic coherence with patent-specific class definitions; and (3) LLM
Agreement Scoring, which validates rationale-label plausibility. These metrics
are integrated into a unified trust score that primarily weights training
samples while optionally filtering out extremely low-trust cases, enabling
reasoning-aware supervision. Experiments on the USPTO-2M dataset, a widely used
benchmark for patent classification, show that our method outperforms
label-based learning and conventional distillation in accuracy, stability, and
interpretability, establishing a reliable paradigm for leveraging
reasoning-aware trust indicators in patent analytics.
Key Contributions
This paper introduces Self-Filtered Distillation, a novel framework for patent classification that treats LLM-generated rationales as trust signals rather than ground-truth supervision. It addresses the issue of noisy and unreliable LLM rationales by employing three unsupervised trust metrics (Self-Consistency, Class Entailment Alignment, LLM Agreement Scoring) to selectively distill knowledge, thereby improving training stability and reliability.
Business Value
Enhances the accuracy and reliability of automated patent classification systems, which can significantly reduce manual review costs and speed up the patent examination process for legal firms and patent offices.