Abstract
The comparison between discriminative and generative classifiers has
intrigued researchers since Efron's seminal analysis of logistic regression
versus discriminant analysis. While early theoretical work established that
generative classifiers exhibit lower sample complexity but higher asymptotic
error in simple linear settings, these trade-offs remain unexplored in the
transformer era. We present the first comprehensive evaluation of modern
generative and discriminative architectures for text classification:
Auto-regressive modeling, Masked Language Modeling, Discrete Diffusion, and
Encoders.
Our study reveals that the classical 'two regimes' phenomenon manifests
distinctly across different architectures and training paradigms. Beyond
accuracy, we analyze sample efficiency, calibration, noise robustness, and
ordinality across diverse scenarios. Our findings offer practical guidance for
selecting the most suitable modeling approach based on real-world constraints
such as latency and data limitations.
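To make the classical "two regimes" trade-off concrete, the sketch below (not from the paper; the synthetic dataset and sample sizes are illustrative assumptions) reproduces the Ng-and-Jordan-style comparison the abstract alludes to: a generative Gaussian Naive Bayes classifier tends to do better with few training examples, while a discriminative logistic regression overtakes it as data grows.

```python
# Minimal sketch (assumed setup, not the paper's experiments): compare a
# generative classifier (Gaussian Naive Bayes) against a discriminative one
# (logistic regression) across growing training-set sizes.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

X, y = make_classification(n_samples=20_000, n_features=20, random_state=0)
X_pool, X_test, y_pool, y_test = train_test_split(
    X, y, test_size=0.5, random_state=0
)

for n in (50, 200, 1_000, 5_000):  # increasing amounts of labeled data
    Xtr, ytr = X_pool[:n], y_pool[:n]
    gen = GaussianNB().fit(Xtr, ytr)                        # generative
    disc = LogisticRegression(max_iter=1_000).fit(Xtr, ytr)  # discriminative
    print(f"n={n:>5}  NB acc={gen.score(X_test, y_test):.3f}  "
          f"LR acc={disc.score(X_test, y_test):.3f}")
```

Typically the Naive Bayes column rises quickly and plateaus while logistic regression keeps improving, mirroring the lower-sample-complexity / higher-asymptotic-error pattern the paper revisits for transformers.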
Authors (10)
Siva Rajesh Kasa
Karan Gupta
Sumegh Roychowdhury
Ashutosh Kumar
Yaswanth Biruduraju
Santhosh Kumar Kasa
+4 more
Key Contributions
Provides the first comprehensive evaluation of modern generative and discriminative transformer architectures for text classification, analyzing trade-offs in accuracy, sample efficiency, calibration, and robustness. The study shows how the classical 'two regimes' phenomenon manifests in the transformer era and offers practical guidance for model selection.
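Calibration, one of the axes compared here, is commonly quantified with expected calibration error (ECE). The sketch below is a generic binned-ECE implementation under standard assumptions, not the paper's exact evaluation protocol.

```python
# Minimal sketch (assumed metric definition, not the paper's code): expected
# calibration error bins predictions by confidence and averages the gap
# between mean confidence and empirical accuracy, weighted by bin size.
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            gap = abs(correct[mask].mean() - confidences[mask].mean())
            ece += mask.mean() * gap  # bin weight times |accuracy - confidence|
    return ece

# Toy usage: an overconfident classifier yields a large ECE.
print(expected_calibration_error([0.9, 0.95, 0.8, 0.85], [1, 0, 1, 0]))
```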
Business Value
Helps organizations choose the most effective transformer-based model for text classification given constraints such as data availability, robustness requirements, and desired output properties, leading to better performance and efficiency.