Abstract
Large Language Models (LLMs) exhibit a troubling duality: they are capable of both
remarkable generalization and brittle, verbatim memorization of their training
data. This unpredictability undermines their reliability in high-stakes
applications. In this work, we propose a unified framework to understand,
identify, and control these distinct reasoning modes. First, we introduce a
theoretical model based on the Information Bottleneck (IB) principle,
formalizing generalization as the learning of a compressed, task-relevant
representation and memorization as a failure to compress. Building on this
theory, we develop Dynamic Mode Steering (DMS), a novel inference-time
algorithm comprising two components: (1) a lightweight, causally grounded
linear probe that identifies the model's instantaneous reliance on
memorization, and (2) a dynamic activation steering mechanism that nudges the
model's computation towards pre-identified generalization circuits. We frame
DMS as a form of adaptive, self-contrastive decoding. Experiments on reasoning
and faithfulness tasks demonstrate that DMS significantly improves logical
consistency and factual accuracy, thereby offering a principled approach to
enhancing LLM reliability.
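For context, the compression/relevance tradeoff the abstract appeals to is usually written as the standard Information Bottleneck objective below. This is the textbook form, not necessarily the exact formalization used in the paper, with Z the learned representation, X the input, and Y the task-relevant target.

```latex
% Standard Information Bottleneck objective (textbook form); the paper's exact
% instantiation for LLM representations is not given on this page.
% Z: learned representation, X: input, Y: task-relevant target,
% beta: compression/relevance tradeoff coefficient.
\[
  \min_{p(z \mid x)} \; I(X; Z) \;-\; \beta \, I(Z; Y)
\]
% Under this reading, generalization corresponds to compressing X into Z while
% retaining information about Y; memorization corresponds to a failure to
% compress (I(X;Z) stays high without a matching gain in I(Z;Y)).
```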
Submitted
October 25, 2025
Key Contributions
This paper proposes a unified theoretical framework based on the Information Bottleneck principle to understand generalization and memorization in LLMs. It introduces Dynamic Mode Steering (DMS), a novel inference-time algorithm that identifies memorization reliance and steers computation towards generalization circuits, enhancing LLM reliability.
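To make the description above concrete, here is a minimal sketch of what one DMS-style decoding step could look like: a linear probe scores the current hidden state for memorization reliance, and the activation is nudged along a precomputed "generalization" direction in proportion to that score. All names (`dms_step`, `HIDDEN_DIM`, `alpha`), the random placeholder weights, and the specific gating rule are illustrative assumptions, not the paper's implementation.

```python
# Illustrative sketch of a Dynamic Mode Steering (DMS) inference-time step.
# The probe weights and steering direction are random placeholders standing in
# for a trained probe and a direction derived from identified circuits.
import torch

torch.manual_seed(0)

HIDDEN_DIM = 64  # hypothetical hidden size

# (1) Lightweight linear probe: maps a hidden state to a memorization score.
probe = torch.nn.Linear(HIDDEN_DIM, 1)

# (2) Steering direction toward pre-identified "generalization circuits".
steer_dir = torch.nn.functional.normalize(torch.randn(HIDDEN_DIM), dim=0)


def dms_step(hidden: torch.Tensor, alpha: float = 4.0) -> torch.Tensor:
    """Apply one DMS-style adjustment to a hidden state at inference time.

    hidden: (HIDDEN_DIM,) activation at the current decoding step.
    alpha:  maximum steering strength (hypothetical hyperparameter).
    """
    # Probe the model's instantaneous reliance on memorization, in [0, 1].
    p_mem = torch.sigmoid(probe(hidden)).item()
    # Nudge the activation toward the generalization direction, scaled by
    # how strongly the probe flags memorization.
    return hidden + alpha * p_mem * steer_dir


if __name__ == "__main__":
    h = torch.randn(HIDDEN_DIM)      # stand-in for a real hidden state
    h_steered = dms_step(h)
    print(f"steering shift norm: {(h_steered - h).norm():.3f}")
```

Because the adjustment is gated by the probe's per-step score rather than applied uniformly, the intervention stays adaptive, which is consistent with the paper's framing of DMS as a form of adaptive, self-contrastive decoding.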
Business Value
Increases the trustworthiness and safety of LLMs for critical applications like healthcare, finance, and legal analysis by reducing unpredictable memorization.