Localist LLMs with Recruitment Learning

Abstract

We present a novel framework for training large language models with continuously adjustable internal representations that span the full spectrum from localist (interpretable, rule-based) to distributed (generalizable, efficient) encodings. The key innovations are (1) a locality dial, a tunable parameter that dynamically controls the degree of localization during both training and inference without requiring model retraining, (2) an information-theoretic recruitment mechanism that adaptively allocates semantic blocks as needed, eliminating the requirement for complete domain knowledge at initialization, and (3) a hierarchical recruitment framework that extends capacity allocation to entire specialized LLMs, enabling multi-granularity architectural adaptation. This is achieved through group sparsity penalties on attention mechanisms, information-theoretic anchor design, dynamic rule injection, and principled recruitment criteria based on penalized likelihood with explicit units. We provide rigorous mathematical results establishing explicit threshold conditions under which attention provably concentrates on semantically relevant blocks at stationary points, with exact bounds on attention entropy and pointer fidelity. The hierarchical recruitment mechanism provides convergence guarantees at both the block level (fine-grained, within-LLM) and the LLM level (coarse-grained, cross-domain), ensuring the system discovers semantic partitions that balance model complexity against data encoding efficiency. This framework enables practitioners to continuously interpolate between interpretable and high-performance modes while adapting architectural capacity at multiple granularities, supporting applications in regulated domains requiring both transparency and capability.
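One way to read the "locality dial" is as a scalar coefficient on a group-sparsity (group-lasso) penalty over attention weights, with groups given by semantic blocks; the abstract names group sparsity penalties on attention as the underlying mechanism. Below is a minimal PyTorch sketch under that reading. The function name group_sparsity_penalty, the block_ids encoding, and the exact norm structure are illustrative assumptions, not the paper's code.

import torch

def group_sparsity_penalty(attn, block_ids, lam):
    # attn: attention weights, shape (batch, heads, queries, keys)
    # block_ids: LongTensor of shape (keys,) mapping each key position
    #   to a semantic block
    # lam: the "locality dial"; lam = 0 leaves attention unconstrained
    #   (distributed), larger lam pushes mass onto few blocks (localist)
    num_blocks = int(block_ids.max().item()) + 1
    penalty = attn.new_zeros(())
    for b in range(num_blocks):
        group = attn[..., block_ids == b]
        # L2 norm within a block, summed (L1-style) across blocks: the
        # classic group-lasso structure, which suppresses whole blocks
        # rather than individual attention entries. (A sqrt-of-group-size
        # weight is often added; omitted here for brevity.)
        penalty = penalty + group.pow(2).sum().sqrt()
    return lam * penalty

# Hypothetical use in a training step:
#   loss = task_loss + group_sparsity_penalty(attn, block_ids, lam)

This sketch covers only the training-time penalty; how the dial acts at inference (for example, by sharpening or masking attention toward block anchors) is a further mechanism the abstract alludes to but this sketch does not model.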
Author: Joachim Diederich
Submitted: October 20, 2025
arXiv Category: cs.LG

Key Contributions

Introduces a framework for training LLMs whose internal representations can be tuned continuously between localist (interpretable) and distributed (efficient) encodings via a 'locality dial', paired with an information-theoretic recruitment mechanism that allocates semantic blocks on demand. Together these give dynamic control over the degree of localization and over capacity allocation without retraining.
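The phrase "penalized likelihood with explicit units" suggests an MDL/BIC-style recruitment test: add a new semantic block only when the gain in data log-likelihood outweighs the cost of encoding the extra parameters, with both sides measured in the same units (nats). The sketch below is one plausible instance under that assumption; the BIC penalty and all names are illustrative, not taken from the paper.

import math

def should_recruit_block(delta_loglik_nats, num_new_params, num_tokens):
    # delta_loglik_nats: improvement in total data log-likelihood (nats)
    #   that the candidate block would provide
    # num_new_params: number of parameters the block would add
    # num_tokens: number of observations the likelihood is summed over
    # BIC-style complexity cost, also in nats, so the comparison is
    # between quantities carrying the same explicit units
    complexity_cost_nats = 0.5 * num_new_params * math.log(num_tokens)
    return delta_loglik_nats > complexity_cost_nats

The hierarchical variant described in the abstract would apply the same comparison at two granularities: within an LLM, to decide whether to recruit another semantic block, and across the system, to decide whether to recruit an entire specialized LLM, with a correspondingly larger parameter count entering the penalty.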

Business Value

Enables more transparent and adaptable AI systems that retain high performance, which is valuable for regulated industries and other applications where explainability is required.