When Does Closeness in Distribution Imply Representational Similarity? An Identifiability Perspective

Abstract

When and why representations learned by different deep neural networks are similar is an active research topic. We choose to address these questions from the perspective of identifiability theory, which suggests that a measure of representational similarity should be invariant to transformations that leave the model distribution unchanged. Focusing on a model family which includes several popular pre-training approaches, e.g., autoregressive language models, we explore when models which generate distributions that are close have similar representations. We prove that a small Kullback–Leibler divergence between the model distributions does not guarantee that the corresponding representations are similar. This has the important corollary that models with near-maximum data likelihood can still learn dissimilar representations, a phenomenon mirrored in our experiments with models trained on CIFAR-10. We then define a distributional distance for which closeness implies representational similarity, and in synthetic experiments, we find that wider networks learn distributions which are closer with respect to our distance and have more similar representations. Our results thus clarify the link between closeness in distribution and representational similarity.
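To make the underlying invariance concrete, here is a minimal sketch assuming the log-linear (softmax-readout) model family common in this literature; this page does not restate the paper's exact model family:

p_\theta(y \mid x) = \frac{\exp\left(u_\theta(y)^\top h_\theta(x)\right)}{\sum_{y'} \exp\left(u_\theta(y')^\top h_\theta(x)\right)}.

For any invertible matrix M, the substitution h_\theta \mapsto M h_\theta, u_\theta \mapsto M^{-\top} u_\theta leaves every conditional, and hence the KL divergence between models, unchanged, while the representation changes by an arbitrary linear map.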
Authors (4): Beatrix M. G. Nielsen, Emanuele Marconato, Andrea Dittadi, Luigi Gresele
Submitted: June 4, 2025
arXiv Category: cs.LG
arXiv PDF

Key Contributions

Investigates, from an identifiability perspective, when distributional closeness (e.g., a small KL divergence) between deep neural networks implies representational similarity. The paper proves that a small KL divergence between model distributions does not guarantee similar representations, with the corollary that models achieving near-maximum data likelihood can still learn dissimilar ones, and it defines a distributional distance for which closeness does imply representational similarity.
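As a complement, here is a minimal runnable Python sketch of this non-identifiability (not the paper's construction; the dimensions, the map M, and the choice of linear CKA as the similarity measure are illustrative assumptions). It builds two models whose output distributions coincide exactly while their representations differ by an arbitrary invertible linear map:

import numpy as np

rng = np.random.default_rng(0)
n, d, k = 200, 16, 10            # samples, representation dim, classes (hypothetical sizes)

H = rng.normal(size=(n, d))      # representations of model A
U = rng.normal(size=(k, d))      # unembedding of model A

M = rng.normal(size=(d, d))      # arbitrary map, invertible with probability 1
H2 = H @ M.T                     # representations of model B
U2 = U @ np.linalg.inv(M)        # compensating unembedding of model B

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

P1 = softmax(H @ U.T)            # model A's conditionals
P2 = softmax(H2 @ U2.T)          # model B's: identical logits, since H2 @ U2.T == H @ U.T

def linear_cka(X, Y):
    # Centered kernel alignment with linear kernels; 1.0 means identical
    # up to rotation and isotropic scaling.
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    num = np.linalg.norm(X.T @ Y, "fro") ** 2
    return num / (np.linalg.norm(X.T @ X, "fro") * np.linalg.norm(Y.T @ Y, "fro"))

kl = np.mean(np.sum(P1 * (np.log(P1) - np.log(P2)), axis=1))
print(f"mean KL(P1 || P2) = {kl:.2e}")                  # numerically zero: distributions match
print(f"linear CKA(H, H2) = {linear_cka(H, H2):.3f}")   # typically well below 1: representations differ

Under this toy setup, maximizing likelihood cannot distinguish the two models, which is the sense in which near-maximum likelihood leaves representational similarity unconstrained.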

Business Value

Provides fundamental theoretical insight into when models that are close in distribution also share representations, which can guide the development of more robust, interpretable, and reliable AI models and help explain why similarly trained models can behave differently.