Abstract
Transformer components such as non-linear activations and normalization are
inherently non-injective, suggesting that different inputs could map to the
same output and prevent exact recovery of the input from a model's
representations. In this paper, we challenge this view. First, we prove
mathematically that transformer language models mapping discrete input
sequences to their corresponding sequence of continuous representations are
injective and therefore lossless, a property established at initialization and
preserved during training. Second, we confirm this result empirically through
billions of collision tests on six state-of-the-art language models, and
observe no collisions. Third, we operationalize injectivity: we introduce
SipIt, the first algorithm that provably and efficiently reconstructs the exact
input text from hidden activations, establishing linear-time guarantees and
demonstrating exact invertibility in practice. Overall, our work establishes
injectivity as a fundamental and exploitable property of language models, with
direct implications for transparency, interpretability, and safe deployment.
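The collision tests mentioned above can be pictured with a short sketch: feed distinct prompts through a model, turn each prompt's hidden representation into a hashable key, and count how often two different prompts share a key. The model name, prompt set, and rounding scale below are illustrative assumptions, not the paper's exact protocol or scale.

```python
# Minimal collision-test sketch (illustrative; the paper's protocol differs in
# scale and model choice). Requires: pip install torch transformers
import torch
from transformers import AutoModel, AutoTokenizer

MODEL_NAME = "gpt2"  # stand-in; the paper evaluates six state-of-the-art LMs

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModel.from_pretrained(MODEL_NAME).eval()

prompts = ["the cat sat", "the cat sat.", "a dog ran", "a dog ran fast"]

seen: dict[tuple, str] = {}
collisions = 0
with torch.no_grad():
    for text in prompts:
        ids = tokenizer(text, return_tensors="pt")
        # Last-layer hidden state at the final position serves as the prompt's
        # representation; scaling and rounding makes it hashable for comparison.
        hidden = model(**ids).last_hidden_state[0, -1]
        key = tuple((hidden * 1e6).round().tolist())
        if key in seen and seen[key] != text:
            collisions += 1
        seen[key] = text

print(f"distinct prompts: {len(prompts)}, collisions observed: {collisions}")
```

Injectivity predicts the collision counter stays at zero; the paper reports exactly that across billions of such comparisons on six models.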
Authors (6)
Giorgos Nikolaou
Tommaso Mencattini
Donato Crisostomi
Andrea Santilli
Yannis Panagakis
Emanuele Rodolà
Submitted
October 17, 2025
Key Contributions
This paper mathematically proves that transformer language models are injective and thus lossless, a property preserved during training. It empirically validates this through extensive collision tests and introduces SipIt, an algorithm that efficiently reconstructs the exact input text from hidden activations, demonstrating practical invertibility. This work establishes injectivity as a fundamental property of transformers, challenging prior assumptions about information loss.
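To illustrate what exact invertibility means in practice, the sketch below recovers a prompt token by token: because the map from prefixes to hidden states is injective, at each step exactly one vocabulary item reproduces the observed activation. This is a naive exhaustive search, not the SipIt algorithm (whose efficient linear-time procedure is given in the paper); the model name and matching tolerance are assumptions.

```python
# Naive token-by-token inversion sketch -- NOT the paper's SipIt algorithm,
# just a brute-force illustration of injectivity. Requires torch + transformers.
import torch
from transformers import AutoModel, AutoTokenizer

MODEL_NAME = "gpt2"  # assumption; any causal LM exposing hidden states works
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModel.from_pretrained(MODEL_NAME).eval()


def last_hidden(ids: list[int]) -> torch.Tensor:
    """Last-layer hidden state at the final position for a token-id prefix."""
    with torch.no_grad():
        out = model(torch.tensor([ids]))
    return out.last_hidden_state[0, -1]


def recover(observed: list[torch.Tensor]) -> list[int]:
    """Recover the token sequence whose per-step hidden states match the
    observed ones. Exhaustive over the vocabulary, so O(T * |V|) forward
    passes -- far slower than SipIt's linear-time guarantee."""
    prefix: list[int] = []
    for target in observed:
        for cand in range(len(tokenizer)):
            if torch.allclose(last_hidden(prefix + [cand]), target, atol=1e-6):
                prefix.append(cand)
                break
    return prefix


# Usage: record the hidden state after each prefix of a "secret" prompt, then
# hand only those activations to `recover` and check the text comes back.
secret = tokenizer("the cat sat", return_tensors="pt").input_ids[0].tolist()
observed = [last_hidden(secret[: i + 1]) for i in range(len(secret))]
assert tokenizer.decode(recover(observed)) == tokenizer.decode(secret)
```

The inner loop over the full vocabulary is what SipIt avoids; the sketch only demonstrates that, under injectivity, the hidden activations determine the input uniquely.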
Business Value
Understanding the lossless nature of transformers can lead to more reliable information extraction and reconstruction from model representations, potentially improving downstream applications that rely on precise input recovery.