Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ai 92% Match Research Paper AI Researchers,ML Engineers,NLP Practitioners,Knowledge Engineers 17 hours ago

ExplicitLM: Decoupling Knowledge from Parameters via Explicit Memory Banks

large-language-models › model-architecture
📄 Abstract

Abstract: Large language models suffer from knowledge staleness and lack of interpretability due to implicit knowledge storage across entangled network parameters, preventing targeted updates and reasoning transparency. We propose ExplicitLM, a novel architecture featuring a million-scale external memory bank storing human-readable knowledge as token sequences, enabling direct inspection and modification. We design a differentiable two-stage retrieval mechanism with efficient coarse-grained filtering via product key decomposition (reducing complexity from $\mathcal{O}(N \cdot |I|)$ to $\mathcal{O}(\sqrt{N} \cdot |I|)$) and fine-grained Gumbel-Softmax matching for end-to-end training. Inspired by dual-system cognitive theory, we partition knowledge into frozen explicit facts (20%) and learnable implicit patterns (80%), maintained through Exponential Moving Average updates for stability. ExplicitLM achieves up to 43.67% improvement on knowledge-intensive tasks versus standard Transformers, with 3.62$\times$ gains in low-data regimes (10k samples). Analysis shows strong correlations between memory retrieval and performance, with correct predictions achieving 49% higher hit rates. Unlike RAG systems with frozen retrieval, our jointly optimized architecture demonstrates that interpretable, updatable models can maintain competitive performance while providing unprecedented knowledge transparency.

Key Contributions

ExplicitLM decouples knowledge from parameters using a million-scale external memory bank for human-readable knowledge. It features a differentiable two-stage retrieval mechanism with efficient product key decomposition and Gumbel-Softmax matching, achieving significant improvements on knowledge-intensive tasks and enabling targeted updates.

Business Value

Enables more dynamic and transparent knowledge management in AI systems, allowing for easier updates and better explainability. This is crucial for applications requiring up-to-date and verifiable information, such as expert systems or factual QA.