Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_cl 97% Match Research Paper LLM researchers,AI researchers,Machine learning engineers,NLP practitioners 6 days ago

Scaling Latent Reasoning via Looped Language Models

large-language-models › reasoning
📄 Abstract

Abstract: Modern LLMs are trained to "think" primarily via explicit text generation, such as chain-of-thought (CoT), which defers reasoning to post-training and under-leverages pre-training data. We present and open-source Ouro, named after the recursive Ouroboros, a family of pre-trained Looped Language Models (LoopLM) that instead build reasoning into the pre-training phase through (i) iterative computation in latent space, (ii) an entropy-regularized objective for learned depth allocation, and (iii) scaling to 7.7T tokens. Ouro 1.4B and 2.6B models enjoy superior performance that match the results of up to 12B SOTA LLMs across a wide range of benchmarks. Through controlled experiments, we show this advantage stems not from increased knowledge capacity, but from superior knowledge manipulation capabilities. We also show that LoopLM yields reasoning traces more aligned with final outputs than explicit CoT. We hope our results show the potential of LoopLM as a novel scaling direction in the reasoning era. Our model could be found in: http://ouro-llm.github.io.
Authors (33)
Rui-Jie Zhu
Zixuan Wang
Kai Hua
Tianyu Zhang
Ziniu Li
Haoran Que
+27 more
Submitted
October 29, 2025
arXiv Category
cs.CL
arXiv PDF Code

Key Contributions

This paper introduces Ouro, a family of pre-trained Looped Language Models (LoopLM) that integrate reasoning into the pre-training phase via iterative latent space computation and an entropy-regularized objective for depth allocation. Ouro models (1.4B, 2.6B) achieve performance matching larger SOTA LLMs across benchmarks, demonstrating superior knowledge manipulation capabilities and more aligned reasoning traces compared to explicit CoT.

Business Value

Offers a more efficient and potentially more powerful way to build LLMs with strong reasoning abilities, which can lead to more capable AI assistants, better problem-solving tools, and more reliable information processing.

View Code on GitHub