Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ai 92% Match Research Paper AI Researchers,Robotics Engineers,HCI Researchers,Game AI Developers 1 week ago

Partner Modelling Emerges in Recurrent Agents (But Only When It Matters)

reinforcement-learning › multi-agent
📄 Abstract

Abstract: Humans are remarkably adept at collaboration, able to infer the strengths and weaknesses of new partners in order to work successfully towards shared goals. To build AI systems with this capability, we must first understand its building blocks: does such flexibility require explicit, dedicated mechanisms for modelling others -- or can it emerge spontaneously from the pressures of open-ended cooperative interaction? To investigate this question, we train simple model-free RNN agents to collaborate with a population of diverse partners. Using the `Overcooked-AI' environment, we collect data from thousands of collaborative teams, and analyse agents' internal hidden states. Despite a lack of additional architectural features, inductive biases, or auxiliary objectives, the agents nevertheless develop structured internal representations of their partners' task abilities, enabling rapid adaptation and generalisation to novel collaborators. We investigated these internal models through probing techniques, and large-scale behavioural analysis. Notably, we find that structured partner modelling emerges when agents can influence partner behaviour by controlling task allocation. Our results show that partner modelling can arise spontaneously in model-free agents -- but only under environmental conditions that impose the right kind of social pressure.
Authors (8)
Ruaridh Mon-Williams
Max Taylor-Davies
Elizabeth Mieczkowski
Natalia Velez
Neil R. Bramley
Yanwei Wang
+2 more
Submitted
May 22, 2025
arXiv Category
cs.AI
arXiv PDF

Key Contributions

Demonstrates that partner modeling (inferring collaborators' task abilities) can emerge spontaneously in simple model-free RNN agents trained for open-ended cooperative interaction in the `Overcooked-AI` environment. Agents develop structured internal representations of partners, enabling rapid adaptation and generalization to novel collaborators without explicit modeling mechanisms.

Business Value

Crucial for developing AI systems that can effectively collaborate with humans and other AI agents in complex, dynamic environments, leading to more intuitive human-AI teams and efficient multi-agent coordination.