Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
This work demonstrates that Reinforcement Learning significantly enhances the capabilities of LLM agents for long-horizon, multi-turn tasks, outperforming prompt-based approaches. The RL-trained agent achieved higher accuracy on a legal document search benchmark, highlighting the benefits of learning from experience.
Enables more sophisticated and efficient AI agents for complex information retrieval and task completion, particularly in specialized domains like legal research.