Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: Recent advances in large language models (LLMs) have enabled the automatic
generation of executable code for task planning and control in embodied agents
such as robots, demonstrating the potential of LLM-based embodied intelligence.
However, these LLM-based code-as-policies approaches often suffer from limited
environmental grounding, particularly in dynamic or partially observable
settings, leading to suboptimal task success rates due to incorrect or
incomplete code generation. In this work, we propose a neuro-symbolic embodied
task planning framework that incorporates explicit symbolic verification and
interactive validation processes during code generation. In the validation
phase, the framework generates exploratory code that actively interacts with
the environment to acquire missing observations while preserving task-relevant
states. This integrated process enhances the grounding of generated code,
resulting in improved task reliability and success rates in complex
environments. We evaluate our framework on RLBench and in real-world settings
across dynamic, partially observable scenarios. Experimental results
demonstrate that our framework improves task success rates by 46.2% over
Code-as-Policies baselines and attains over 86.8% executability of
task-relevant actions, thereby enhancing the reliability of task planning in
dynamic environments.
Authors (5)
Sanghyun Ahn
Wonje Choi
Junyong Lee
Jinwoo Park
Honguk Woo
Submitted
October 24, 2025
Key Contributions
This paper introduces a neuro-symbolic framework for embodied task planning that enhances LLM-based code generation by incorporating explicit symbolic verification and interactive validation. This approach improves environmental grounding, leading to more reliable task execution in dynamic and partially observable settings.
Business Value
Enables more robust and reliable autonomous agents (e.g., robots) capable of performing complex tasks in real-world, dynamic environments, reducing errors and increasing operational efficiency.