Research Paper. Audience: Robotics Researchers, AI Researchers, Embodied AI Developers, Autonomous Systems Engineers

Towards Reliable Code-as-Policies: A Neuro-Symbolic Framework for Embodied Task Planning

Abstract

Recent advances in large language models (LLMs) have enabled the automatic generation of executable code for task planning and control in embodied agents such as robots, demonstrating the potential of LLM-based embodied intelligence. However, these LLM-based code-as-policies approaches often suffer from limited environmental grounding, particularly in dynamic or partially observable settings, leading to suboptimal task success rates due to incorrect or incomplete code generation. In this work, we propose a neuro-symbolic embodied task planning framework that incorporates explicit symbolic verification and interactive validation processes during code generation. In the validation phase, the framework generates exploratory code that actively interacts with the environment to acquire missing observations while preserving task-relevant states. This integrated process enhances the grounding of generated code, resulting in improved task reliability and success rates in complex environments. We evaluate our framework on RLBench and in real-world settings across dynamic, partially observable scenarios. Experimental results demonstrate that our framework improves task success rates by 46.2% over Code-as-Policies baselines and attains over 86.8% executability of task-relevant actions, thereby enhancing the reliability of task planning in dynamic environments.
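The validation phase described in the abstract (exploratory code that acquires missing observations while preserving task-relevant states) can be illustrated with a toy example. This is a minimal sketch under stated assumptions, not the authors' implementation: `ToyEnv`, `explore_missing`, and the drawer scenario are all hypothetical names invented here for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class ToyEnv:
    """Hypothetical partially observable environment: a closed drawer hides its contents."""
    drawer_open: bool = False
    hidden_contents: str = "cup"
    log: list = field(default_factory=list)

    def open_drawer(self):
        self.drawer_open = True
        self.log.append("open_drawer")

    def close_drawer(self):
        self.drawer_open = False
        self.log.append("close_drawer")

    def look_in_drawer(self):
        # The contents are observable only while the drawer is open.
        return self.hidden_contents if self.drawer_open else None


def explore_missing(env, known, required):
    """Acquire observations missing from `known`, then undo any state change made to get them."""
    observations = dict(known)
    for key in required:
        if key in observations:
            continue  # already grounded, no exploration needed
        if key == "drawer_contents":
            was_open = env.drawer_open
            if not was_open:
                env.open_drawer()
            observations[key] = env.look_in_drawer()
            if not was_open:
                env.close_drawer()  # restore the task-relevant state
    return observations


env = ToyEnv()
obs = explore_missing(
    env,
    known={"gripper": "empty"},
    required=["gripper", "drawer_contents"],
)
print(obs, env.drawer_open, env.log)
```

In this sketch the exploratory step opens the drawer only when the observation is missing and closes it again afterwards, so the plan generated later starts from the same state the task began in.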

Authors (5)

Sanghyun Ahn

Wonje Choi

Junyong Lee

Jinwoo Park

Honguk Woo

Submitted

October 24, 2025

arXiv Category

cs.AI

Key Contributions

This paper introduces a neuro-symbolic framework for embodied task planning that enhances LLM-based code generation by incorporating explicit symbolic verification and interactive validation. This approach improves environmental grounding, leading to more reliable task execution in dynamic and partially observable settings.
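One common way to realize explicit symbolic verification is a STRIPS-style precondition check over a candidate plan before it is executed. The sketch below assumes that formulation; the summary does not say how the paper's verifier works, and `Action`, `verify_plan`, and the fact names are hypothetical.

```python
from typing import NamedTuple


class Action(NamedTuple):
    name: str
    pre: frozenset     # facts that must hold before the action runs
    add: frozenset     # facts the action makes true
    delete: frozenset  # facts the action makes false


def verify_plan(state, plan):
    """Simulate the plan symbolically; report the first step whose preconditions fail."""
    facts = set(state)
    for i, act in enumerate(plan):
        unmet = act.pre - facts
        if unmet:
            return False, (i, act.name, unmet)
        facts = (facts - act.delete) | act.add
    return True, None


PICK = Action(
    "pick(cup)",
    pre=frozenset({"hand_empty", "visible(cup)"}),
    add=frozenset({"holding(cup)"}),
    delete=frozenset({"hand_empty"}),
)
PLACE = Action(
    "place(cup, shelf)",
    pre=frozenset({"holding(cup)"}),
    add=frozenset({"on(cup, shelf)", "hand_empty"}),
    delete=frozenset({"holding(cup)"}),
)

# The cup has not been observed yet, so the first step is rejected.
ok, failure = verify_plan({"hand_empty"}, [PICK, PLACE])

# Once the observation is available, the same plan passes.
ok2, failure2 = verify_plan({"hand_empty", "visible(cup)"}, [PICK, PLACE])
```

A failed check names the unmet facts (here `visible(cup)`), which is the kind of signal that could trigger an exploratory interaction to obtain the missing observation.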

Business Value

Enables more robust and reliable autonomous agents (e.g., robots) capable of performing complex tasks in real-world, dynamic environments, reducing errors and increasing operational efficiency.

Paper Metadata

Innovation Type

Algorithmic/Framework

Deployment Feasibility

Moderate to High. Requires integration with robotic hardware and simulation environments, but the framework itself is designed for practical application.

Limitations Addressed

Limited environmental grounding and suboptimal task success rates in LLM-based code-as-policies for embodied agents, especially in dynamic or partially observable environments.

Performance Gains

Task success rates improved by 46.2% over Code-as-Policies baselines; over 86.8% executability of task-relevant actions (figures from the abstract)

Technical Tags

Neuro-Symbolic AI, Embodied Agents, Task Planning, Large Language Models (LLMs), Code Generation, Symbolic Verification, Interactive Validation, Environmental Grounding, Robotics, Dynamic Environments

Research Topics

Embodied AI, Robotics Planning, Neuro-Symbolic Reasoning, LLM Grounding, Agent Control

Methods & Architectures

Neuro-Symbolic Framework, Symbolic Verification, Interactive Validation, Code Generation, Reinforcement Learning (implied), Large Language Models (LLMs), Neuro-Symbolic Architectures

Applications & Tasks

Robotics, Embodied AI, Autonomous Systems; Limited Environmental Grounding, Suboptimal Task Success Rates, Incorrect Code Generation, Dynamic Environments, Partially Observable Settings; Embodied Task Planning, Code Generation for Agents, Improving Task Reliability, Acquiring Missing Observations

Related Fields

Artificial Intelligence, Robotics, Cognitive Science, Symbolic AI, Machine Learning

Keywords

Embodied AI, Robotics, Task Planning, LLMs, Neuro-Symbolic, Code Generation, Verification, Validation, Environmental Grounding, Autonomous Agents, Dynamic Environments, Partially Observable, Reliability

Academic Context

Embodied AI, Robotics Planning, Neuro-Symbolic Reasoning, LLM Grounding, Agent Control

Commercial Potential

Potential Products

Autonomous robot control systems, Intelligent agent frameworks

Target Industries

Robotics, Logistics, Manufacturing, Exploration

Use Case Examples

Robots performing complex assembly tasks in dynamic factories; autonomous drones navigating and collecting data in unknown environments

Competitive Edge

Offers a more robust and grounded approach compared to pure LLM-based code generation, particularly for tasks requiring interaction with physical environments.

Market Opportunity

Growing market for autonomous systems and intelligent robotics.

Revenue Models

Licensing of the framework; development of specialized agent solutions.

Resource Requirements

Compute Needs

Significant compute for LLM inference and potentially for simulation/verification.

Data Requirements

Requires data from simulated or real-world environments for training and validation.

Deployment Constraints

Integration with physical hardware, real-time processing needs, safety considerations.

Scalability

Scalability depends on the complexity of tasks and environments; neuro-symbolic integration might add overhead.

Regulatory Considerations

Safety standards for autonomous systems; data privacy if interacting with humans.

Production Readiness

Maturity Level

Research/Prototype

Time to Market

2-4 years for robust commercial deployment.

Patent Potential

Moderate, for the specific neuro-symbolic framework and validation techniques.
