arxiv_ai 95% Match Research Paper Robotics Engineers,AI Researchers,Control Systems Engineers,Autonomous Systems Developers 1 week ago

Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments

robotics › navigation

📄 Abstract

Abstract: This paper presents a hierarchical path-planning and control framework that combines a high-level Deep Q-Network (DQN) for discrete sub-goal selection with a low-level Twin Delayed Deep Deterministic Policy Gradient (TD3) controller for continuous actuation. The high-level module selects behaviors and sub-goals; the low-level module executes smooth velocity commands. We design a practical reward shaping scheme (direction, distance, obstacle avoidance, action smoothness, collision penalty, time penalty, and progress), together with a LiDAR-based safety gate that prevents unsafe motions. The system is implemented in ROS + Gazebo (TurtleBot3) and evaluated with PathBench metrics, including success rate, collision rate, path efficiency, and re-planning efficiency, in dynamic and partially observable environments. Experiments show improved success rate and sample efficiency over single-algorithm baselines (DQN or TD3 alone) and rule-based planners, with better generalization to unseen obstacle configurations and reduced abrupt control changes. Code and evaluation scripts are available at the project repository.

Authors (4)

Xiaoyi He

Danggui Chen

Zhenshuo Zhang

Zimeng Bai

Submitted

October 30, 2025

arXiv Category

cs.RO

arXiv PDF

Key Contributions

Presents a hierarchical path-planning and control framework combining DQN for sub-goal selection and TD3 for continuous control, achieving improved success rates and sample efficiency in dynamic environments. It incorporates a practical reward shaping scheme and LiDAR-based safety.

Business Value

Enables more robust and efficient autonomous navigation for robots, reducing the need for manual intervention and improving operational reliability in complex environments.

Paper Metadata

Innovation Type

Algorithmic Framework

Deployment Feasibility

High, demonstrated in ROS + Gazebo simulation with TurtleBot3, indicating practical implementation potential.

Limitations Addressed

Challenges in autonomous navigation in dynamic and partially observable environments, and limitations of single-algorithm approaches.

Performance Gains

Improved success rate and sample efficiency over single-algorithm baselines (DQN, TD3) and rule-based planners, with better generalization.

Technical Tags

Autonomous NavigationReinforcement LearningHierarchical ControlDQNTD3Path PlanningDynamic EnvironmentsROSGazebo

Research Topics

Robotics NavigationReinforcement LearningHierarchical Control SystemsMotion PlanningAutonomous Systems

Methods & Architectures

Hierarchical DQN-TD3Reward ShapingLiDAR-based Safety GateROS + Gazebo Simulation Deep Q-Network (DQN)Twin Delayed Deep Deterministic Policy Gradient (TD3)

Applications & Tasks

Robotics Autonomous Vehicles Mobile Robots Simulation Environments Navigation in Dynamic EnvironmentsPath PlanningContinuous ControlSample Efficiency Autonomous navigation for mobile robotsHierarchical path-planning and controlSafe navigation in dynamic and partially observable environments

Datasets & Benchmarks

Benchmarks

PathBench metrics

Success RateCollision RatePath EfficiencyRe-planning EfficiencySample Efficiency

Related Fields

RoboticsReinforcement LearningControl SystemsArtificial IntelligenceSimulation

Keywords

Autonomous NavigationRoboticsReinforcement LearningDQNTD3Path PlanningDynamic EnvironmentsHierarchical ControlROSGazebo

Academic Context

#Robotics Navigation#Reinforcement Learning#Hierarchical Control Systems#Motion Planning#Autonomous Systems

Technology Stack

Frameworks & Libraries

ROSGazebo

Commercial Potential

Potential Products

Autonomous navigation software for mobile robotsRobotics simulation platforms

Target Industries

RoboticsLogisticsWarehousingManufacturingAutonomous Vehicles

Use Case Examples

Autonomous mobile robots in warehousesRobotic exploration in unknown environmentsDelivery robots

Competitive Edge

Offers a hybrid hierarchical approach that combines the strengths of DQN and TD3 for improved navigation performance over single-algorithm methods.

Market Opportunity

Growing market for autonomous mobile robots and navigation solutions.

Revenue Models

Licensing of navigation softwareintegration services.

Resource Requirements

Compute Needs

Moderate for simulation, potentially higher for real-world deployment depending on sensor processing.

Data Requirements

Requires simulation environments (ROS/Gazebo) and potentially real-world robot data.

Deployment Constraints

Robot hardware compatibility, sensor integration, real-time processing capabilities.

Scalability

Scalability depends on the complexity of the environment and the robot's capabilities.

Production Readiness

Maturity Level

Research/Prototype

Time to Market

2-3 years

Patent Potential

Moderate, for the hierarchical control framework and reward shaping scheme.

View Full Paper Back to Papers