Research Paper for: Robotics engineers, AR/VR developers, computer vision researchers, 3D graphics specialists

GS2POSE: Marry Gaussian Splatting to 6D Object Pose Estimation

Abstract

Accurate 6D pose estimation of 3D objects is a fundamental task in computer vision, and current research typically predicts the 6D pose by establishing correspondences between 2D image features and 3D model features. However, these methods often face difficulties with textureless objects and varying illumination conditions. To overcome these limitations, we propose GS2POSE, a novel approach for 6D object pose estimation. GS2POSE formulates a pose regression algorithm inspired by the principles of Bundle Adjustment (BA). By leveraging Lie algebra, we extend the capabilities of 3D Gaussian Splatting (3DGS) to develop a pose-differentiable rendering pipeline, which iteratively optimizes the pose by comparing the input image to the rendered image. Additionally, GS2POSE updates color parameters within the 3DGS model, enhancing its adaptability to changes in illumination. Compared to previous models, GS2POSE demonstrates accuracy improvements of 1.4%, 2.8% and 2.5% on the T-LESS, LineMod-Occlusion and LineMod datasets, respectively.
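The abstract's use of Lie algebra for pose updates can be illustrated with the standard SE(3) exponential map, which turns an unconstrained 6-vector (a twist) into a valid rotation and translation. The sketch below is a generic textbook construction in plain Python, not the paper's code; the function names (hat, se3_exp, left_update) and the left-multiplication convention are assumptions made for illustration.

```python
import math

def hat(w):
    """Map a 3-vector to its skew-symmetric matrix (an element of so(3))."""
    wx, wy, wz = w
    return [[0.0, -wz, wy], [wz, 0.0, -wx], [-wy, wx, 0.0]]

def matmul(a, b):
    """3x3 matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def matvec(a, v):
    """3x3 matrix times 3-vector."""
    return [sum(a[i][k] * v[k] for k in range(3)) for i in range(3)]

def se3_exp(xi):
    """Exponential map: twist xi = (rho, omega) -> rotation R, translation t."""
    rho, omega = xi[:3], xi[3:]
    theta = math.sqrt(sum(c * c for c in omega))
    K = hat(omega)
    K2 = matmul(K, K)
    if theta < 1e-8:
        # Small-angle limits of the coefficients below.
        a, b, c = 1.0, 0.5, 1.0 / 6.0
    else:
        a = math.sin(theta) / theta
        b = (1.0 - math.cos(theta)) / theta ** 2
        c = (theta - math.sin(theta)) / theta ** 3
    eye = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
    # Rodrigues' formula for the rotation, and the matrix V for the translation.
    R = [[eye[i][j] + a * K[i][j] + b * K2[i][j] for j in range(3)] for i in range(3)]
    V = [[eye[i][j] + b * K[i][j] + c * K2[i][j] for j in range(3)] for i in range(3)]
    return R, matvec(V, rho)

def left_update(xi, R, t):
    """Apply the increment T <- exp(xi) * T (assumed convention)."""
    dR, dt = se3_exp(xi)
    return matmul(dR, R), [x + y for x, y in zip(matvec(dR, t), dt)]

# Start from a 90-degree rotation about z, then apply a small corrective twist.
R0, t0 = se3_exp([0.0, 0.0, 0.0, 0.0, 0.0, math.pi / 2])
R1, t1 = left_update([0.01, 0.0, 0.0, 0.0, 0.0, 0.02], R0, [0.0, 0.0, 1.0])
```

Because the update is expressed as a twist, an optimizer can step freely in six dimensions while the resulting rotation stays orthonormal, which is the usual motivation for this parameterization in pose optimization.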

Authors (7)

Junbo Li

Weimin Yuan

Yinuo Wang

Yue Zeng

Shihao Shu

Cai Meng

+1 more

Submitted

October 19, 2025

arXiv Category

cs.CV

Key Contributions

GS2POSE integrates Gaussian Splatting with pose estimation by formulating a Bundle Adjustment-inspired regression algorithm using Lie algebra. It enables a pose-differentiable rendering pipeline that iteratively optimizes pose and updates color parameters, leading to improved accuracy and robustness, especially for textureless objects and varying lighting conditions.
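The iterative "render, compare, update the pose" loop described above can be sketched on a deliberately small problem. In this toy version the renderer is a hypothetical stand-in that rigidly transforms three 2D points, and the loss compares point positions rather than pixels; the paper instead renders a 3DGS model and compares images. All names and the step size are assumptions chosen for illustration.

```python
import math

def render(theta, tx, ty, model):
    """Stand-in renderer: rotate the 2D model by theta and shift by (tx, ty)."""
    c, s = math.cos(theta), math.sin(theta)
    return [(c * x - s * y + tx, s * x + c * y + ty) for x, y in model]

def refine(observed, model, theta=0.0, tx=0.0, ty=0.0, lr=0.1, steps=200):
    """Gradient descent on the mean squared error between render and observation."""
    n = len(model)
    for _ in range(steps):
        c, s = math.cos(theta), math.sin(theta)
        g_theta = g_tx = g_ty = 0.0
        rendered = render(theta, tx, ty, model)
        for (x, y), (u, v), (ou, ov) in zip(model, rendered, observed):
            ru, rv = u - ou, v - ov                 # residual of this point
            du, dv = -s * x - c * y, c * x - s * y  # d(rendered point)/d(theta)
            g_theta += 2.0 * (ru * du + rv * dv) / n
            g_tx += 2.0 * ru / n
            g_ty += 2.0 * rv / n
        # Step against the gradient of the loss.
        theta -= lr * g_theta
        tx -= lr * g_tx
        ty -= lr * g_ty
    return theta, tx, ty

# A centered triangle observed under an unknown pose (theta=0.5, t=(0.3, -0.2)).
model = [(2.0, 0.0), (-1.0, 1.0), (-1.0, -1.0)]
observed = render(0.5, 0.3, -0.2, model)
theta_hat, tx_hat, ty_hat = refine(observed, model)
print(theta_hat, tx_hat, ty_hat)  # should approach 0.5, 0.3, -0.2
```

The same structure carries over to image-based refinement: only the renderer and the residual change, while the pose is still updated from the gradient of a comparison loss.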

Business Value

Enhances the reliability of object recognition and manipulation for robots and AR/VR systems, enabling more precise interaction with the physical world even in challenging visual conditions.

Paper Metadata

Innovation Type

Algorithmic/Integration

Deployment Feasibility

Moderate. Gaussian Splatting can be computationally intensive, but advancements are making it more feasible for real-time applications.

Limitations Addressed

Difficulties with textureless objects in current pose estimation methods, sensitivity to varying illumination conditions, limitations in establishing 2D-3D correspondences

Performance Gains

Accuracy improvements of 1.4% on T-LESS, 2.8% on LineMod-Occlusion, and 2.5% on LineMod compared to previous models.

Technical Tags

6D object pose estimation, Gaussian Splatting, Bundle Adjustment, Lie algebra, differentiable rendering, textureless objects, illumination invariance, pose regression

Research Topics

3D Computer Vision, Pose Estimation, Rendering, Robotics, Augmented Reality

Methods & Architectures

3D Gaussian Splatting (3DGS), Bundle Adjustment (BA), Lie Algebra, Pose Regression, Differentiable Rendering, GS2POSE

Applications & Tasks

Robotics, Augmented Reality (AR), Virtual Reality (VR), Autonomous Driving, 6D Pose Estimation, Object Recognition, Rendering Optimization, Estimating the 6D pose (position and orientation) of 3D objects, Improving robustness to textureless objects and varying illumination

Datasets & Benchmarks

Datasets

T-LESS, LineMod-Occlusion, LineMod

Benchmarks

Accuracy improvements of 1.4% on T-LESS • 2.8% on LineMod-Occlusion • 2.5% on ...

Pose accuracy, ADD(-S) metric
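For readers unfamiliar with ADD(-S), the following is a minimal sketch of the two distances as they are commonly defined: ADD averages the distance between each model point under the ground-truth and estimated poses, while ADD-S (used for symmetric objects) averages the distance to the nearest transformed point instead. The function names and the toy point set are assumptions for illustration; the paper's exact evaluation protocol and thresholds are not stated in this summary.

```python
import math

def transform(R, t, pts):
    """Apply the rigid transform x -> R x + t to a list of 3D points."""
    return [tuple(sum(R[i][k] * p[k] for k in range(3)) + t[i] for i in range(3))
            for p in pts]

def add_metric(R_gt, t_gt, R_est, t_est, pts):
    """ADD: mean distance between corresponding transformed model points."""
    a = transform(R_gt, t_gt, pts)
    b = transform(R_est, t_est, pts)
    return sum(math.dist(x, y) for x, y in zip(a, b)) / len(pts)

def add_s_metric(R_gt, t_gt, R_est, t_est, pts):
    """ADD-S: mean distance from each ground-truth point to its nearest estimate."""
    a = transform(R_gt, t_gt, pts)
    b = transform(R_est, t_est, pts)
    return sum(min(math.dist(x, y) for y in b) for x in a) / len(pts)

# Toy object that looks identical after a 180-degree turn about the z axis.
pts = [(1.0, 0.0, 0.0), (-1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, -1.0, 0.0)]
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
rot_z_180 = [[-1.0, 0.0, 0.0], [0.0, -1.0, 0.0], [0.0, 0.0, 1.0]]
zero = (0.0, 0.0, 0.0)

print(add_metric(identity, zero, rot_z_180, zero, pts))    # 2.0
print(add_s_metric(identity, zero, rot_z_180, zero, pts))  # 0.0
```

The example shows why the symmetric variant exists: the estimated pose is visually indistinguishable from the ground truth, yet ADD penalizes it heavily while ADD-S reports no error. In common practice a pose is counted as correct when the distance falls below a fraction of the object diameter, often 10%.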

Related Fields

Robotics, Augmented Reality, Computer Graphics, 3D Reconstruction, Machine Learning

Keywords

6D pose estimation, Gaussian Splatting, Bundle Adjustment, Lie algebra, differentiable rendering, textureless objects, illumination, robotics, AR, VR, computer vision, 3D object recognition

Academic Context

#3D Computer Vision, #Pose Estimation, #Rendering, #Robotics, #Augmented Reality

Commercial Potential

Potential Products

Robotic manipulation systems, AR/VR interaction platforms, 3D scanning and modeling tools

Target Industries

Robotics, Manufacturing, Logistics, Gaming, Retail

Use Case Examples

Robots accurately grasping objects in cluttered environments, AR applications precisely overlaying virtual objects onto real-world objects, automated quality inspection in manufacturing

Competitive Edge

Offers a novel integration of Gaussian Splatting with pose estimation that targets textureless objects and illumination variations, with reported accuracy gains of 1.4%, 2.8% and 2.5% over previous models on T-LESS, LineMod-Occlusion and LineMod.

Market Opportunity

Growing demand for precise 3D understanding in robotics and AR/VR.

Revenue Models

Software licensing, specialized hardware integration

Resource Requirements

Compute Needs

High (especially for rendering and optimization)

Data Requirements

RGB-D images with ground-truth 6D poses.

Deployment Constraints

Real-time performance can be challenging due to the complexity of Gaussian Splatting and optimization.

Scalability

Scalability depends on the efficiency of the Gaussian Splatting implementation and optimization process.

Production Readiness

Maturity Level

Research

Time to Market

2-4 years

Patent Potential

Low
