Abstract
We introduce SigmaCollab, a dataset enabling research on physically situated
human-AI collaboration. The dataset consists of a set of 85 sessions in which
untrained participants were guided by a mixed-reality assistive AI agent in
performing procedural tasks in the physical world. SigmaCollab includes a set
of rich, multimodal data streams, such as the participant and system audio,
egocentric camera views from the head-mounted device, depth maps, head, hand
and gaze tracking information, as well as additional annotations performed
post-hoc. While the dataset is relatively small in size (~14 hours), its
application-driven and interactive nature brings to the fore novel research
challenges for human-AI collaboration, and provides more realistic testing
grounds for various AI models operating in this space. In future work, we plan
to use the dataset to construct a set of benchmarks for physically situated
collaboration in mixed-reality task assistive scenarios. SigmaCollab is
available at https://github.com/microsoft/SigmaCollab.
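
To make the multimodal structure of the sessions concrete, the sketch below shows one way such per-session streams (participant/system audio events, gaze, head and hand tracking) could be merged onto a common timeline. The folder layout, file names, and JSON field names are illustrative assumptions for this sketch, not the actual on-disk format documented in the SigmaCollab repository.

```python
# Minimal sketch (hypothetical layout): iterating over a SigmaCollab-style session
# whose per-stream recordings are assumed to live as JSON-lines files inside a
# per-session folder. All names below are assumptions, not the dataset's schema.
import json
from dataclasses import dataclass
from pathlib import Path
from typing import Iterator


@dataclass
class TimestampedEvent:
    """A single timestamped record from one modality stream."""
    timestamp_s: float   # seconds from session start (assumed convention)
    stream: str          # e.g. "participant_audio", "gaze", "head_pose"
    payload: dict        # modality-specific fields


def load_stream(session_dir: Path, stream: str) -> Iterator[TimestampedEvent]:
    """Yield events from one assumed JSON-lines file per modality stream."""
    stream_file = session_dir / f"{stream}.jsonl"   # hypothetical file name
    with stream_file.open() as f:
        for line in f:
            record = json.loads(line)
            yield TimestampedEvent(
                timestamp_s=record["timestamp_s"],  # assumed field name
                stream=stream,
                payload=record,
            )


def merge_streams(session_dir: Path, streams: list[str]) -> list[TimestampedEvent]:
    """Merge several modality streams into one time-ordered event list."""
    events: list[TimestampedEvent] = []
    for stream in streams:
        events.extend(load_stream(session_dir, stream))
    return sorted(events, key=lambda e: e.timestamp_s)


if __name__ == "__main__":
    # Example usage over one (hypothetical) session folder.
    session = Path("SigmaCollab/sessions/session_001")
    timeline = merge_streams(session, ["participant_audio", "system_audio", "gaze"])
    for event in timeline[:10]:
        print(f"{event.timestamp_s:8.3f}s  {event.stream}")
```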
Key Contributions
Introduces SigmaCollab, a novel dataset designed for research on physically situated human-AI collaboration. The dataset comprises multimodal data from 85 sessions in which participants performed procedural tasks guided by a mixed-reality assistive AI agent, providing a realistic setting for studying human-AI interaction.
Business Value
Facilitates the development of more intuitive and effective AI assistants for real-world tasks, improving productivity and user experience in collaborative settings.