arxiv_cl 92% Match Research Paper AI Researchers,Dialogue System Developers,NLP Engineers,Software Engineers working with LLMs 2 weeks ago

CoDial: Interpretable Task-Oriented Dialogue Systems Through Dialogue Flow Alignment

large-language-models › alignment

📄 Abstract

Abstract: Building Task-Oriented Dialogue (TOD) systems that generalize across different tasks remains a challenging problem. Data-driven approaches often struggle to transfer effectively to unseen tasks. While recent schema-based TOD frameworks improve generalization by decoupling task logic from language understanding, their reliance on neural or generative models often obscures how task schemas influence behaviour and hence impair interpretability. In this work, we introduce a novel framework, CoDial (Code for Dialogue), which converts a TOD task schema, represented as a novel structured heterogeneous graph, to programmatic LLM guardrailing code, such as NVIDIA's Colang, enabling interpretable and efficient alignment of dialogue policies during inference. We introduce two paradigms, $\text{CoDial}_{\text{free}}$ and $\text{CoDial}_{\text{structured}}$ for generating LLM guardrails, and propose a feedback mechanism that integrates human feedback to iteratively improve the generated code. Empirically, CoDial achieves state-of-the-art (SOTA) performance on the widely used STAR dataset and is on par with SOTA on the MultiWOZ dataset, while also providing interpretability. We additionally demonstrate CoDial's iterative improvement via manual and LLM-aided feedback, making it a practical tool for expert-guided alignment of LLMs in high-stakes domains.

Authors (5)

Radin Shayanfar

Chu Fei Luo

Rohan Bhambhoria

Samuel Dahan

Xiaodan Zhu

Submitted

June 2, 2025

arXiv Category

cs.CL

arXiv PDF

Key Contributions

Introduces CoDial, a framework that converts task-oriented dialogue schemas into programmatic LLM guardrailing code (like NVIDIA's Colang). This enables interpretable and efficient alignment of dialogue policies during inference, improving generalization and allowing human feedback integration.

Business Value

Creates more trustworthy and controllable conversational AI systems, reducing development complexity and improving user experience in task-oriented applications.

Paper Metadata

Innovation Type

Framework/Methodology

Deployment Feasibility

Moderate, requires integration with LLM inference and a system for generating/managing Colang code.

Limitations Addressed

Challenges in generalizing task-oriented dialogue systems across tasks and the lack of interpretability in current data-driven approaches.

Performance Gains

Enables interpretable and efficient alignment of dialogue policies during inference, improving generalization and allowing for iterative improvement via human feedback.

Technical Tags

task-oriented dialogue (TOD)interpretabilitydialogue flow alignmentLLM guardrailingNVIDIA Colangschema-based TODprogrammatic controlhuman feedbackstructured heterogeneous graphdialogue policy

Research Topics

Dialogue SystemsLarge Language ModelsInterpretability in AIAI AlignmentHuman-Computer Interaction

Methods & Architectures

CoDial frameworkconversion of TOD schema to LLM guardrailing code (Colang)structured heterogeneous graph representationdialogue policy alignmenthuman feedback integration Large Language Models (LLMs)

Applications & Tasks

Conversational AI Customer Service Task Automation Generalization across different tasks in TOD systemsLack of interpretability in neural/generative TOD modelsDifficulty in aligning task schemas with model behavior Building interpretable TOD systemsAligning dialogue policies with task schemasEnabling generalization across unseen tasks

Related Fields

Natural Language ProcessingConversational AIArtificial Intelligence SafetySoftware Engineering

Keywords

task-oriented dialogueinterpretabilityLLMguardrailingdialogue systemsNVIDIA Colangschemadialogue policyCoDialhuman feedback

Academic Context

#Dialogue Systems#Large Language Models#Interpretability in AI#AI Alignment#Human-Computer Interaction

Companies & Organizations

Companies Mentioned

NVIDIA

Technology Stack

Frameworks & Libraries

NVIDIA Colang

Commercial Potential

Potential Products

Interpretable dialogue system frameworksTools for building controllable LLM-based agents

Target Industries

Customer ServiceTechnologySoftware DevelopmentAutomotive (in-car assistants)

Use Case Examples

Building customer support chatbots with transparent decision-makingDeveloping voice assistants that reliably follow user instructionsCreating AI agents for automated task completion

Competitive Edge

Offers a novel approach to TOD by translating task schemas into executable code (guardrails), providing a level of interpretability and control not found in purely neural or generative models.

Market Opportunity

Large market for conversational AI and customer service automation.

Revenue Models

Licensing of the CoDial frameworkdevelopment services for custom dialogue systems.

Resource Requirements

Compute Needs

Moderate, primarily for LLM inference.

Data Requirements

Requires task schemas for TOD systems.

Deployment Constraints

Integration with LLM APIs and the Colang execution environment.

Scalability

Scalable to complex dialogue tasks and multiple domains.

Production Readiness

Maturity Level

Research

Time to Market

2-3 years

Patent Potential

Moderate, for the CoDial framework and its application.

View Full Paper Back to Papers