The paper addresses contextual integrity (CI) for autonomous agents by developing a framework that teaches models to reason about which information is appropriate to disclose in a given context. It combines explicit prompting with reinforcement learning, and it reports significantly fewer inappropriate disclosures while maintaining task performance across various LLMs.
This work is crucial for building trustworthy AI systems, especially those acting on behalf of users, because it helps prevent privacy breaches and supports responsible information handling.
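To make the trade-off between disclosure and task performance concrete, here is a minimal sketch of how a reinforcement learning reward could score an agent's response. It is an illustration under stated assumptions, not the paper's reward: the `Scenario` type, the `ci_reward` function, and the substring-matching rule are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Scenario:
    """Hypothetical task description (not from the paper)."""
    required: frozenset[str]    # values the task needs the agent to share
    restricted: frozenset[str]  # values inappropriate to share in this context


def ci_reward(response: str, scenario: Scenario) -> float:
    """Return utility minus leakage, each a fraction in [0, 1].

    Assumption: a value counts as disclosed if it appears as a
    case-insensitive substring of the response.
    """
    text = response.lower()
    shared = sum(v.lower() in text for v in scenario.required)
    leaked = sum(v.lower() in text for v in scenario.restricted)
    utility = shared / len(scenario.required) if scenario.required else 1.0
    leakage = leaked / len(scenario.restricted) if scenario.restricted else 0.0
    return utility - leakage


# Example: booking an appointment without revealing the diagnosis.
scenario = Scenario(
    required=frozenset({"Dr. Lee", "March 3"}),
    restricted=frozenset({"diabetes"}),
)
good = "Please book a visit with Dr. Lee on March 3."
bad = "Please book a visit with Dr. Lee on March 3 for a diabetes follow-up."
print(ci_reward(good, scenario), ci_reward(bad, scenario))
```

A response that completes the task without leaking scores highest, so a policy optimized against a signal of this shape is pushed to keep task-relevant details and drop contextually inappropriate ones.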