
BLUR: A Bi-Level Optimization Approach for LLM Unlearning

Abstract

Enabling large language models (LLMs) to unlearn knowledge and capabilities acquired during training has proven vital for ensuring compliance with data regulations and promoting ethical practices in generative AI. Although there is growing interest in developing various unlearning algorithms, it remains unclear how best to formulate the unlearning problem. The most popular formulation uses a weighted sum of the forget and retain losses, but it often leads to performance degradation due to the inherent trade-off between the two. In this work, we argue that it is important to model the hierarchical structure of the unlearning problem, where the forget problem (which unlearns certain knowledge and/or capabilities) takes priority over the retain problem (which preserves model utility). This hierarchical structure naturally leads to a bi-level optimization formulation, where the lower-level objective focuses on minimizing the forget loss while the upper-level objective aims to maintain the model's utility. Based on this new formulation, we propose a novel algorithm, termed Bi-Level UnleaRning (BLUR), which not only possesses strong theoretical guarantees but, more importantly, delivers superior performance. In particular, our extensive experiments demonstrate that BLUR consistently outperforms state-of-the-art algorithms across various unlearning tasks, models, and metrics. Code is available at https://github.com/OptimAI-Lab/BLURLLMUnlearning.
Authors (9)
Hadi Reisizadeh
Jinghan Jia
Zhiqi Bu
Bhanukiran Vinzamuri
Anil Ramakrishna
Kai-Wei Chang
+3 more
Submitted
June 9, 2025
arXiv Category
cs.LG
arXiv PDF

Key Contributions

Introduces a bi-level optimization formulation for LLM unlearning, arguing for a hierarchical structure where forgetting takes priority over retaining utility. This approach aims to better manage the trade-off between forgetting specific knowledge and preserving overall model performance.
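The hierarchical idea can be illustrated with a minimal sketch. The toy quadratic losses, penalty weight, and gradient-descent loop below are illustrative assumptions, not the paper's actual BLUR algorithm or objectives: the lower-level forget loss is prioritized via a large penalty weight, while the upper-level retain loss steers the solution among near-minimizers of the forget loss.

```python
import numpy as np

# Toy quadratic stand-ins for the forget/retain losses (assumptions for
# illustration only; the real losses would be LLM training objectives).
FORGET_OPT = np.array([0.0, 0.0])   # hypothetical minimizer of the forget loss
RETAIN_OPT = np.array([1.0, 2.0])   # hypothetical minimizer of the retain loss

def forget_loss(theta):
    return 0.5 * np.sum((theta - FORGET_OPT) ** 2)

def forget_grad(theta):
    return theta - FORGET_OPT

def retain_grad(theta):
    return theta - RETAIN_OPT

def bilevel_unlearn(theta, lam=10.0, lr=0.05, steps=500):
    """Penalty-style approximation of the bi-level formulation:
    a large weight `lam` on the forget gradient enforces the lower-level
    (forgetting) priority; the retain gradient acts as the upper level."""
    for _ in range(steps):
        g = retain_grad(theta) + lam * forget_grad(theta)
        theta = theta - lr * g
    return theta

theta = bilevel_unlearn(np.array([3.0, -1.0]))
print(theta, forget_loss(theta))
```

With the large penalty weight, the solution lands close to the forget-loss minimizer (here near the origin), only slightly pulled toward the retain optimum, mirroring the paper's claim that forgetting should take priority over utility preservation.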

Business Value

Enables organizations to comply with data privacy regulations (like GDPR's 'right to be forgotten') by selectively removing data or knowledge from LLMs, reducing legal and reputational risks.