Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
📄 Abstract
Abstract: The dynamic nature of information necessitates continuously updating large
vision-language models (LVLMs). While recent knowledge editing techniques hint
at promising directions, they often focus on editing a single modality (vision
or language) in isolation. This prevalent practice neglects the inherent
multimodality of LVLMs and the continuous nature of knowledge updates,
potentially leading to suboptimal editing outcomes when considering the
interplay between modalities and the need for ongoing knowledge refinement. To
address these limitations, we propose MemEIC, a novel method for Continual and
Compositional Knowledge Editing (CCKE) in LVLMs. MemEIC enables compositional
editing of both visual and textual knowledge sequentially. Our approach employs
a hybrid external-internal editor featuring a dual external memory for
cross-modal evidence retrieval and dual LoRA adapters that facilitate
disentangled parameter updates for each modality. A key component is a
brain-inspired knowledge connector, activated selectively for compositional
reasoning, that integrates information across different modalities. Experiments
demonstrate that MemEIC significantly improves performance on complex
multimodal questions and effectively preserves prior edits, setting a new
benchmark for CCKE in LVLMs.
Authors (8)
Jin Seong
Jiyun Park
Wencke Liermann
Hongseok Choi
Yoonji Nam
Hyun Kim
+2 more
Submitted
October 29, 2025
Key Contributions
MemEIC introduces a novel method for Continual and Compositional Knowledge Editing (CCKE) in Large Vision-Language Models (LVLMs). It addresses the limitation of single-modality editing by enabling sequential editing of both visual and textual knowledge, which is crucial for dynamic information environments.
Business Value
Enables dynamic updating of AI models with the latest information, ensuring their relevance and accuracy in rapidly changing fields. This is critical for applications requiring up-to-date knowledge, such as chatbots, recommendation systems, and information retrieval.