Octopus: History-Free Gradient Orthogonalization for Continual Learning in Multimodal Large Language Models
Summary
Octopus is a novel two-stage continual learning framework designed for multimodal large language models (MLLMs) that addresses catastrophic forgetting without relying on historical task data. This framework, based on History-Free Gradient Orthogonalization (HiFGO), enforces gradient-level orthogonality to prevent parameter interference. Unlike architecture-based methods that add computational overhead or rehearsal-based methods with privacy and storage concerns, Octopus decouples task adaptation from regularization. This strategy aims to balance model plasticity and stability during sequential knowledge acquisition. Experimental results on the UCIT benchmark demonstrate that Octopus achieves state-of-the-art performance, outperforming previous methods by 2.14% in average accuracy and 6.82% in last task accuracy.
Key takeaway
For AI Engineers developing continual learning systems for MLLMs, Octopus offers a compelling alternative to data-intensive or computationally heavy methods. By adopting its history-free gradient orthogonalization, you can mitigate catastrophic forgetting while addressing privacy and storage concerns. Consider implementing this two-stage finetuning approach to achieve a better balance between model plasticity and stability in your sequential learning tasks.
Key insights
Octopus enables continual learning in MLLMs by orthogonalizing gradients without needing historical data.
Principles
- Decouple task adaptation from regularization.
- Enforce gradient orthogonality for stability.
Method
Octopus employs a two-stage finetuning strategy using History-Free Gradient Orthogonalization (HiFGO) to achieve gradient-level orthogonality, balancing plasticity and stability in MLLMs.
In practice
- Apply HiFGO for privacy-sensitive continual learning.
- Use two-stage finetuning for MLLM stability.
Topics
- Continual Learning
- Multimodal Large Language Models
- Gradient Orthogonalization
- Catastrophic Forgetting
- Finetuning Strategy
Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.