Octopus: History-Free Gradient Orthogonalization for Continual Learning in Multimodal Large Language Models

· Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

Octopus is a novel two-stage continual learning framework designed for multimodal large language models (MLLMs) that addresses catastrophic forgetting without relying on historical task data. This framework, based on History-Free Gradient Orthogonalization (HiFGO), enforces gradient-level orthogonality to prevent parameter interference. Unlike architecture-based methods that add computational overhead or rehearsal-based methods with privacy and storage concerns, Octopus decouples task adaptation from regularization. This strategy aims to balance model plasticity and stability during sequential knowledge acquisition. Experimental results on the UCIT benchmark demonstrate that Octopus achieves state-of-the-art performance, outperforming previous methods by 2.14% in average accuracy and 6.82% in last task accuracy.

Key takeaway

For AI Engineers developing continual learning systems for MLLMs, Octopus offers a compelling alternative to data-intensive or computationally heavy methods. By adopting its history-free gradient orthogonalization, you can mitigate catastrophic forgetting while addressing privacy and storage concerns. Consider implementing this two-stage finetuning approach to achieve a better balance between model plasticity and stability in your sequential learning tasks.

Key insights

Octopus enables continual learning in MLLMs by orthogonalizing gradients without needing historical data.

Principles

Method

Octopus employs a two-stage finetuning strategy using History-Free Gradient Orthogonalization (HiFGO) to achieve gradient-level orthogonality, balancing plasticity and stability in MLLMs.

In practice

Topics

Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.