Quantization-Robust LLM Unlearning via Low-Rank Adaptation

· Source: Machine Learning · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Advanced, quick

Summary

A new method for quantization-robust unlearning in Large Language Models (LLMs) addresses the challenge where post-training quantization (PTQ) can negate unlearning updates, causing models to revert to their original behavior. Standard full-parameter fine-tuning often results in parameter changes too subtle to survive aggressive low-bit quantization, such as 4-bit PTQ. The proposed approach utilizes low-rank adaptation (LoRA) by freezing the base model and concentrating unlearning into trainable adapters, ensuring the effective update persists after quantization. Evaluated on Llama-2-7B with the MUSE dataset (BOOKS and NEWS), LoRA improved 4-bit utility by up to 7.93 points (NPO+GDR on BOOKS: 50.17 to 58.10) and increased utility on NEWS for GA+GDR (40.06 to 44.82). Furthermore, LoRA significantly reduced privacy leakage under 4-bit PTQ, moving PrivLeak from -25.68 to -5.86 for GA+KLR on BOOKS, while maintaining effective forgetting.

Key takeaway

For AI Engineers deploying LLMs that require both unlearning and efficient inference via post-training quantization, integrating LoRA into your unlearning workflow is crucial. This approach ensures that unlearning updates are preserved even under aggressive 4-bit quantization, significantly improving utility and reducing privacy leakage. You should prioritize LoRA-based unlearning methods to maintain model integrity and compliance in production environments.

Key insights

Low-rank adaptation (LoRA) enables effective LLM unlearning that survives aggressive post-training quantization.

Principles

Method

Freeze the base LLM and apply unlearning updates exclusively to trainable LoRA adapters, preserving the effective update through post-training quantization.

In practice

Topics

Best for: AI Engineer, NLP Engineer, AI Scientist, Machine Learning Engineer, Deep Learning Engineer, AI Researcher

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.