TimeROME-DLM: Temporal Causal Tracing and Low-Rank Inference-Time Knowledge Editing for Masked Diffusion Language Models
Summary
TimeROME-DLM is a novel training-free, gradient-free, inference-time knowledge-editing framework designed specifically for Masked Diffusion Language Models (MDLMs) like LLaDA. Unlike existing methods such as ROME and MEMIT, which target autoregressive LLMs and struggle with MDLMs' iterative denoising or high VRAM costs, TimeROME-DLM effectively addresses these challenges. The framework integrates a Temporal Indirect Effect (TIE) causal-tracing protocol to pinpoint critical intervention coordinates and a closed-form, low-rank residual edit memory that applies a single ridge-regularized update per diffusion forward. This approach keeps backbone weights frozen, requiring tuning only three hyperparameters (alpha, lambda, q). Benchmarked on TOFU forget01 with LLaDA-8B-Base, TimeROME-DLM reduced forget-set log-probability by approximately 83 nats, maintained retain-set utility within ~1 nat across 50 facts, and achieved a 4-14x wall-clock speedup with zero additional VRAM. It also scales sub-linearly to 400 facts and is compatible with models like Dream-7B and DiffuLLaMA-7B.
Key takeaway
For Machine Learning Engineers developing or deploying Masked Diffusion Language Models, TimeROME-DLM offers a critical solution for knowledge editing. If you need to update facts or unlearn information in MDLMs like LLaDA-8B-Base, consider integrating this gradient-free, inference-time framework. It avoids costly retraining and high VRAM. This approach provides significant speedups (4-14x) and maintains model utility. It enables agile model updates and improved data governance for your diffusion-based language models.
Key insights
TimeROME-DLM enables efficient, training-free knowledge editing for Masked Diffusion Language Models, overcoming limitations of AR LLM methods.
Principles
- Causal tracing identifies critical edit coordinates.
- Low-rank residual edits preserve utility.
- Gradient-free updates reduce VRAM.
Method
TimeROME-DLM uses Temporal Indirect Effect (TIE) causal tracing to locate edit points, then applies a closed-form, low-rank residual edit memory with ridge regularization and sparsification at each diffusion forward step.
In practice
- Edit LLaDA-8B-Base facts without retraining.
- Apply to Dream-7B, DiffuLLaMA-7B models.
- Achieve 4-14x speedup in editing.
Topics
- Masked Diffusion Language Models
- Knowledge Editing
- Causal Tracing
- Low-Rank Adaptation
- Model Unlearning
- LLaDA
Best for: Research Scientist, AI Scientist, NLP Engineer, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.