TimeROME-DLM: Temporal Causal Tracing and Low-Rank Inference-Time Knowledge Editing for Masked Diffusion Language Models

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

TimeROME-DLM is a novel training-free, gradient-free, inference-time knowledge-editing framework designed specifically for Masked Diffusion Language Models (MDLMs) like LLaDA. Unlike existing methods such as ROME and MEMIT, which target autoregressive LLMs and struggle with MDLMs' iterative denoising or high VRAM costs, TimeROME-DLM effectively addresses these challenges. The framework integrates a Temporal Indirect Effect (TIE) causal-tracing protocol to pinpoint critical intervention coordinates and a closed-form, low-rank residual edit memory that applies a single ridge-regularized update per diffusion forward. This approach keeps backbone weights frozen, requiring tuning only three hyperparameters (alpha, lambda, q). Benchmarked on TOFU forget01 with LLaDA-8B-Base, TimeROME-DLM reduced forget-set log-probability by approximately 83 nats, maintained retain-set utility within ~1 nat across 50 facts, and achieved a 4-14x wall-clock speedup with zero additional VRAM. It also scales sub-linearly to 400 facts and is compatible with models like Dream-7B and DiffuLLaMA-7B.

Key takeaway

For Machine Learning Engineers developing or deploying Masked Diffusion Language Models, TimeROME-DLM offers a critical solution for knowledge editing. If you need to update facts or unlearn information in MDLMs like LLaDA-8B-Base, consider integrating this gradient-free, inference-time framework. It avoids costly retraining and high VRAM. This approach provides significant speedups (4-14x) and maintains model utility. It enables agile model updates and improved data governance for your diffusion-based language models.

Key insights

TimeROME-DLM enables efficient, training-free knowledge editing for Masked Diffusion Language Models, overcoming limitations of AR LLM methods.

Principles

Method

TimeROME-DLM uses Temporal Indirect Effect (TIE) causal tracing to locate edit points, then applies a closed-form, low-rank residual edit memory with ridge regularization and sparsification at each diffusion forward step.

In practice

Topics

Best for: Research Scientist, AI Scientist, NLP Engineer, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.