Distributed Multi-Layer Editing for Rule-Level Knowledge in Large Language Models

· Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Expert, quick

Summary

A new study investigates the editing of rule-level knowledge in large language models (LLMs), noting that existing methods primarily target fact-level knowledge and assume localized interventions. This assumption fails for rules, which require consistency across symbolic expressions, natural language, and concrete instances. Researchers extended the RuleEdit benchmark from 80 to 200 manually verified rules in mathematics and physics. Causal tracing revealed that rule knowledge is organized in a form-specific manner across transformer layers: formulas and descriptions reside in earlier layers, while instances are linked to middle layers. This distributed organization suggests that single-layer or contiguous-block interventions are insufficient for reliable rule editing. Consequently, the study proposes Distributed Multi-Layer Editing (DMLE), which applies distinct updates to early and middle layers. DMLE significantly improves rule-level editing, enhancing instance portability by 13.91 percentage points and rule understanding by 50.19 percentage points over baselines across models like GPT-J-6B, Qwen2.5-7B, Qwen2-7B, and LLaMA-3-8B.

Key takeaway

For AI Engineers and Research Scientists working on LLM knowledge editing, understanding that rule-level knowledge is distributed across model layers is crucial. Your current fact-level editing methods may fail for rules, leading to inconsistent behavior. Adopt Distributed Multi-Layer Editing (DMLE) to apply targeted updates to specific layers, ensuring greater consistency and improved performance in rule understanding and instance portability for your LLMs.

Key insights

Rule-level knowledge in LLMs is distributed across layers, requiring multi-layer editing for consistent updates.

Principles

Method

Distributed Multi-Layer Editing (DMLE) applies a shared early-layer update for formulas/descriptions and a separate middle-layer update for instances to improve rule consistency.

In practice

Topics

Code references

Best for: AI Engineer, Research Scientist, AI Scientist, NLP Engineer, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.