GEMS: Geometric Constraints Enable Multi-Semantic Superposition in LLMs

· Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Expert, quick

Summary

GEMS is a novel training-free method designed to enable multi-semantic superposition in Large Language Models (LLMs) without collapse, addressing a key limitation in activation steering. The research identifies two independent sources of collapse: distributional deviation, where additive perturbations accumulate and push activations outside the training distribution, and directional interference, where non-orthogonal semantic vectors mutually dampen. GEMS tackles these by applying geometric constraints: norm-preserving weighted superposition and targeted attention-pathway injection for distributional deviation, and real-time orthogonalization for directional interference. On the GSM8K benchmark, GEMS preserves 98% accuracy when injecting three non-mathematical directions, significantly outperforming the 92% baseline and the 4% from unconstrained addition. Wikitext-2 saw only a 2.2% PPL increase, and steering effects scaled from 3B to 31B architectures.

Key takeaway

For Machine Learning Engineers developing advanced LLM control mechanisms, GEMS provides a robust, training-free solution for multi-semantic activation steering. Integrate its geometric constraints, such as norm-preserving superposition and real-time orthogonalization, into your inference pipelines for stable, concurrent semantic control. This approach avoids model collapse, preserving 98% accuracy on GSM8K. It also offers a scalable method for fine-tuning model behavior across various architectures.

Key insights

Geometric constraints are crucial for stable, multi-semantic activation steering in Large Language Models.

Principles

Method

GEMS employs norm-preserving weighted superposition, targeted attention-pathway injection, and real-time orthogonalization to manage semantic vectors.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.