GEMS: Geometric Constraints Enable Multi-Semantic Superposition in LLMs

2026-06-18 · Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Expert, quick

Summary

GEMS is a novel training-free method designed to enable multi-semantic superposition in Large Language Models (LLMs) without collapse, addressing a key limitation in activation steering. The research identifies two independent sources of collapse: distributional deviation, where additive perturbations accumulate and push activations outside the training distribution, and directional interference, where non-orthogonal semantic vectors mutually dampen. GEMS tackles these by applying geometric constraints: norm-preserving weighted superposition and targeted attention-pathway injection for distributional deviation, and real-time orthogonalization for directional interference. On the GSM8K benchmark, GEMS preserves 98% accuracy when injecting three non-mathematical directions, significantly outperforming the 92% baseline and the 4% from unconstrained addition. Wikitext-2 saw only a 2.2% PPL increase, and steering effects scaled from 3B to 31B architectures.

Key takeaway

For Machine Learning Engineers developing advanced LLM control mechanisms, GEMS provides a robust, training-free solution for multi-semantic activation steering. Integrate its geometric constraints, such as norm-preserving superposition and real-time orthogonalization, into your inference pipelines for stable, concurrent semantic control. This approach avoids model collapse, preserving 98% accuracy on GSM8K. It also offers a scalable method for fine-tuning model behavior across various architectures.

Key insights

Geometric constraints are crucial for stable, multi-semantic activation steering in Large Language Models.

Principles

Multi-semantic steering collapse arises from distributional deviation and directional interference.
Norm-preserving superposition and orthogonalization are essential geometric constraints.

Method

GEMS employs norm-preserving weighted superposition, targeted attention-pathway injection, and real-time orthogonalization to manage semantic vectors.

In practice

Implement geometric constraints for multi-directional LLM steering.
Utilize attention-pathway injection to control activation distribution.
Apply real-time orthogonalization for semantic vector stability.

Topics

Activation Steering
Large Language Models
Geometric Constraints
Multi-Semantic Superposition
Model Collapse
LLM Inference

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.