A Geometric Measure of Linear Separability for Neural Representations
Summary
A new diagnostic tool, the directional linear separability measure (LSM), is introduced to characterize the class-wise geometry of neural representations, particularly for modern neural classifiers relying on linear readouts. Unlike predictive metrics, LSM quantifies the smallest intrusion of competing samples into an affine halfspace containing a target class, normalized by the target class size. This measure is asymmetric, class-wise, and applicable to finite representations from neural networks. The authors establish LSM's supporting-hyperplane characterization, relate it to optimal affine classification accuracy, and prove its invariance under full-rank linear embeddings, which helps differentiate changes from linear reparameterization versus information loss. A penalty-based affine search is also provided for estimating class-wise LSM in high-dimensional features, and it is empirically used to diagnose class-wise intrusion across deep-learning components and architectures.
Key takeaway
For Machine Learning Engineers evaluating neural network representations, understanding the class-wise geometry is crucial. You should consider applying the directional linear separability measure (LSM) to diagnose specific class intrusions and assess how linear reparameterization impacts your model's internal representations. This can help you pinpoint where information loss or nonlinear transformations occur, guiding architectural improvements or feature engineering efforts more effectively.
Key insights
LSM provides a geometric, class-wise diagnostic for linear separability in neural representations, distinguishing reparameterization from information loss.
Principles
- Linear separability can be geometrically quantified beyond predictive accuracy.
- Full-rank linear embeddings preserve LSM, isolating information loss.
- Asymmetric, class-wise measures reveal fine-grained representation geometry.
Method
LSM involves searching for affine halfspaces containing a target class and measuring the smallest normalized intrusion of competing samples. Estimation uses a penalty-based affine search for high-dimensional features.
In practice
- Diagnose class-wise intrusion in deep-learning components.
- Analyze coordinatewise gated nonlinearities as geometric operators.
- Evaluate representation changes from linear reparameterization.
Topics
- Neural Representations
- Linear Separability
- Machine Learning Metrics
- Deep Learning Architectures
- Affine Classification
- Geometric Measures
Best for: Research Scientist, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine Learning.