InTrain: Intrinsic Trainability for Zero-Cost Neural Architecture Search
Summary
InTrain, a novel theoretical proxy for zero-cost neural architecture search, formalizes "intrinsic trainability" as an architectural invariant derived from two synergistic components: geometric capacity and optimization resilience. This approach aims to efficiently discover high-performance networks without costly training, addressing the limitations of existing fragmented heuristics. Geometric capacity is quantified by the participation ratio of the activation covariance eigenspectrum, reflecting the effective dimensionality of representation manifolds. Optimization resilience is measured through cumulative gradient health, which assesses backpropagation robustness across network depth. InTrain integrates these dimensions using a scale-invariant multiplicative coupling, hypothesized to capture their non-additive relationship. Extensive experiments on standard NAS benchmarks demonstrate that InTrain achieves ranking correlations comparable to leading ensemble-based proxies and surpasses other single-metric methods.
Key takeaway
For Machine Learning Engineers focused on optimizing neural architecture search efficiency, InTrain provides a robust, training-free proxy to assess network trainability. This method allows you to evaluate architectures based on intrinsic properties like geometric capacity and optimization resilience, significantly reducing the computational cost associated with traditional training-based evaluations. You should consider integrating InTrain for early-stage architecture screening to accelerate high-performance network discovery.
Key insights
InTrain quantifies neural network trainability intrinsically through geometric capacity and optimization resilience, enabling efficient zero-cost neural architecture search.
Principles
- Trainability is an architectural invariant.
- Geometric capacity quantifies representation manifold dimensionality.
- Optimization resilience measures backpropagation robustness.
Method
InTrain operationalizes intrinsic trainability by quantifying geometric capacity via activation covariance eigenspectrum participation ratio and optimization resilience via cumulative gradient health, synthesizing them through a scale-invariant multiplicative coupling.
In practice
- Discover high-performance networks efficiently.
- Evaluate architectures without costly training.
- Benchmark against ensemble-based NAS proxies.
Topics
- Neural Architecture Search
- Zero-Cost Proxies
- Intrinsic Trainability
- Geometric Capacity
- Optimization Resilience
- Machine Learning
Best for: Research Scientist, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.