InTrain: Intrinsic Trainability for Zero-Cost Neural Architecture Search

Source: Computer Vision and Pattern Recognition · Field: Technology & Digital (Artificial Intelligence & Machine Learning, Computer Vision & Pattern Recognition) · Depth: Expert, quick

Summary

InTrain is a theoretically motivated proxy for zero-cost neural architecture search (NAS). It formalizes "intrinsic trainability" as an architectural invariant built from two complementary components: geometric capacity and optimization resilience. The goal is to find high-performance networks without costly training, replacing the fragmented heuristics that existing proxies rely on. Geometric capacity is quantified by the participation ratio of the activation covariance eigenspectrum, which reflects the effective dimensionality of the representation manifolds. Optimization resilience is measured by cumulative gradient health, which assesses how robustly gradients backpropagate across network depth. InTrain combines the two through a scale-invariant multiplicative coupling, on the hypothesis that their relationship is non-additive. In experiments on standard NAS benchmarks, InTrain achieves ranking correlations comparable to leading ensemble-based proxies and surpasses other single-metric methods.
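
For reference, the participation ratio mentioned above is usually defined as follows. This is the standard textbook form, written for the eigenvalues \lambda_1, ..., \lambda_d of the activation covariance matrix; the symbols are chosen here for illustration, and the paper's exact normalization may differ.

```latex
\mathrm{PR}(\lambda_1,\dots,\lambda_d)
  = \frac{\left(\sum_{i=1}^{d}\lambda_i\right)^{2}}{\sum_{i=1}^{d}\lambda_i^{2}},
\qquad
1 \le \mathrm{PR} \le d,
\qquad
\mathrm{PR}(c\lambda_1,\dots,c\lambda_d) = \mathrm{PR}(\lambda_1,\dots,\lambda_d)
\quad \text{for } c > 0 .
```

The bounds hold whenever at least one eigenvalue is nonzero: PR equals d when variance is spread evenly over all directions and 1 when it is concentrated in a single direction. The last identity shows that rescaling the activations leaves the value unchanged, which is consistent with the scale-invariant coupling described in the summary.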

Key takeaway

For machine learning engineers who want to make neural architecture search cheaper, InTrain offers a training-free proxy for network trainability. It scores architectures on intrinsic properties (geometric capacity and optimization resilience), avoiding the computational cost of training each candidate. Consider using InTrain for early-stage architecture screening to accelerate the discovery of high-performance networks.

Key insights

InTrain quantifies neural network trainability intrinsically through geometric capacity and optimization resilience, enabling efficient zero-cost neural architecture search.

Method

InTrain operationalizes intrinsic trainability in three steps: it quantifies geometric capacity as the participation ratio of the activation covariance eigenspectrum, quantifies optimization resilience as cumulative gradient health, and combines the two through a scale-invariant multiplicative coupling.
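
A minimal NumPy sketch of how such a score could be assembled is given below. Only `participation_ratio` follows a standard definition. The source does not give formulas for cumulative gradient health or for the coupling, so `gradient_health` (geometric mean over arithmetic mean of per-layer gradient norms) and `coupled_score` (a plain product) are hypothetical stand-ins chosen for illustration, not the paper's definitions.

```python
import numpy as np


def participation_ratio(activations):
    """Effective dimensionality of a (samples x features) activation matrix."""
    x = np.asarray(activations, dtype=float)
    # Center each feature so the covariance measures spread around the mean.
    centered = x - x.mean(axis=0, keepdims=True)
    cov = centered.T @ centered / max(x.shape[0] - 1, 1)
    # eigvalsh is meant for symmetric matrices; clip tiny negative round-off.
    eigvals = np.clip(np.linalg.eigvalsh(cov), 0.0, None)
    total = eigvals.sum()
    if total == 0.0:
        return 0.0  # constant activations carry no variance
    return float(total ** 2 / np.square(eigvals).sum())


def gradient_health(layer_grad_norms):
    """HYPOTHETICAL stand-in, not the paper's metric.

    Ratio of geometric to arithmetic mean of per-layer gradient norms.
    It equals 1 when every layer has the same norm and shrinks toward 0
    when norms vanish or explode at some depths.
    """
    norms = np.asarray(layer_grad_norms, dtype=float)
    if norms.size == 0 or np.any(norms <= 0.0):
        return 0.0
    geometric_mean = np.exp(np.log(norms).mean())
    return float(geometric_mean / norms.mean())


def coupled_score(activations, layer_grad_norms):
    """HYPOTHETICAL multiplicative coupling: product of the two factors.

    Both factors are unchanged when their inputs are rescaled by a
    positive constant, so the product is scale-invariant as well.
    """
    return participation_ratio(activations) * gradient_health(layer_grad_norms)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    acts = rng.normal(size=(256, 32))      # stand-in activations for one layer
    grad_norms = [0.9, 1.0, 1.1, 0.95]     # stand-in per-layer gradient norms
    print(coupled_score(acts, grad_norms))
```

A product, unlike a sum, drops to zero when either factor collapses, which is one way to read the non-additive relationship the summary describes: a network with rich representations but dead gradients would still score poorly.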

Best for: Research Scientist, AI Scientist, Machine Learning Engineer

Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.