A Smooth Alternative to the Boosted Tree

· Source: Agus’s Substack · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, long

Summary

The Multi-Scale Spectral Kernel Machine (MS-SKM) is presented as a smooth alternative to gradient-boosted trees (XGBoost, LightGBM, CatBoost) for tabular data, with its full technical treatment in "Multi-Scale Spectral Kernel Machines for Tabular Data" (Sudjianto and Zhang, 2026) and code available on GitHub. While boosted trees excel with abrupt, piecewise-constant target structures, MS-SKM, a positive-semidefinite kernel learned from the spectral domain, is designed for smooth targets like prices or rates. It offers competitive performance, closed-form interpretability, and a calibrated predictive distribution. The model reframes learning as choosing a kernel, assembled from smooth Fourier embeddings and a radial base kernel, trained by marginal likelihood. This approach provides native representation for smooth regimes, contrasting with trees that approximate curves with staircases.

Key takeaway

For Machine Learning Engineers building models on tabular data, if your target variable is smooth (e.g., prices, rates), consider the Multi-Scale Spectral Kernel Machine (MS-SKM) as a powerful alternative to gradient-boosted trees. MS-SKM offers superior calibration, built-in interpretability, and quantified predictive uncertainty, which trees often lack. Evaluate your data's underlying structure to choose between partition geometry for abrupt changes and similarity geometry for smooth variations.

Key insights

MS-SKM provides a smooth, interpretable kernel-based alternative to boosted trees for tabular data with smooth underlying structures.

Principles

Method

MS-SKM constructs a positive-semidefinite kernel using per-feature Fourier embeddings and a radial base kernel, then trains it directly via gradient descent on marginal likelihood.

In practice

Topics

Code references

Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, Data Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Agus’s Substack.