ParamBoost: Gradient Boosted Piecewise Cubic Polynomials

· Source: stat.ML updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, extended

Summary

ParamBoost is a novel Generalized Additive Model (GAM) that employs a Gradient Boosting algorithm to learn piecewise cubic polynomial shape functions for individual input features. This model is designed to create non-linear, "glass-box" interpretable models where the predictive function is fully observable. ParamBoost integrates several parametric analysis constraints, including continuity of shape functions and their derivatives up to $C^{2}$, monotonicity, convexity, feature interaction constraints, and model specification constraints. Empirical evaluations across 11 real-world datasets for regression, binary, and multi-class classification tasks demonstrate that unconstrained ParamBoost consistently outperforms other state-of-the-art GAMs. Furthermore, modellers can selectively apply these constraints, tailoring the model to specific interpretability and parametric-analysis requirements with only a modest trade-off in predictive performance. The model also allows for the computation of formal standard errors and 95% confidence intervals for its shape functions.

Key takeaway

For research scientists developing interpretable machine learning models, ParamBoost offers a powerful approach to integrate domain knowledge directly into model structure. You should consider ParamBoost when your application demands not only high predictive accuracy but also explicit, constrained interpretability, such as ensuring monotonic relationships or specific curvature in feature effects. This allows for more trustworthy and explainable models, particularly in fields like econometrics or health where parametric analysis is critical.

Key insights

ParamBoost offers a gradient-boosted GAM with piecewise cubic polynomials, enabling constrained, interpretable models with strong predictive performance.

Principles

Method

ParamBoost uses gradient boosting to fit cubic polynomial functions at leaf nodes, ensuring piecewise cubic shape functions. It incorporates constraints like $C^{2}$ continuity, monotonicity, and convexity during training, and computes confidence intervals using Hessian approximation.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Data Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.