ParamBoost: Gradient Boosted Piecewise Cubic Polynomials
Summary
ParamBoost is a novel Generalized Additive Model (GAM) that employs a Gradient Boosting algorithm to learn piecewise cubic polynomial shape functions for individual input features. This model is designed to create non-linear, "glass-box" interpretable models where the predictive function is fully observable. ParamBoost integrates several parametric analysis constraints, including continuity of shape functions and their derivatives up to $C^{2}$, monotonicity, convexity, feature interaction constraints, and model specification constraints. Empirical evaluations across 11 real-world datasets for regression, binary, and multi-class classification tasks demonstrate that unconstrained ParamBoost consistently outperforms other state-of-the-art GAMs. Furthermore, modellers can selectively apply these constraints, tailoring the model to specific interpretability and parametric-analysis requirements with only a modest trade-off in predictive performance. The model also allows for the computation of formal standard errors and 95% confidence intervals for its shape functions.
Key takeaway
For research scientists developing interpretable machine learning models, ParamBoost offers a powerful approach to integrate domain knowledge directly into model structure. You should consider ParamBoost when your application demands not only high predictive accuracy but also explicit, constrained interpretability, such as ensuring monotonic relationships or specific curvature in feature effects. This allows for more trustworthy and explainable models, particularly in fields like econometrics or health where parametric analysis is critical.
Key insights
ParamBoost offers a gradient-boosted GAM with piecewise cubic polynomials, enabling constrained, interpretable models with strong predictive performance.
Principles
- Glass-box interpretability is enhanced by explicit constraints.
- Domain knowledge can be integrated via shape function constraints.
- Continuous differentiability enables marginal effect computation.
Method
ParamBoost uses gradient boosting to fit cubic polynomial functions at leaf nodes, ensuring piecewise cubic shape functions. It incorporates constraints like $C^{2}$ continuity, monotonicity, and convexity during training, and computes confidence intervals using Hessian approximation.
In practice
- Apply ParamBoost for explicit, constrained model interpretability.
- Use $C^{2}$ continuity for reliable marginal effect analysis.
- Impose monotonicity/convexity based on domain knowledge.
Topics
- ParamBoost
- Generalized Additive Models
- Gradient Boosting
- Piecewise Cubic Polynomials
- Model Interpretability
Code references
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, Data Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.