Five Ways to Fine-Tune Chronos-2, the Time Series Foundation Model
Summary
Chronos-2, a time-series foundation model, can be significantly improved for specific datasets through fine-tuning, particularly when zero-shot performance is insufficient. This analysis demonstrates five fine-tuning scenarios using a synthetic dataset of hourly electricity demand for eight commercial buildings. The method employs Low-Rank Adaptation (LoRA) to efficiently adapt the 120M-parameter Transformer model. Scenarios include single-building, portfolio, covariate-informed, portfolio with covariates, and held-out transfer. Using a 12-week training window and 1-week test horizon, the study shows WAPE reductions from 8.3% to 7.6% for single-building, and a substantial 66.8% relative reduction (from 8.4% to 2.8%) for portfolio + covariate fine-tuning. Covariate-informed setups consistently yielded the largest gains, with held-out transfer also showing a 26.8% relative reduction for unseen assets.
Key takeaway
For Machine Learning Engineers deploying Chronos-2 for time series forecasting, if your zero-shot model exhibits systematic errors or your data differs from pretraining, you should implement LoRA-based fine-tuning. Prioritize incorporating known-future covariates like temperature or occupancy, as this strategy yields substantial accuracy improvements, reducing WAPE by up to 66.8%. This approach also enables effective transfer learning to new, unseen assets.
Key insights
Fine-tuning Chronos-2 with LoRA significantly improves time series forecasting accuracy, especially with known-future covariates.
Principles
- Zero-shot performance is often insufficient.
- LoRA efficiently adapts large models.
- Covariates enhance fine-tuning gains.
Method
Fine-tune Chronos-2 using Hugging Face "peft" and LoRA by adapting Q, K, V, O projections in attention layers and the output patch embedding.
In practice
- Use "prediction_length=168" for one-week ahead forecasts.
- Provide "past_covariates" and "future_covariates" for context.
- Employ "EarlyStoppingCallback" to prevent overfitting.
Topics
- Chronos-2
- Time Series Forecasting
- LoRA
- Parameter-Efficient Fine-Tuning
- Covariate Modeling
- Electricity Load Forecasting
Code references
Best for: Machine Learning Engineer, AI Engineer, Data Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Towards Data Science.