Student Spotlight: Aaron Payne, Data Analyst

· Source: Data Skeptic · Field: Business & Management — Operations & Process Management, Corporate Strategy & Leadership, Consulting & Professional Services · Depth: Intermediate, extended

Summary

Aaron Payne, a Senior Insights Analyst at Chick-fil-A and Georgia Tech MBA student, discusses the application of business analytics to real-world problems, specifically a forecasting project for Comfama, a social services company in Colombia. The project aimed to accurately predict Comfama's affiliated population amidst economic inconsistencies, including the impact of the COVID-19 pandemic. Payne highlights the challenges of working with multilingual, manually entered data, emphasizing the 80% data cleaning adage. The team developed an ensemble model combining SERIMAX, a seasonal autoregressive moving average with exogenous variables, and XGBoost to improve forecast accuracy and interpretability. This approach allowed for the incorporation of economic indicators from Colombia's Bureau of Labor Statistics (GAIN) and accounted for seasonal and trend effects, significantly reducing prediction residuals.

Key takeaway

For data scientists or business analysts tasked with developing forecasting models for critical social services, prioritize interpretability and stakeholder collaboration from the outset. Your models should not only be accurate but also transparent, allowing operational teams to understand the drivers behind predictions. Consider ensemble methods like SERIMAX and XGBoost to balance accuracy with the need to incorporate external economic factors and handle data anomalies like COVID-19 effectively, ensuring real-world operational excellence and direct benefit to end-users.

Key insights

Effective analytics projects prioritize interpretability and real-world impact, especially in social services.

Principles

Method

An ensemble model combining SERIMAX with exogenous variables (economic indicators) and XGBoost, weighted by RMSE, can improve forecasting accuracy and provide interpretability for stakeholders.

In practice

Topics

Best for: Data Scientist, Data Analyst, AI Student

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Data Skeptic.