GHGbench: A Unified Multi-Entity, Multi-Task Benchmark for Carbon Emission Prediction

· Source: Takara TLDR - Daily AI Papers · Field: Science & Research — Artificial Intelligence & Machine Learning, Environmental Science & Earth Systems, Data Science & Analytics · Depth: Expert, medium

Summary

GHGbench is a new open dataset and benchmark designed for predicting company- and building-level greenhouse gas (GHG) emissions, addressing fragmentation in existing resources. The company track includes over 32,000 company-year records from 12,000+ firms, detailing Scope 1+2 and Scope 3 emissions alongside financial and sectoral data. The building track harmonizes 491,591 building-year records from 13 open sources across 26 metropolitan areas, incorporating climate covariates and multimodal remote-sensing embeddings. GHGbench establishes canonical splits for in-distribution and cross-region/city transfer tasks, with temporal hold-out and short-horizon forecasting as supplementary tasks. Baselines include gradient-boosted trees, a tabular foundation model, MLP, FT-Transformer, and multimodal fusion, evaluated using multi-seed paired-bootstrap tests. Key findings indicate building emissions are harder to predict than company emissions, the in-distribution to out-of-distribution gap is significant, and multimodal remote-sensing embeddings improve prediction where tabular generalization fails.

Key takeaway

For AI Scientists and Machine Learning Engineers developing carbon emission prediction models, GHGbench offers a standardized, comprehensive benchmark to evaluate model performance under realistic conditions. Your focus should be on improving generalization across diverse entities and regions, as the out-of-distribution gap significantly impacts accuracy. Consider integrating multimodal remote-sensing data, especially for building emissions, to overcome limitations of purely tabular approaches and enhance predictive precision.

Key insights

GHGbench unifies carbon emission prediction with comprehensive datasets and benchmarks for company and building levels.

Principles

Method

GHGbench defines canonical data splits for in-distribution and cross-region/city transfer tasks, evaluating baselines with multi-seed paired-bootstrap tests.

In practice

Topics

Code references

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.