Transparent Screening for LLM Inference and Training Impacts

· Source: cs.LG updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Sustainable AI & Green Computing · Depth: Advanced, long

Summary

Emotia, a Paris-based institute, has developed Emotia, a transparent screening framework for estimating the environmental impacts of large language models (LLMs) during inference and training, particularly when direct provider telemetry is unavailable. This framework, detailed in their paper, converts natural-language application descriptions into bounded environmental estimates and powers the "ImpactLLM Observatory," which currently covers 41 models. The methodology uses a multi-factor proxy approach, separating inference and training estimates, and making assumptions explicit. For inference, it uses an observed literature anchor (e.g., Gemini Apps median prompt energy of 0.24 Wh/prompt), accounts for token volume with a higher weight for output generation (ω=1.8), and adjusts for effective active parameters based on context window, serving mode, modality, and architecture. Training estimates combine parameter count with training token priors (e.g., 20 tokens per parameter), training regime, architecture, and hardware class proxies. The framework provides auditable, source-linked proxy estimates, emphasizing comparability, transparency, and reproducibility over direct measurement for opaque proprietary services.

Key takeaway

For AI/ML Directors evaluating LLM adoption, this framework offers a crucial tool for comparing environmental impacts without direct vendor data. You can quickly generate auditable, comparative estimates for inference and training, enabling more informed decisions about model selection and deployment based on energy consumption and carbon footprint. This transparency helps mitigate risks associated with opaque vendor claims and supports more disciplined environmental reporting.

Key insights

A transparent proxy framework estimates LLM inference and training impacts when direct telemetry is unavailable.

Principles

Method

The framework converts natural-language descriptions into scenarios, then applies multi-factor screening proxies for inference (anchored in prompt energy, weighted tokens, effective parameters) and training (anchored in parameter count, training tokens, regime, architecture, hardware).

In practice

Topics

Code references

Best for: Research Scientist, CTO, Director of AI/ML, AI Scientist, AI Ethicist, Policy Maker

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.LG updates on arXiv.org.