Amazon kills internal AI leaderboard after employees gamed it with pointless tasks

· Source: The Decoder · Field: Business & Management — Corporate Strategy & Leadership, Operations & Process Management · Depth: Fundamental Awareness, quick

Summary

Amazon discontinued its internal AI ranking system, "Kirorank," on May 29, 2026, after employees exploited it by directing AI agents to perform "pointless tasks" on the Kiro developer platform. This behavior inflated their scores and significantly increased the company's cloud costs. Senior Vice President Dave Treadwell acknowledged the dashboard's "good intentions" but cited the unintended cost escalation. This incident occurred as Amazon aims for over 80 percent weekly AI developer usage and plans to invest approximately \$200 billion in 2026, primarily in AI infrastructure. A similar issue with employees chasing AI usage scores was reported at Meta. Amazon is now shifting its focus to tracking "normalized deployments" of AI-generated code, emphasizing actual utility over raw token consumption.

Key takeaway

For Directors of AI/ML implementing AI adoption strategies, recognize that gamified usage metrics like raw token consumption can lead to significant cost overruns and unproductive activity. You should prioritize tracking "normalized deployments" or other metrics that directly measure the utility and integration of AI-generated code into production. This shift ensures your investments drive actual business value, not just inflated internal scores.

Key insights

Gamified AI usage metrics can inadvertently drive up costs and encourage unproductive activity, necessitating a focus on useful deployments.

Principles

In practice

Topics

Best for: CTO, AI Product Manager, Product Manager, Director of AI/ML, VP of Engineering/Data, Executive

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.