The 80% power lie

· Source: Statistical Modeling, Causal Inference, and Social Science · Field: Science & Research — Mathematics & Computational Sciences, Research Methodology & Innovation · Depth: Intermediate, short

Summary

The common claim of 80% statistical power in research studies, often required for NIH funding, is mathematically inconsistent with observed scientific outcomes. For a study to truly have 80% power (meaning an 80% probability of a 95% confidence interval excluding zero), the true effect size must be at least 2.8 standard errors from zero. This implies that studies should routinely yield p-values as low as 0.0005 or even 0.0000016. However, such extremely low p-values are rarely seen in practice, suggesting that the actual power of most studies is significantly lower, potentially as low as 6%. This discrepancy arises because assumed effect sizes are often inflated, literature estimates are biased, and systematic errors and variations across conditions are prevalent, making it much harder to achieve 80% power than simple calculations suggest.

Key takeaway

For AI scientists designing experiments or evaluating research, recognize that routine claims of 80% statistical power are often unrealistic. Your experimental designs should account for the common overestimation of effect sizes and the prevalence of systematic errors, rather than relying on simple power calculations that yield inflated N values. Prioritize robust methodology and realistic effect size assumptions over merely meeting a funding requirement for 80% power.

Key insights

Claims of 80% statistical power in research are often mathematically inconsistent with observed p-value distributions.

Principles

In practice

Topics

Best for: AI Scientist, Research Scientist, Data Scientist, AI Student

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Statistical Modeling, Causal Inference, and Social Science.