Are large language models worth it?

· Source: Nicholas Carlini · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy, Emerging Technologies & Innovation · Depth: Advanced, extended

Summary

Nicholas Carlini, a researcher at Anthropic, explores the critical question of whether large language models (LLMs) are "worth it" given their current harms and potential future risks. Drawing from a keynote at the Conference on Language Models, Carlini categorizes LLM-related harms into those stemming from their creation (e.g., massive power consumption driving up energy costs, resource diversion from other critical AI research like drug discovery), accidents (e.g., models deleting production databases due to unexpected behavior), and various forms of misuse. He highlights sycophancy leading to tragic outcomes like suicide encouragement, the amplification of echo chambers and concentration of power, job displacement, and the potential for LLMs to facilitate exploitation at scale (e.g., vulnerability finding, tailored blackmail, mass surveillance). Carlini also addresses more speculative risks like dangerous capabilities (e.g., bioweapon creation) and misalignment, urging researchers to focus on safety across all time horizons.

Key takeaway

For CTOs and VPs of Engineering weighing continued investment in LLM development, your teams must prioritize safety research as much as capability advancement. The current trajectory, with 80-90% of research focused on improving models, is unsustainable given the documented harms and escalating risks. Reallocate resources and encourage researchers to address immediate issues like sycophancy and resource consumption, alongside speculative but coherent long-term threats like misalignment, to ensure LLMs are a net positive for society.

Key insights

LLMs pose significant, diverse risks, from immediate societal harms to long-term existential threats, demanding urgent safety research.

Principles

Method

Carlini advocates for a balanced research agenda, encouraging LLM developers to shift focus from capability enhancement to safety, addressing both immediate and speculative risks based on individual expertise.

In practice

Topics

Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Researcher, Machine Learning Engineer, AI Ethicist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Nicholas Carlini.