The Evolution of Reasoning in Small Language Models with Yejin Choi - #761

· Source: The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Advanced, extended

Summary

Yejin Choi, a professor and senior fellow at Stanford University's Computer Science Department and Institute for Human-Centered AI (HAI), discusses her recent work on enhancing reasoning in small language models (SLMs). She highlights the critical role of high-quality, diverse data in bridging the performance gap between SLMs and large language models (LLMs). Choi explains how integrating synthetic data generation, imitation learning, and reinforcement learning can improve SLM reasoning. Her research also addresses the risks of output homogeneity and mode collapse, as detailed in her "Artificial Hivemind" paper, and its implications for human creativity. Novel approaches include reinforcement learning as a pre-training objective, which incentivizes models to "think" before token prediction, and "Prismatic Synthesis," a gradient-based method for generating diverse synthetic math data. Choi also touches on the societal implications of AI, advocating for pluralistic alignment to ensure AI reflects diverse human norms and values, and emphasizes democratizing AI beyond large organizations.

Key takeaway

For AI scientists and research scientists focused on improving small language models, prioritize data quality and diversity. Your efforts should include exploring advanced synthetic data generation techniques like Prismatic Synthesis and integrating reinforcement learning into pre-training objectives. This approach can significantly enhance reasoning capabilities and mitigate issues like mode collapse, ultimately making AI more accessible and reflective of diverse human intelligence.

Key insights

High-quality, diverse data and integrated learning approaches are key to enhancing small language model reasoning.

Principles

Method

Prismatic Synthesis generates diverse synthetic math data by aggressively filtering overrepresented examples using gradient vectors and K-means clustering, then iteratively refining prompts with unique examples.

In practice

Topics

Best for: AI Scientist, Research Scientist, AI Researcher, Machine Learning Engineer, AI Ethicist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence).