Sarah Pan, teenage AI wizard
Summary
Sarah Pan, a high school student, achieved a NeurIPS publication by developing an RLHF pipeline for multi-step reasoning, building on OpenAI's reward model verifiers. Her work, initiated in early 2023, improved mathematical and logical coherence in large language models by grading individual steps in a reasoning process. This approach predated similar advancements seen in OpenAI's 01 models. Pan's journey into AI began at 13-14 with fast.ai courses, fostering a non-linear learning path. She later joined the MIT Primes program, mentored by Vlad Leen, and became an Answer AI fellow, contributing to WebGPU puzzles with Austin Huang. Her experience highlights the value of self-directed learning and early, practical engagement in AI.
Key takeaway
For AI students or aspiring researchers considering traditional academic paths, your demonstrated portfolio and self-directed learning are paramount. Focus on building practical skills and contributing to real-world projects, as this can open doors to top-tier research opportunities and fellowships, even without advanced degrees. Embrace non-linear learning to accelerate your expertise and impact in the field.
Key insights
Early, self-directed, top-down learning in AI can lead to significant research contributions and career opportunities.
Principles
- Top-down learning accelerates complex skill acquisition.
- Process-oriented feedback improves AI reasoning.
- Online communities foster self-driven learning.
Method
Sarah's research extended OpenAI's reward models by integrating process-based verifiers into an RLHF pipeline to update a completion model for improved multi-step reasoning.
In practice
- Adopt top-down learning for new technical domains.
- Seek mentorship for early-stage research projects.
- Utilize online forums for community support.
Topics
- Multi-step Reasoning
- Reinforcement Learning from Human Feedback
- Large Language Models
- AI Education
- WebGPU
- Answer AI
- NeurIPS
Best for: Research Scientist, AI Student, AI Scientist, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Jeremy Howard.