Website "In the Weights" shows whether AI models know who you are
Summary
The website "In the Weights" identifies individuals whose information is encoded within the numerical weights of large language models, indicating their relevance during training. This platform queries multiple AI models, aggregates the findings, and assigns a "strength score" to each person. For instance, scores range from 175 for Maximilian Schreiner and 262 for Matthias, up to a maximum of 996 for highly prominent figures like Mozart, Shakespeare, or Taylor Swift. Developed by former OpenAI employees Joey Flynn and Thomas Dimson, the site notes that appearing in smaller models, such as Meta's Llama with its billion parameters, signifies particularly high relevance. The creators acknowledge limitations, including the potential for models to hallucinate biographical details, the negative impact of typos on scores, and generally poorer results for common names.
Key takeaway
For AI scientists and data privacy professionals evaluating model training data and potential biases, "In the Weights" offers a unique lens into how deeply individuals are embedded in LLM knowledge. You should use this tool to assess the prominence of specific persons within various models, recognizing that smaller models indicate higher relevance. Be mindful of the site's noted limitations, such as hallucination risks and score inaccuracies for common names, when interpreting results for data governance or ethical AI considerations.
Key insights
The site reveals how deeply individuals are embedded in LLM training data, quantified by a "strength score".
Principles
- LLM weights encode knowledge of relevant individuals.
- Smaller models imply higher relevance for included individuals.
- LLMs can hallucinate biographical details.
Method
The "In the Weights" site queries several AI models, combines their results for a specific person, and then assigns a cumulative strength score based on these findings.
In practice
- Check personal "strength score" on "In the Weights".
- Evaluate LLM data retention for specific individuals.
Topics
- Large Language Models
- Model Weights
- Data Retention
- AI Hallucinations
- Personal Data
- In the Weights
Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Scientist, Tech Journalist, General Interest
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.