Website "In the Weights" shows whether AI models know who you are

· Source: The Decoder · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Fundamental Awareness, quick

Summary

The website "In the Weights" identifies individuals whose information is encoded within the numerical weights of large language models, indicating their relevance during training. This platform queries multiple AI models, aggregates the findings, and assigns a "strength score" to each person. For instance, scores range from 175 for Maximilian Schreiner and 262 for Matthias, up to a maximum of 996 for highly prominent figures like Mozart, Shakespeare, or Taylor Swift. Developed by former OpenAI employees Joey Flynn and Thomas Dimson, the site notes that appearing in smaller models, such as Meta's Llama with its billion parameters, signifies particularly high relevance. The creators acknowledge limitations, including the potential for models to hallucinate biographical details, the negative impact of typos on scores, and generally poorer results for common names.

Key takeaway

For AI scientists and data privacy professionals evaluating model training data and potential biases, "In the Weights" offers a unique lens into how deeply individuals are embedded in LLM knowledge. You should use this tool to assess the prominence of specific persons within various models, recognizing that smaller models indicate higher relevance. Be mindful of the site's noted limitations, such as hallucination risks and score inaccuracies for common names, when interpreting results for data governance or ethical AI considerations.

Key insights

The site reveals how deeply individuals are embedded in LLM training data, quantified by a "strength score".

Principles

Method

The "In the Weights" site queries several AI models, combines their results for a specific person, and then assigns a cumulative strength score based on these findings.

In practice

Topics

Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Scientist, Tech Journalist, General Interest

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The Decoder.