Llama 3 is not very censored
Summary
On April 19, 2024, the Llama 3 large language model was released, demonstrating significantly reduced censorship compared to its predecessor, Llama 2. The new model exhibits less than one-third the false refusal rates of Llama 2, enabling it to engage with a broader array of topics. Examples illustrate this change: Llama 3 provides helpful suggestions for "killing time at the airport" instead of refusing, offers Python code for formatting a hard drive (with a warning), and discusses the destructive potential of a hypothetical global uranium-based nuclear bomb, complete with estimated stockpiles. In contrast, Llama 2 refused all these prompts, citing ethical or safety concerns. Llama 3 can be run locally by downloading Ollama and executing `ollama run llama3`.
Key takeaway
For NLP Engineers evaluating open-source models for sensitive applications, Llama 3's substantially lower false refusal rate means your applications can handle a wider range of user queries without encountering unnecessary content restrictions. You should consider integrating Llama 3 to improve user experience and reduce development overhead associated with prompt engineering workarounds for overly cautious models.
Key insights
Llama 3 significantly reduces false refusals compared to Llama 2, expanding its conversational range.
Principles
- Lower false refusal rates improve model utility.
- Contextual warnings can precede sensitive code generation.
Method
The article implicitly demonstrates a comparative evaluation method by presenting side-by-side responses of Llama 3 and Llama 2 to identical prompts across various sensitive topics.
In practice
- Run `ollama run llama3` to test Llama 3 locally.
- Compare Llama 3's responses to Llama 2 for specific use cases.
Topics
- Llama 3
- Prompt Refusal
- LLM Safety
- Content Moderation
- Large Language Models
Best for: NLP Engineer, CTO, VP of Engineering/Data, AI Engineer, Machine Learning Engineer, AI Researcher
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Ollama Blog.