One Polluted Page Is Enough: Evaluating Web Content Pollution in Generative Recommenders

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy · Depth: Expert, quick

Summary

FORGE (Fake Online Recommendations in Generative Environments) is a new benchmark designed to evaluate how search-augmented Large Language Models (LLMs) promote fake products due to polluted web content. This benchmark simulates web-content pollution by locally rewriting real products into fake ones within retrieved web pages, then measures the LLM's propensity to recommend these fake products. FORGE encompasses 225 real-world products across 15 categories and 5 consumer scenarios. Evaluations across 12 commercial and open-weight LLMs revealed significant vulnerability: a single polluted page led to fooled rates up to 27%, escalating to 73.8% when the top-3 search results were replaced. Vulnerability varied by category, increasing when LLMs lacked stable prior knowledge. Reasoning capabilities did not mitigate this, often generating spurious social proof. Evaluated defenses, including skepticism prompting and consensus filtering, either exacerbated vulnerability or risked suppressing legitimate products.

Key takeaway

For AI/ML engineers developing search-augmented generative recommenders, you must prioritize robust content verification. Your models are highly susceptible to promoting fake products from even a single polluted web page, with reasoning often exacerbating the issue by fabricating justifications. Implement multi-source validation and carefully evaluate any skepticism-based defenses, as they can worsen vulnerability or suppress legitimate recommendations. Consider integrating external knowledge bases to strengthen product priors.

Key insights

Generative recommenders are highly vulnerable to web content pollution, promoting fake products even from a single compromised page.

Principles

Method

FORGE simulates web content pollution by rewriting real products into fake ones on retrieved pages. It then measures how often LLMs recommend these fake products across 225 items, 15 categories, and 5 scenarios.

In practice

Topics

Code references

Best for: Research Scientist, CTO, VP of Engineering/Data, AI Scientist, Machine Learning Engineer, AI Security Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.