We Evaluated ChatGPT vs. Google on 500 Search Queries

· Source: Surge AI Blog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Emerging Technologies & Innovation · Depth: Intermediate, long

Summary

A human evaluation by Surge AI compared ChatGPT's performance against Google's across 100 general informational queries and 100 coding-specific queries. The study found that ChatGPT matched or slightly surpassed Google on general queries, with 42% preference for ChatGPT versus 40% for Google. However, ChatGPT significantly outperformed Google on coding queries, winning 70% of the time. While ChatGPT excels at synthesizing information, providing concise answers, and understanding complex queries, it also exhibits "hallucinations" or inaccuracies and lacks media integration. Google, conversely, provides accurate, official answers but often presents verbose results with ads, requiring users to synthesize information themselves.

Key takeaway

For AI Engineers and Machine Learning Engineers developing search or conversational AI products, this analysis highlights ChatGPT's strengths in synthesis and complex query understanding, particularly for coding. Your focus should be on integrating these conversational strengths while rigorously addressing factual accuracy and hallucination issues. Consider how to combine the best of both worlds: AI-driven synthesis with robust, verifiable data sources to create a superior user experience.

Key insights

ChatGPT significantly outperforms Google on coding queries and ties it on general informational queries, despite not being optimized for search.

Principles

Method

Human evaluators used their own recent informational queries (pre-2022) on both Google and ChatGPT, rating and comparing the experiences. A separate set of 100 coding queries was also evaluated.

In practice

Topics

Best for: AI Engineer, Machine Learning Engineer, Entrepreneur, Software Engineer, AI Product Manager, Executive

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Surge AI Blog.