We Evaluated ChatGPT vs. Google on 500 Search Queries
Summary
A human evaluation by Surge AI compared ChatGPT's performance against Google's across 100 general informational queries and 100 coding-specific queries. The study found that ChatGPT matched or slightly surpassed Google on general queries, with 42% preference for ChatGPT versus 40% for Google. However, ChatGPT significantly outperformed Google on coding queries, winning 70% of the time. While ChatGPT excels at synthesizing information, providing concise answers, and understanding complex queries, it also exhibits "hallucinations" or inaccuracies and lacks media integration. Google, conversely, provides accurate, official answers but often presents verbose results with ads, requiring users to synthesize information themselves.
Key takeaway
For AI Engineers and Machine Learning Engineers developing search or conversational AI products, this analysis highlights ChatGPT's strengths in synthesis and complex query understanding, particularly for coding. Your focus should be on integrating these conversational strengths while rigorously addressing factual accuracy and hallucination issues. Consider how to combine the best of both worlds: AI-driven synthesis with robust, verifiable data sources to create a superior user experience.
Key insights
ChatGPT significantly outperforms Google on coding queries and ties it on general informational queries, despite not being optimized for search.
Principles
- AI models can synthesize information more effectively than traditional search.
- Conversational interfaces enhance user engagement and learning.
- Accuracy remains a critical challenge for generative AI.
Method
Human evaluators used their own recent informational queries (pre-2022) on both Google and ChatGPT, rating and comparing the experiences. A separate set of 100 coding queries was also evaluated.
In practice
- Use ChatGPT for complex coding problems or step-by-step instructions.
- Verify ChatGPT's factual claims, especially for critical information.
- Consider conversational AI for personalized learning experiences.
Topics
- ChatGPT
- Large Language Models
- Search Engine Comparison
- Human Evaluation
- Coding Assistance
Best for: AI Engineer, Machine Learning Engineer, Entrepreneur, Software Engineer, AI Product Manager, Executive
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Surge AI Blog.