How to Humanize AI Text: What We Learned from 19,804 Text Pairs
Summary
ReText.AI conducted an internal study across 19,804 paired texts, evaluating how a specialized AI humanizer alters detection scores from a single AI detector. The research involved eight language models, including Llama-3.2–3B-Instruct and GPT-oss-120B, and covered 20 topic categories, with 80% of the dataset in Russian. The humanizer, built on Gemma-2–9B-IT and fine-tuned with SimPO, aims to rewrite text by adjusting sentence structure, rhythm, and vocabulary to sound more natural. Results showed a significant shift in AI-likelihood scores, with the median dropping from approximately 0.92 to 0.45 after humanization. More than 90% of texts received a lower score, and 14 of 20 categories achieved a classification flip rate above 50%, notably in recipes (66.7%) and legal texts (64.2%). The study acknowledges limitations, including its internal nature, the use of the same detector for training and evaluation, and the dataset's language skew.
Key takeaway
For content creators and editors aiming to produce authentic-sounding text, you should integrate AI humanizers as an editing tool, not a shortcut. Start by outlining your arguments, then use AI for a first draft. After humanization, meticulously verify semantics, data, and references, as the process may alter accuracy. Finally, cross-check with AI detectors for insights, but always complete the text by hand with personal insights and specific details to ensure genuine authorship and quality.
Key insights
Targeted AI text rewriting can substantially lower detector scores, but its effectiveness varies by topic and measurement.
Principles
- Humanization rewrites sentence structure, rhythm, and vocabulary.
- Lower detector scores do not prove human authorship.
- Structured content humanizes more effectively.
Method
A Gemma-2–9B-IT model, fine-tuned with SimPO, generated rewritten variants optimized to reduce AI-likelihood probability based on detector confidence.
In practice
- Generate AI first drafts, then polish.
- Verify factual accuracy post-humanization.
- Use AI detectors for cross-referencing drafts.
Topics
- AI Text Humanization
- AI Content Detection
- Language Model Evaluation
- SimPO Fine-tuning
- Text Rewriting
- Multilingual Datasets
Best for: Research Scientist, AI Scientist, NLP Engineer, Marketing Professional
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Naturallanguageprocessing on Medium.