ChinAI #343: AI Safety/Security Governance Research Report (CAICT 2025)
Summary
A China Academy of Information and Communications Technology (CAICT) research report on AI safety and security governance, published in 2025, reveals Chinese scholars' deep engagement with global AI governance developments. The report, authored by affiliates of CAICT, the China AI Industry Alliance, the Ministry of Information and Industry Technology, and the Ministry of Public Security, cites international sources like CISPA Helmholtz Center's "JailbreakRadar" and OWASP's LLM vulnerabilities list. It highlights emerging risks such as "value alignment lapses" and the misuse potential of open-source AI, exemplified by "KawaiiGPT" and "WormGPT 4." The report also details information pollution risks, noting Jinri Toutiao blocked over 930,000 pieces of low-quality AI-generated content in 2024. CAICT's AI Safety benchmark, which consistently updates, shows all fifteen models tested in December 2025 had over a 10% hallucination rate.
Key takeaway
For AI/ML Directors evaluating governance strategies, the CAICT 2025 report underscores the critical need to integrate international AI safety and security research. Your teams should actively track global benchmarks and vulnerability lists, such as OWASP's LLM top ten, to anticipate and mitigate risks like value alignment failures and information pollution. Prioritize robust evaluation of open-source models given their potential for misuse and monitor hallucination rates as a key safety metric.
Key insights
Chinese AI governance research actively integrates global findings on safety, security, and emerging risks.
Principles
- Open-source AI lowers misuse barriers.
- AI capabilities raise "unknown risks."
- Information pollution is a societal risk.
Method
The CAICT AI Safety benchmark evaluates models, with the 2025 Q1 update focusing on hallucination rates across fifteen models.
In practice
- Monitor CAICT's AI Safety benchmark for updates.
- Review OWASP's LLM vulnerabilities list.
- Consider "JailbreakRadar" for new AI security risks.
Topics
- AI Safety Governance
- Large Language Model Vulnerabilities
- AI Hallucination
- Information Pollution
- CAICT AI Safety Benchmark
Best for: CTO, VP of Engineering/Data, Director of AI/ML, AI Researcher, AI Ethicist, Policy Maker
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by ChinAI Newsletter.