Characterising LLM-Generated Competency Questions: a Cross-Domain Empirical Study using Open and Closed Models

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Advanced, quick

Summary

A new study characterizes Competency Questions (CQs) generated by various Large Language Models (LLMs) for ontology engineering. Traditionally, CQs are manually created by ontology engineers and domain experts to define requirements as natural language questions an ontology must satisfy. This research introduces quantitative measures to systematically compare LLM-generated CQs across multiple dimensions, including readability, relevance to input text, and structural complexity. The study evaluates both open models like KimiK2-1T, LLama3.1-8B, and LLama3.2-3B, and closed models such as Gemini 2.5 Pro and GPT 4.1. Experiments conducted across diverse use cases and requirements reveal that LLM performance in CQ generation produces distinct profiles influenced by the specific use case, highlighting the varied intrinsic properties of the CQs produced.

Key takeaway

For ontology engineers and AI scientists evaluating LLMs for automated Competency Question generation, you should conduct a systematic, cross-domain analysis of generated CQs. Focus on readability, relevance, and structural complexity, as model performance and output characteristics are highly dependent on the specific use case. This will help you select the most appropriate LLM for your project's requirements.

Key insights

LLMs can automate Competency Question generation, but their output quality varies significantly by model and use case.

Principles

Method

The study uses quantitative measures to compare LLM-generated CQs based on readability, relevance to input, and structural complexity across open and closed models for various use cases.

In practice

Topics

Best for: AI Scientist, Research Scientist, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.