v264: Proceedings of Large Foundation Models for Educational Assessment

· Source: Proceedings of Machine Learning Research · Field: Education & Learning — Educational Technology (EdTech), Academic Research & Higher Education · Depth: Expert, quick

Summary

Volume 264 presents the proceedings from the First Workshop on Large Foundation Models for Educational Assessment, held on December 15-16, 2024, in Vancouver, BC, Canada. This collection features research exploring diverse applications of Large Foundation Models (LFMs) and Large Language Models (LLMs) in educational contexts. Key contributions include MIRROR, a novel approach for automated evaluation of open-ended question generation, and a comparative study demonstrating GPT-4V's superior performance over Gemini Pro in educational tasks. Other papers detail methods for automatic generation of mathematics question hints, automated feedback for open-ended questions using fine-tuned LLMs, and machine learning techniques like BanditCAT and AutoIRT for Computerized Adaptive Testing. Further innovations include VISTA for tailored math problem generation, GPT-enhanced non-cognitive assessment item generation, and scalable automated grading for engineering conceptual questions. The volume also addresses assessing spatially distributed personality traits and automating educational presentation generation.

Key takeaway

For educational technologists and assessment developers evaluating AI integration, this volume highlights the immediate utility of Large Foundation Models. You should consider GPT-4V for tasks requiring high performance, as it outperforms Gemini Pro in educational contexts. Fine-tuning LLMs offers a viable path for automating feedback and generating diverse assessment items, including non-cognitive ones. Explore specialized approaches like BanditCAT for adaptive testing and VISTA for tailored math problem generation to enhance your assessment systems.

Key insights

Large Foundation Models are transforming educational assessment across diverse applications.

Principles

In practice

Topics

Best for: AI Scientist, Research Scientist, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Proceedings of Machine Learning Research.