LLM attribution analysis across different fine-tuning strategies and model scales for automated code compliance

· Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Expert, quick

Summary

A study investigated the interpretive behaviors of Large Language Models (LLMs) when applied to automated code compliance, focusing on how different fine-tuning strategies and model scales affect attribution patterns. Researchers used a perturbation-based attribution analysis to compare full fine-tuning (FFT), Low-Rank Adaptation (LoRA), and quantized LoRA fine-tuning across various LLM parameter sizes. The findings indicate that FFT generates statistically distinct and more focused attribution patterns compared to parameter-efficient methods like LoRA. Additionally, as LLM scale increases, models prioritize numerical constraints and rule identifiers in text, though performance gains in semantic similarity for generated computer-processable rules plateaued for models exceeding 7 billion parameters. This research aims to enhance the explainability of LLMs for critical, regulation-based tasks within the Architecture, Engineering, and Construction (AEC) industry.

Key takeaway

For AI Engineers developing LLMs for automated code compliance in regulated industries like AEC, understanding attribution patterns is critical. Your choice of fine-tuning strategy, particularly between full fine-tuning and LoRA, directly impacts model interpretability. Prioritize full fine-tuning if explainability and focused attribution are paramount, and carefully evaluate the performance gains of models larger than 7B parameters, as semantic similarity improvements may plateau.

Key insights

Fine-tuning strategies and model scale significantly alter LLM interpretive behavior for code compliance.

Principles

Method

A perturbation-based attribution analysis was used to compare interpretive behaviors of LLMs across full fine-tuning, LoRA, and quantized LoRA, examining the impact of varying model parameter sizes.

In practice

Topics

Best for: AI Engineer, NLP Engineer, AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.