LLM attribution analysis across different fine-tuning strategies and model scales for automated code compliance

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Expert, quick

Summary

Research on large language models (LLMs) for automated code compliance typically focuses on performance, but this study investigates how training decisions influence their interpretive behavior. The paper uses a perturbation-based attribution analysis to compare LLMs across various fine-tuning strategies, including full fine-tuning (FFT), Low-Rank Adaptation (LoRA), and quantized LoRA. It also examines the impact of different LLM parameter sizes. Findings indicate that FFT yields attribution patterns that are statistically distinct and more concentrated than those from parameter-efficient methods. Additionally, as model scale increases, LLMs prioritize numerical constraints and rule identifiers in text, though performance gains in semantic similarity for generated rules plateau for models larger than 7B parameters. This work aims to enhance the transparency of LLMs for critical, regulation-based tasks in the Architecture, Engineering, and Construction industry.

Key takeaway

For AI Engineers developing LLMs for regulatory compliance, understanding how fine-tuning and model scale affect interpretive behavior is crucial. Full fine-tuning (FFT) offers more focused attribution, which can be vital for explainability in critical applications. Be aware that performance gains in semantic similarity may plateau for models larger than 7B, suggesting an optimal scale for resource allocation.

Key insights

Fine-tuning strategies and model scale significantly alter LLM interpretive behavior in code compliance tasks.

Principles

Method

A perturbation-based attribution analysis was used to compare LLM interpretive behaviors across full fine-tuning, LoRA, and quantized LoRA, and varying model scales.

In practice

Topics

Best for: AI Engineer, NLP Engineer, AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.