Fine-tune Amazon Nova models for accurate email data extraction

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cloud Computing & IT Infrastructure · Depth: Intermediate, long

Summary

Fine-tuning Amazon Nova models using Amazon SageMaker AI and PEFT (LoRA) enabled Parcel Perform to achieve up to 94.77% extraction accuracy, a 16.6 percentage point improvement over baseline, reduced inference latency by over 30%, and halved costs for email data extraction. This solution addresses common challenges like model hallucinations, confusion between similar data types, and high token costs associated with HTML-formatted emails. The process involves preparing training data in Amazon Bedrock conversation format, uploading it to Amazon S3, creating a fine-tuning job in SageMaker AI with LoRA configuration, and deploying the model via Amazon Bedrock for on-demand inference. Notably, the smaller Nova Micro model, when fine-tuned, outperformed the larger Nova Lite, demonstrating the effectiveness of task-specific optimization.

Key takeaway

For MLOps Engineers automating data extraction from diverse email formats, fine-tuning Amazon Nova models with SageMaker AI and PEFT (LoRA) can significantly improve accuracy (up to 94.77%), reduce inference latency (over 30%), and cut costs (50%). This approach allows deploying customized models on Amazon Bedrock with on-demand, token-based pricing, eliminating dedicated LLM infrastructure. You should prepare at least 1,300 training samples in Bedrock conversation format to achieve meaningful results and ensure your data represents production email variety.

Key insights

Fine-tuning Amazon Nova models with PEFT significantly boosts email data extraction accuracy and efficiency while reducing costs.

Principles

Method

Prepare training data in Amazon Bedrock conversation format, upload to S3, create a SageMaker AI fine-tuning job using LoRA, then deploy via Amazon Bedrock for on-demand inference.

In practice

Topics

Code references

Best for: Machine Learning Engineer, MLOps Engineer, AI Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.