On Sequence-to-Sequence Models for Automated Log Parsing

· Source: cs.SE updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Data Science & Analytics · Depth: Expert, extended

Summary

This study systematically evaluated four sequence modeling architectures—Transformer, Mamba state-space, monodirectional LSTM, and bidirectional LSTM—for automated log parsing. Researchers trained 396 models across various dataset configurations, using relative Levenshtein edit distance for evaluation. Transformer achieved the lowest mean relative edit distance (0.111), followed by Mamba (0.145), mono-LSTM (0.186), and bi-LSTM (0.265). Mamba provided competitive accuracy with substantially lower computational cost. Character-level tokenization generally improved performance, while sequence length had negligible practical impact on Transformer accuracy. Both Mamba and Transformer demonstrated stronger sample efficiency than recurrent models, with Transformers reducing parsing error by 23.4%.

Key takeaway

For MLOps engineers designing log parsing pipelines, this study clarifies architectural choices. If your environment has structured or moderately variable log datasets, Mamba offers strong performance with significantly lower computational costs, making it ideal for budget-constrained deployments. For highly heterogeneous or distribution-shifted log formats, Transformers provide maximum robustness, reducing parsing error by 23.4%, despite higher compute demands. Evaluate character-level tokenization for improved accuracy.

Key insights

Mamba and Transformer models offer superior log parsing accuracy and sample efficiency, with Mamba providing cost advantages.

Principles

Method

A controlled empirical study compared Transformer, Mamba, mono-LSTM, and bi-LSTM architectures, training 396 models across varied datasets and evaluating with relative Levenshtein edit distance.

In practice

Topics

Code references

Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, MLOps Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.SE updates on arXiv.org.