CelerLog: Fast Log Parsing via Dynamic Routing

Minxing Wang; Shiwen Shan; Yintong Huo; Yuxin Su; Zhiying Wu; Zibin Zheng

arxiv: 2605.26005 · v1 · pith:Z2LYA454new · submitted 2026-05-25 · 💻 cs.SE

CelerLog: Fast Log Parsing via Dynamic Routing

Shiwen Shan , Yintong Huo , Minxing Wang , Zhiying Wu , Yuxin Su , Zibin Zheng This is my paper

Pith reviewed 2026-06-29 20:18 UTC · model grok-4.3

classification 💻 cs.SE

keywords log parsingdynamic routingLLM log analysisstatistical log parsinghybrid log parserlog analysis efficiencysemantic inferencecost reduction

0 comments

The pith

CelerLog routes logs by statistical density so most avoid LLMs while accuracy holds.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes CelerLog, a log parser that first classifies incoming logs into dense groups showing strong statistical patterns and sparse groups that lack them. Dense logs are handled by a fast statistical processor while only sparse logs reach an LLM for semantic inference. This selective approach avoids running expensive LLM calls on the repetitive majority of logs. Experiments across 14 public datasets show the method leads in performance, runs 7.9x to 18.6x faster than full LLM parsers, and cuts token consumption by 80.2 to 94.1 percent. A sympathetic reader would care because log parsing supports automated system monitoring and current semantic methods remain too slow and costly at scale.

Core claim

CelerLog introduces a dynamic routing mechanism to classify logs into dense and sparse groups. Logs with strong statistical patterns are processed by an efficient statistical processor while sparse groups lacking such patterns are routed to an LLM. This hybrid strategy avoids unnecessary LLM invocations and achieves leading performance over state-of-the-art baselines on 14 public datasets, with speeds 7.9x to 18.6x faster than LLM methods and up to 1.5x faster than Drain, plus token reductions of 80.2-94.1 percent and LLM invocation reductions of 86.4-90.9 percent.

What carries the argument

The dynamic routing mechanism that partitions logs into dense (statistically patterned) and sparse groups for selective statistical or LLM processing.

If this is right

Most logs exhibit repetitive patterns that statistical methods can extract without semantic reasoning.
Only logs lacking statistical patterns require the slower, costlier LLM path.
Overall parsing latency and cost drop substantially while accuracy remains competitive with pure LLM or pure statistical baselines.
The same routing principle reduces both token consumption and the number of LLM calls by more than 80 percent on standard benchmarks.
The hybrid design outperforms both Drain and full LLM parsers on the 14 evaluated datasets.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the routing decision itself stays accurate across varied log sources, the same density-based split could apply to other tasks that mix patterned and novel data.
Lower LLM invocation rates could make high-volume log analysis practical on hardware with limited compute or budget.
Further tuning the classifier to reduce even small routing overhead would compound the observed speed gains.
The observed cost reductions suggest the method scales to production volumes where full LLM parsing is currently prohibitive.

Load-bearing premise

Incoming logs can be reliably and cheaply partitioned into dense and sparse groups such that classification overhead stays negligible and misrouting does not degrade end-to-end accuracy or cost savings.

What would settle it

A new log dataset on which the classifier routes a large fraction of logs to the wrong processor, producing either accuracy below current baselines or no measurable drop in LLM token use.

Figures

Figures reproduced from arXiv: 2605.26005 by Minxing Wang, Shiwen Shan, Yintong Huo, Yuxin Su, Zhiying Wu, Zibin Zheng.

**Figure 1.** Figure 1: The general paradigm of log parsing. effectiveness due to their lack of semantic awareness, struggling to handle complex log patterns accurately. (2) Semantic-based parsers, on the other hand, exploit semantic-aware models to distinguish dynamic parameters from static text. Recently, LLM-based parsers have emerged as a dominant solution in this category. By leveraging the powerful natural language underst… view at source ↗

**Figure 2.** Figure 2: The parsing effectiveness-cost of log parsers. We [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗

**Figure 4.** Figure 4: The overview of CelerLog. 2.2 Dense & Sparse Log Groups To address the above question, we investigate the statistical distribution of log data. Our goal is to determine whether there exists a boundary that distinguishes logs suitable for syntax-based parsing from those requiring semantic inference. Inspiration & Observation. We hypothesize that if a log template generates multiple unique messages with va… view at source ↗

**Figure 5.** Figure 5: An example of the anchor-based merging phase of the Router. [PITH_FULL_IMAGE:figures/full_fig_p004_5.png] view at source ↗

**Figure 6.** Figure 6: An example of the Statistical Processor. [PITH_FULL_IMAGE:figures/full_fig_p005_6.png] view at source ↗

**Figure 7.** Figure 7: The prompt of the LLM-based processor. as a list of exact variable strings. To ensure robustness, we process the LLM’s output by verifying that the identified variables are indeed present in the original log message. We then mask these validated variable substrings with a placeholder (e.g., <*>) to generate the final template. If the LLM fails to identify any variables or produces an invalid format, we r… view at source ↗

**Figure 8.** Figure 8: Average LLM token consumption of LLM-based log parsers and CelerLog. CelerLog achieves the lowest token consumption. CelerLog LUNAR LogBatcher LILAC 0 50 100 150 200 250 300 350 400 # Avg. Invocation 33.786 370.905 283.810 248.690 [PITH_FULL_IMAGE:figures/full_fig_p007_8.png] view at source ↗

**Figure 10.** Figure 10: Performance and cost trade-offs of log parsers. [PITH_FULL_IMAGE:figures/full_fig_p008_10.png] view at source ↗

**Figure 11.** Figure 11: The parsing results under different similarity [PITH_FULL_IMAGE:figures/full_fig_p009_11.png] view at source ↗

**Figure 12.** Figure 12: The parsing results under different top- [PITH_FULL_IMAGE:figures/full_fig_p009_12.png] view at source ↗

read the original abstract

Log parsing is a fundamental step for automated log analysis, which transforms raw log messages into structured formats. Existing syntax-based parsers struggle with complex logs because they lack semantic reasoning ability. Emerging LLM-powered semantic parsers achieve high accuracy but suffer from prohibitive latency and token costs because they apply semantic inference across all logs. Our key observation is that not all logs necessitate complex semantic understanding: a vast majority of logs exhibit repetitive patterns that can be extracted via straightforward statistical analysis. Driven by this insight, we propose CelerLog, a fast and effective log parser. CelerLog introduces a dynamic routing mechanism to classify logs into dense and sparse groups. Logs with strong statistical patterns (dense groups) are processed by an efficient statistical processor, whereas the sparse groups lacking such patterns are routed to an LLM for semantic inference. This hybrid strategy avoids unnecessary LLM invocations. Extensive experiments on 14 public datasets show that CelerLog achieves leading performance over state-of-the-art baselines and is 7.9x to 18.6x faster than LLM methods and up to 1.5x faster than Drain. Additionally, it reduces costs by decreasing token consumption by 80.2% - 94.1% and LLM invocations by 86.4% - 90.9%.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

CelerLog's dynamic routing idea makes sense for cutting LLM costs on repetitive logs, but the abstract gives zero evidence the classifier works reliably enough to support the speed and savings claims.

read the letter

The one thing to know is that this paper's core move is routing most logs to a fast statistical parser and only the sparse ones to an LLM, which could be useful if the routing step is cheap and accurate. The abstract frames this as a new observation-driven design that avoids blanket LLM calls.

What the paper does well is identify a real practical bottleneck: LLM semantic parsers are accurate on complex logs but too slow and expensive for high-volume use, while pure statistical ones like Drain fall short on tricky cases. The hybrid split is a straightforward engineering response to that.

The soft spots are not minor. All the headline numbers—7.9x–18.6x faster than LLM methods, up to 1.5x faster than Drain, 80–94% less token use, 86–91% fewer LLM calls—rest entirely on the assumption that the dynamic router correctly sends the vast majority of logs to the statistical side without much overhead or error. The abstract supplies no description of the classifier, no routing accuracy figures, no ablation on misrouting effects, and only aggregate results across the 14 datasets. If the misrouting rate is even a few percent, either accuracy drops or the claimed savings evaporate. The stress-test note lands cleanly here.

This is for people building or tuning log analysis pipelines in industry or applied SE research, especially where volume makes per-log LLM calls prohibitive. A reader looking for concrete hybrid designs could extract the routing concept even if the numbers need checking.

It deserves peer review because the problem is real and the idea is reasonable, but any referee would have to verify the router details and per-dataset breakdowns that are missing from the abstract. I would send it rather than desk reject.

Referee Report

2 major / 1 minor

Summary. The paper presents CelerLog, a hybrid log parser that uses a dynamic routing mechanism to send logs exhibiting strong statistical patterns (dense groups) to an efficient statistical processor and sparse logs lacking such patterns to an LLM for semantic inference. On 14 public datasets it claims leading performance over baselines, with speedups of 7.9x–18.6x versus LLM methods and up to 1.5x versus Drain, plus token-consumption reductions of 80.2%–94.1% and LLM-invocation reductions of 86.4%–90.9%.

Significance. If the routing step reliably partitions logs with negligible overhead and error, the hybrid design would materially improve the cost–accuracy trade-off for semantic log parsing, allowing LLM use to be restricted to a small minority of inputs while preserving overall accuracy.

major comments (2)

[Abstract] Abstract: the headline speed and cost claims rest on the premise that the dynamic router correctly identifies the 'vast majority' of logs as dense and routes only true sparse cases to the LLM; yet the abstract supplies no classifier description (features, threshold, training), no routing-accuracy or misrouting-rate figures, and no ablation showing end-to-end accuracy or latency under routing mistakes. This is load-bearing: even modest misrouting would either erode the reported savings or degrade parsing quality.
[Abstract] Abstract: only aggregate speedups and accuracy are stated across the 14 datasets; no per-dataset tables, error bars, or breakdowns of routing decisions are mentioned, preventing verification that the claimed gains are consistent rather than driven by a few favorable datasets.

minor comments (1)

The abstract would benefit from a one-sentence description of the statistical processor employed for the dense group.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on the abstract. We agree that the abstract would benefit from additional context on the routing mechanism and consistency of results. We will revise the abstract accordingly. Point-by-point responses are below.

read point-by-point responses

Referee: [Abstract] Abstract: the headline speed and cost claims rest on the premise that the dynamic router correctly identifies the 'vast majority' of logs as dense and routes only true sparse cases to the LLM; yet the abstract supplies no classifier description (features, threshold, training), no routing-accuracy or misrouting-rate figures, and no ablation showing end-to-end accuracy or latency under routing mistakes. This is load-bearing: even modest misrouting would either erode the reported savings or degrade parsing quality.

Authors: We acknowledge that the provided abstract is concise and does not include a description of the classifier, routing accuracy metrics, or an ablation on misrouting effects. The full manuscript details the routing mechanism (features, threshold, and training) in Section 3. Routing accuracy and misrouting rates are reported in the experimental evaluation. We agree an explicit ablation on routing errors is warranted and will add one. We will revise the abstract to include a brief statement on routing performance. revision: yes
Referee: [Abstract] Abstract: only aggregate speedups and accuracy are stated across the 14 datasets; no per-dataset tables, error bars, or breakdowns of routing decisions are mentioned, preventing verification that the claimed gains are consistent rather than driven by a few favorable datasets.

Authors: We agree the abstract reports only aggregate figures. The manuscript contains per-dataset tables for accuracy, runtime, and cost metrics along with error bars and routing decision breakdowns in the experimental section. These show consistent gains. We will revise the abstract to note that improvements hold across all datasets. revision: yes

Circularity Check

0 steps flagged

No circularity; empirical system evaluated on external benchmarks

full rationale

The paper describes an engineering hybrid parser whose performance numbers (speedups, token reductions) are measured outcomes on 14 public datasets against external baselines. No equations, fitted parameters renamed as predictions, or self-citation chains appear in the supplied text. The routing mechanism is presented as a design choice whose accuracy is asserted via experiment rather than derived from prior self-work. This is the normal non-circular case for an applied systems paper.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract supplies no explicit free parameters, axioms, or invented entities; the routing threshold and group definitions are mentioned only conceptually.

pith-pipeline@v0.9.1-grok · 5769 in / 1020 out tokens · 25814 ms · 2026-06-29T20:18:33.516813+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

58 extracted references · 7 canonical work pages · 1 internal anchor

[1]

NLTK Project

2001. NLTK Project. https://www.nltk.org/howto/wordnet.html. Oneline; Accessed: 2026-03-16

2001
[2]

Industrial-Strength Natural Language Processing

2016. Industrial-Strength Natural Language Processing. https://spacy.io/. Online; Accessed: 2026-03-16

2016
[3]

OpenAI API

2026. OpenAI API. https://openai.com/blog/openai-api. Online; Accessed: 2026-03-16

2026
[4]

Shan Ali, Chaima Boufaied, Domenico Bianculli, Paula Branco, Lionel Briand, and Nathan Aschbacher. 2023. An Empirical Study on Log-based Anomaly Detection Using Machine Learning.arXiv preprint arXiv:2307.16714(2023)

work page arXiv 2023
[5]

O’Reilly Media, Inc

Steven Bird, Ewan Klein, and Edward Loper. 2009.Natural language processing with Python: analyzing text with the natural language toolkit. " O’Reilly Media, Inc. "

2009
[6]

Boyuan Chen, Jian Song, Peng Xu, Xing Hu, and Zhen Ming Jiang. 2018. An automated approach to estimating code coverage measures via execution logs. In Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering. 305–316

2018
[7]

Zhuangbin Chen, Jinyang Liu, Wenwei Gu, Yuxin Su, and Michael R Lyu. 2021. Ex- perience report: Deep learning-based system log analysis for anomaly detection. arXiv preprint arXiv:2107.05908(2021)

work page arXiv 2021
[8]

Tianyu Cui, Ruowei Fu, Changchang Liu, Yuhe Ji, Wenwei Gu, Shenglin Zhang, Yongqian Sun, and Dan Pei. 2025. AetherLog: Log-based Root Cause Analysis by Integrating Large Language Models with Knowledge Graphs. In2025 IEEE 36th International Symposium on Software Reliability Engineering (ISSRE). IEEE, 49–60

2025
[9]

Hetong Dai, Heng Li, Che-Shao Chen, Weiyi Shang, and Tse-Hsun Chen. 2020. Logram: Efficient Log Parsing Using 𝑛 n-Gram Dictionaries.IEEE Transactions on Software Engineering (TSE)48, 3 (2020), 879–892

2020
[10]

Hetong Dai, Yiming Tang, Heng Li, and Weiyi Shang. 2023. PILAR: Studying and Mitigating the Influence of Configurations on Log Parsing. In2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE). IEEE, 818–829

2023
[11]

Min Du, Feifei Li, Guineng Zheng, and Vivek Srikumar. 2017. Deeplog: Anomaly detection and diagnosis from system logs through deep learning. InProceedings of the 2017 ACM SIGSAC conference on computer and communications security. 1285–1298

2017
[12]

Qiang Fu, Jian-Guang Lou, Yi Wang, and Jiang Li. 2009. Execution anomaly detection in distributed systems through unstructured log analysis. In2009 ninth IEEE international conference on data mining (ICDM). IEEE, 149–158

2009
[13]

Hossein Hamooni, Biplob Debnath, Jianwu Xu, Hui Zhang, Guofei Jiang, and Abdullah Mueen. 2016. Logmine: Fast pattern recognition for log analytics. InProceedings of the 25th ACM International on Conference on Information and Knowledge Management (CIKM). 1573–1582

2016
[14]

Minghua He, Tong Jia, Chiming Duan, Huaqian Cai, Ying Li, and Gang Huang
[15]

In2025 IEEE/ACM 47th International Conference on Soft- ware Engineering (ICSE)

Weakly-supervised log-based anomaly detection with inexact labels via multi-instance learning. In2025 IEEE/ACM 47th International Conference on Soft- ware Engineering (ICSE). IEEE, 2918–2930
[16]

Pinjia He, Jieming Zhu, Shilin He, Jian Li, and Michael R Lyu. 2016. An evaluation study on log parsing and its use in log mining. In2016 46th annual IEEE/IFIP international conference on dependable systems and networks (DSN). IEEE, 654– 661

2016
[17]

Pinjia He, Jieming Zhu, Zibin Zheng, and Michael R Lyu. 2017. Drain: An online log parsing approach with fixed depth tree. In2017 IEEE international conference on web services (ICWS). IEEE, 33–40

2017
[18]

Junjie Huang, Zhihan Jiang, Zhuangbin Chen, and Michael Lyu. 2025. No more labelled examples? an unsupervised log parser with llms.Proceedings of the ACM on Software Engineering2, FSE (2025), 2406–2429

2025
[19]

Yintong Huo, Yuxin Su, Cheryl Lee, and Michael R Lyu. 2023. Semparser: A semantic parser for log analytics. In2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE). IEEE, 881–893

2023
[20]

Zhihan Jiang, Jinyang Liu, Zhuangbin Chen, Yichen Li, Junjie Huang, Yintong Huo, Pinjia He, Jiazhen Gu, and Michael R Lyu. 2024. Lilac: Log parsing using llms with adaptive parsing cache.Proceedings of the ACM on Software Engineering 1, FSE (2024), 137–160

2024
[21]

Zhihan Jiang, Jinyang Liu, Junjie Huang, Yichen Li, Yintong Huo, Jiazhen Gu, Zhuangbin Chen, Jieming Zhu, and Michael R Lyu. 2024. A large-scale evaluation for log parsing techniques: How far are we?. InProceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis. 223–234

2024
[22]

Zhihan Jiang, Jinyang Liu, Yichen Li, Haiyu Huang, Xiao He, Tieying Zhang, Jianjun Chen, Yi Li, Rui Shi, and Michael R Lyu. 2025. LogPilot: Intent-aware and Scalable Alert Diagnosis for Large-scale Online Service Systems.arXiv preprint arXiv:2509.25874(2025)

work page arXiv 2025
[23]

Zhen Ming Jiang, Ahmed E Hassan, Parminder Flora, and Gilbert Hamann. 2008. Abstracting execution logs to execution events for enterprise applications (short paper). In2008 The Eighth International Conference on Quality Software. IEEE, 181–186

2008
[24]

Van-Hoang Le and Hongyu Zhang. 2021. Log-based anomaly detection with- out log parsing. In2021 36th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 492–504

2021
[25]

Van-Hoang Le and Hongyu Zhang. 2023. Log Parsing with Prompt-based Few- shot Learning.arXiv preprint arXiv:2302.07435(2023)

work page arXiv 2023
[26]

Xiaoyun Li, Hongyu Zhang, Van-Hoang Le, and Pengfei Chen. 2024. Logshrink: Effective log compression by leveraging commonality and variability of log data. InProceedings of the 46th IEEE/ACM International Conference on Software Engineering. 1–12

2024
[27]

Jinyang Liu, Jieming Zhu, Shilin He, Pinjia He, Zibin Zheng, and Michael R Lyu. 2019. Logzip: Extracting hidden structures via iterative clustering for log compression. In2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 863–873

2019
[28]

Yudong Liu, Xu Zhang, Shilin He, Hongyu Zhang, Liqun Li, Yu Kang, Yong Xu, Minghua Ma, Qingwei Lin, Yingnong Dang, et al. 2022. Uniparser: A unified log parser for heterogeneous log data. InProceedings of the ACM Web Conference 2022 (WWW). 1893–1901

2022
[29]

Jian-Guang Lou, Qiang Fu, Shenqi Yang, Ye Xu, and Jiang Li. 2010. Mining invariants from console logs for system problem detection. In2010 USENIX annual technical conference (USENIX ATC 10)

2010
[30]

Zeyang Ma, Dong Jae Kim, and Tse-Hsun Chen. 2024. Librelog: Accurate and efficient unsupervised log parsing using open-source large language models. arXiv preprint arXiv:2408.01585(2024)

work page arXiv 2024
[31]

Adetokunbo AO Makanju, A Nur Zincir-Heywood, and Evangelos E Milios. 2009. Clustering event logs using iterative partitioning. InProceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD). 1255–1264

2009
[32]

Weibin Meng, Ying Liu, Yichen Zhu, Shenglin Zhang, Dan Pei, Yuqing Liu, Yihao Chen, Ruizhi Zhang, Shimin Tao, Pei Sun, et al. 2019. Loganomaly: Unsupervised detection of sequential and quantitative anomalies in unstructured logs.. InIjcai, Vol. 19. 4739–4745

2019
[33]

Suphakit Niwattanakul, Jatsada Singthongchai, Ekkachai Naenudorn, and Su- pachanun Wanapu. 2013. Using of Jaccard coefficient for keywords similarity. InProceedings of the international multiconference of engineers and computer scientists, Vol. 1. 380–384

2013
[34]

Paolo Notaro, Soroush Haeri, Jorge Cardoso, and Michael Gerndt. 2023. LogRule: Efficient Structured Log Mining for Root Cause Analysis.IEEE Transactions on Network and Service Management(2023)

2023
[35]

David Ohana. 2020. A blog about Drain usage in IBM Cloud. https://developer.ibm.com/blogs/how-mining-log-templates-can-help-ai- ops-in-cloud-scale-data-centers/. Online; Accessed: 2026-03-16

2020
[36]

OpenAI. [n. d.]. GPT-5.2. https://platform.openai.com/docs/models/gpt-5.2. One- line; Accessed: 2026-03-16

2026
[37]

Daan Schipper, Maurício Aniche, and Arie van Deursen. 2019. Tracing back log data to its log statement: from research to practice. In2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR). IEEE, 545–549

2019
[38]

Issam Sedki, Abdelwahab Hamou-Lhadj, Otmane Ait-Mohamed, and Mo- hammed A Shehab. 2022. An Effective Approach for Parsing Large Log Files. In2022 IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 1–12

2022
[39]

Shiwen Shan, Yintong Huo, Yuxin Su, Yichen Li, Dan Li, and Zibin Zheng. 2024. Face it yourselves: An llm-based two-stage strategy to localize configuration errors via logs. InProceedings of the 33rd ACM SIGSOFT international symposium on software testing and analysis. 13–25

2024
[40]

Keiichi Shima. 2016. Length matters: Clustering system log messages using length of words.arXiv preprint arXiv:1611.03213(2016)

work page internal anchor Pith review Pith/arXiv arXiv 2016
[41]

Donghwan Shin, Zanis Ali Khan, Domenico Bianculli, and Lionel Briand. 2021. A theoretical framework for understanding the relationship between log pars- ing and anomaly detection. InInternational Conference on Runtime Verification. Springer, 277–287

2021
[42]

Risto Vaarandi. 2003. A data clustering algorithm for mining patterns from event logs. InProceedings of the 3rd IEEE Workshop on IP Operations & Management (IPOM)(IEEE Cat. No. 03EX764). Ieee, 119–126

2003
[43]

Risto Vaarandi and Mauno Pihelgas. 2015. Logcluster-a data clustering and pattern mining algorithm for event logs. In2015 11th International conference on network and service management (CNSM). IEEE, 1–7. Conference’17, July 2017, Washington, DC, USA Shiwen Shan ¶, Yintong Huo ‡, Minxing Wang‡, Zhiying Wu ¶, Yuxin Su ¶, Zibin Zheng ¶

2015
[44]

Lingzhi Wang, Nengwen Zhao, Junjie Chen, Pinnong Li, Wenchi Zhang, and Kaixin Sui. 2020. Root-cause metric location for microservice systems via log anomaly detection. In2020 IEEE international conference on web services (ICWS). IEEE, 142–150

2020
[45]

Teng Wang, Xiaodong Liu, Shanshan Li, Xiangke Liao, Wang Li, and Qing Liao
[46]

In2018 IEEE International Conference on Software Quality, Reliability and Security (QRS)

MisconfDoctor: diagnosing misconfiguration via log-based configuration testing. In2018 IEEE International Conference on Software Quality, Reliability and Security (QRS). IEEE, 1–12
[47]

Thorsten Wittkopp, Philipp Wiesner, and Odej Kao. 2024. Logrca: Log-based root cause analysis for distributed services. InEuropean Conference on Parallel Processing. Springer, 362–376

2024
[48]

Pei Xiao, Tong Jia, Chiming Duan, Minghua He, Weijie Hong, Xixuan Yang, Yihan Wu, Ying Li, and Gang Huang. 2025. Clslog: Collaborating large and small models for log-based anomaly detection. InProceedings of the 33rd ACM International Conference on the Foundations of Software Engineering. 686–690

2025
[49]

Yi Xiao, Van-Hoang Le, and Hongyu Zhang. 2024. free: Towards more practical log parsing with large language models. InProceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering. 153–165

2024
[50]

Weiyuan Xu, Juntao Luo, Tao Huang, Kaixin Sui, Jie Geng, Qijun Ma, Isami Akasaka, Xiaoxue Shi, Jing Tang, and Peng Cai. 2025. LogSage: An LLM-based framework for CI/CD failure detection and remediation with industrial validation. arXiv preprint arXiv:2506.03691(2025)

work page arXiv 2025
[51]

Siyu Yu, Pinjia He, Ningjiang Chen, and Yifan Wu. 2023. Brain: Log parsing with bidirectional parallel tree.IEEE Transactions on Services Computing16, 5 (2023), 3224–3237

2023
[52]

Siyu Yu, Yifan Wu, Ying Li, and Pinjia He. 2024. Unlocking the power of numbers: Log compression via numeric token parsing. InProceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering. 919–930

2024
[53]

Ding Yuan, Haohui Mai, Weiwei Xiong, Lin Tan, Yuanyuan Zhou, and Shankar Pasupathy. 2010. Sherlog: error diagnosis by connecting clues from run-time logs. InProceedings of the fifteenth International Conference on Architectural support for programming languages and operating systems. 143–154

2010
[54]

Chenbo Zhang, Wenying Xu, Jinbu Liu, Lu Zhang, Guiyang Liu, Jihong Guan, Qi Zhou, and Shuigeng Zhou. 2025. SemanticLog: Towards Effective and Efficient Large-Scale Semantic Log Parsing.IEEE Transactions on Software Engineering (2025)

2025
[55]

Xu Zhang, Yong Xu, Qingwei Lin, Bo Qiao, Hongyu Zhang, Yingnong Dang, Chunyu Xie, Xinsheng Yang, Qian Cheng, Ze Li, et al. 2019. Robust log-based anomaly detection on unstable log data. InProceedings of the 2019 27th ACM joint meeting on European software engineering conference and symposium on the foundations of software engineering. 807–817

2019
[56]

Xiang Zhou, Xin Peng, Tao Xie, Jun Sun, Chao Ji, Dewei Liu, Qilin Xiang, and Chuan He. 2019. Latent error prediction and fault localization for microservice applications by learning from system trace logs. InProceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (...

2019
[57]

Jieming Zhu, Shilin He, Pinjia He, Jinyang Liu, and Michael R Lyu. 2023. Loghub: A large collection of system log datasets for ai-driven log analytics. In2023 IEEE 34th International Symposium on Software Reliability Engineering (ISSRE). IEEE, 355–366

2023
[58]

Jieming Zhu, Shilin He, Jinyang Liu, Pinjia He, Qi Xie, Zibin Zheng, and Michael R Lyu. 2019. Tools and benchmarks for automated log parsing. In2019 IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP). IEEE, 121–130

2019

[1] [1]

NLTK Project

2001. NLTK Project. https://www.nltk.org/howto/wordnet.html. Oneline; Accessed: 2026-03-16

2001

[2] [2]

Industrial-Strength Natural Language Processing

2016. Industrial-Strength Natural Language Processing. https://spacy.io/. Online; Accessed: 2026-03-16

2016

[3] [3]

OpenAI API

2026. OpenAI API. https://openai.com/blog/openai-api. Online; Accessed: 2026-03-16

2026

[4] [4]

Shan Ali, Chaima Boufaied, Domenico Bianculli, Paula Branco, Lionel Briand, and Nathan Aschbacher. 2023. An Empirical Study on Log-based Anomaly Detection Using Machine Learning.arXiv preprint arXiv:2307.16714(2023)

work page arXiv 2023

[5] [5]

O’Reilly Media, Inc

Steven Bird, Ewan Klein, and Edward Loper. 2009.Natural language processing with Python: analyzing text with the natural language toolkit. " O’Reilly Media, Inc. "

2009

[6] [6]

Boyuan Chen, Jian Song, Peng Xu, Xing Hu, and Zhen Ming Jiang. 2018. An automated approach to estimating code coverage measures via execution logs. In Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering. 305–316

2018

[7] [7]

Zhuangbin Chen, Jinyang Liu, Wenwei Gu, Yuxin Su, and Michael R Lyu. 2021. Ex- perience report: Deep learning-based system log analysis for anomaly detection. arXiv preprint arXiv:2107.05908(2021)

work page arXiv 2021

[8] [8]

Tianyu Cui, Ruowei Fu, Changchang Liu, Yuhe Ji, Wenwei Gu, Shenglin Zhang, Yongqian Sun, and Dan Pei. 2025. AetherLog: Log-based Root Cause Analysis by Integrating Large Language Models with Knowledge Graphs. In2025 IEEE 36th International Symposium on Software Reliability Engineering (ISSRE). IEEE, 49–60

2025

[9] [9]

Hetong Dai, Heng Li, Che-Shao Chen, Weiyi Shang, and Tse-Hsun Chen. 2020. Logram: Efficient Log Parsing Using 𝑛 n-Gram Dictionaries.IEEE Transactions on Software Engineering (TSE)48, 3 (2020), 879–892

2020

[10] [10]

Hetong Dai, Yiming Tang, Heng Li, and Weiyi Shang. 2023. PILAR: Studying and Mitigating the Influence of Configurations on Log Parsing. In2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE). IEEE, 818–829

2023

[11] [11]

Min Du, Feifei Li, Guineng Zheng, and Vivek Srikumar. 2017. Deeplog: Anomaly detection and diagnosis from system logs through deep learning. InProceedings of the 2017 ACM SIGSAC conference on computer and communications security. 1285–1298

2017

[12] [12]

Qiang Fu, Jian-Guang Lou, Yi Wang, and Jiang Li. 2009. Execution anomaly detection in distributed systems through unstructured log analysis. In2009 ninth IEEE international conference on data mining (ICDM). IEEE, 149–158

2009

[13] [13]

Hossein Hamooni, Biplob Debnath, Jianwu Xu, Hui Zhang, Guofei Jiang, and Abdullah Mueen. 2016. Logmine: Fast pattern recognition for log analytics. InProceedings of the 25th ACM International on Conference on Information and Knowledge Management (CIKM). 1573–1582

2016

[14] [14]

Minghua He, Tong Jia, Chiming Duan, Huaqian Cai, Ying Li, and Gang Huang

[15] [15]

In2025 IEEE/ACM 47th International Conference on Soft- ware Engineering (ICSE)

Weakly-supervised log-based anomaly detection with inexact labels via multi-instance learning. In2025 IEEE/ACM 47th International Conference on Soft- ware Engineering (ICSE). IEEE, 2918–2930

[16] [16]

Pinjia He, Jieming Zhu, Shilin He, Jian Li, and Michael R Lyu. 2016. An evaluation study on log parsing and its use in log mining. In2016 46th annual IEEE/IFIP international conference on dependable systems and networks (DSN). IEEE, 654– 661

2016

[17] [17]

Pinjia He, Jieming Zhu, Zibin Zheng, and Michael R Lyu. 2017. Drain: An online log parsing approach with fixed depth tree. In2017 IEEE international conference on web services (ICWS). IEEE, 33–40

2017

[18] [18]

Junjie Huang, Zhihan Jiang, Zhuangbin Chen, and Michael Lyu. 2025. No more labelled examples? an unsupervised log parser with llms.Proceedings of the ACM on Software Engineering2, FSE (2025), 2406–2429

2025

[19] [19]

Yintong Huo, Yuxin Su, Cheryl Lee, and Michael R Lyu. 2023. Semparser: A semantic parser for log analytics. In2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE). IEEE, 881–893

2023

[20] [20]

Zhihan Jiang, Jinyang Liu, Zhuangbin Chen, Yichen Li, Junjie Huang, Yintong Huo, Pinjia He, Jiazhen Gu, and Michael R Lyu. 2024. Lilac: Log parsing using llms with adaptive parsing cache.Proceedings of the ACM on Software Engineering 1, FSE (2024), 137–160

2024

[21] [21]

Zhihan Jiang, Jinyang Liu, Junjie Huang, Yichen Li, Yintong Huo, Jiazhen Gu, Zhuangbin Chen, Jieming Zhu, and Michael R Lyu. 2024. A large-scale evaluation for log parsing techniques: How far are we?. InProceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis. 223–234

2024

[22] [22]

Zhihan Jiang, Jinyang Liu, Yichen Li, Haiyu Huang, Xiao He, Tieying Zhang, Jianjun Chen, Yi Li, Rui Shi, and Michael R Lyu. 2025. LogPilot: Intent-aware and Scalable Alert Diagnosis for Large-scale Online Service Systems.arXiv preprint arXiv:2509.25874(2025)

work page arXiv 2025

[23] [23]

Zhen Ming Jiang, Ahmed E Hassan, Parminder Flora, and Gilbert Hamann. 2008. Abstracting execution logs to execution events for enterprise applications (short paper). In2008 The Eighth International Conference on Quality Software. IEEE, 181–186

2008

[24] [24]

Van-Hoang Le and Hongyu Zhang. 2021. Log-based anomaly detection with- out log parsing. In2021 36th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 492–504

2021

[25] [25]

Van-Hoang Le and Hongyu Zhang. 2023. Log Parsing with Prompt-based Few- shot Learning.arXiv preprint arXiv:2302.07435(2023)

work page arXiv 2023

[26] [26]

Xiaoyun Li, Hongyu Zhang, Van-Hoang Le, and Pengfei Chen. 2024. Logshrink: Effective log compression by leveraging commonality and variability of log data. InProceedings of the 46th IEEE/ACM International Conference on Software Engineering. 1–12

2024

[27] [27]

Jinyang Liu, Jieming Zhu, Shilin He, Pinjia He, Zibin Zheng, and Michael R Lyu. 2019. Logzip: Extracting hidden structures via iterative clustering for log compression. In2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 863–873

2019

[28] [28]

Yudong Liu, Xu Zhang, Shilin He, Hongyu Zhang, Liqun Li, Yu Kang, Yong Xu, Minghua Ma, Qingwei Lin, Yingnong Dang, et al. 2022. Uniparser: A unified log parser for heterogeneous log data. InProceedings of the ACM Web Conference 2022 (WWW). 1893–1901

2022

[29] [29]

Jian-Guang Lou, Qiang Fu, Shenqi Yang, Ye Xu, and Jiang Li. 2010. Mining invariants from console logs for system problem detection. In2010 USENIX annual technical conference (USENIX ATC 10)

2010

[30] [30]

Zeyang Ma, Dong Jae Kim, and Tse-Hsun Chen. 2024. Librelog: Accurate and efficient unsupervised log parsing using open-source large language models. arXiv preprint arXiv:2408.01585(2024)

work page arXiv 2024

[31] [31]

Adetokunbo AO Makanju, A Nur Zincir-Heywood, and Evangelos E Milios. 2009. Clustering event logs using iterative partitioning. InProceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD). 1255–1264

2009

[32] [32]

Weibin Meng, Ying Liu, Yichen Zhu, Shenglin Zhang, Dan Pei, Yuqing Liu, Yihao Chen, Ruizhi Zhang, Shimin Tao, Pei Sun, et al. 2019. Loganomaly: Unsupervised detection of sequential and quantitative anomalies in unstructured logs.. InIjcai, Vol. 19. 4739–4745

2019

[33] [33]

Suphakit Niwattanakul, Jatsada Singthongchai, Ekkachai Naenudorn, and Su- pachanun Wanapu. 2013. Using of Jaccard coefficient for keywords similarity. InProceedings of the international multiconference of engineers and computer scientists, Vol. 1. 380–384

2013

[34] [34]

Paolo Notaro, Soroush Haeri, Jorge Cardoso, and Michael Gerndt. 2023. LogRule: Efficient Structured Log Mining for Root Cause Analysis.IEEE Transactions on Network and Service Management(2023)

2023

[35] [35]

David Ohana. 2020. A blog about Drain usage in IBM Cloud. https://developer.ibm.com/blogs/how-mining-log-templates-can-help-ai- ops-in-cloud-scale-data-centers/. Online; Accessed: 2026-03-16

2020

[36] [36]

OpenAI. [n. d.]. GPT-5.2. https://platform.openai.com/docs/models/gpt-5.2. One- line; Accessed: 2026-03-16

2026

[37] [37]

Daan Schipper, Maurício Aniche, and Arie van Deursen. 2019. Tracing back log data to its log statement: from research to practice. In2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR). IEEE, 545–549

2019

[38] [38]

Issam Sedki, Abdelwahab Hamou-Lhadj, Otmane Ait-Mohamed, and Mo- hammed A Shehab. 2022. An Effective Approach for Parsing Large Log Files. In2022 IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 1–12

2022

[39] [39]

Shiwen Shan, Yintong Huo, Yuxin Su, Yichen Li, Dan Li, and Zibin Zheng. 2024. Face it yourselves: An llm-based two-stage strategy to localize configuration errors via logs. InProceedings of the 33rd ACM SIGSOFT international symposium on software testing and analysis. 13–25

2024

[40] [40]

Keiichi Shima. 2016. Length matters: Clustering system log messages using length of words.arXiv preprint arXiv:1611.03213(2016)

work page internal anchor Pith review Pith/arXiv arXiv 2016

[41] [41]

Donghwan Shin, Zanis Ali Khan, Domenico Bianculli, and Lionel Briand. 2021. A theoretical framework for understanding the relationship between log pars- ing and anomaly detection. InInternational Conference on Runtime Verification. Springer, 277–287

2021

[42] [42]

Risto Vaarandi. 2003. A data clustering algorithm for mining patterns from event logs. InProceedings of the 3rd IEEE Workshop on IP Operations & Management (IPOM)(IEEE Cat. No. 03EX764). Ieee, 119–126

2003

[43] [43]

Risto Vaarandi and Mauno Pihelgas. 2015. Logcluster-a data clustering and pattern mining algorithm for event logs. In2015 11th International conference on network and service management (CNSM). IEEE, 1–7. Conference’17, July 2017, Washington, DC, USA Shiwen Shan ¶, Yintong Huo ‡, Minxing Wang‡, Zhiying Wu ¶, Yuxin Su ¶, Zibin Zheng ¶

2015

[44] [44]

Lingzhi Wang, Nengwen Zhao, Junjie Chen, Pinnong Li, Wenchi Zhang, and Kaixin Sui. 2020. Root-cause metric location for microservice systems via log anomaly detection. In2020 IEEE international conference on web services (ICWS). IEEE, 142–150

2020

[45] [45]

Teng Wang, Xiaodong Liu, Shanshan Li, Xiangke Liao, Wang Li, and Qing Liao

[46] [46]

In2018 IEEE International Conference on Software Quality, Reliability and Security (QRS)

MisconfDoctor: diagnosing misconfiguration via log-based configuration testing. In2018 IEEE International Conference on Software Quality, Reliability and Security (QRS). IEEE, 1–12

[47] [47]

Thorsten Wittkopp, Philipp Wiesner, and Odej Kao. 2024. Logrca: Log-based root cause analysis for distributed services. InEuropean Conference on Parallel Processing. Springer, 362–376

2024

[48] [48]

Pei Xiao, Tong Jia, Chiming Duan, Minghua He, Weijie Hong, Xixuan Yang, Yihan Wu, Ying Li, and Gang Huang. 2025. Clslog: Collaborating large and small models for log-based anomaly detection. InProceedings of the 33rd ACM International Conference on the Foundations of Software Engineering. 686–690

2025

[49] [49]

Yi Xiao, Van-Hoang Le, and Hongyu Zhang. 2024. free: Towards more practical log parsing with large language models. InProceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering. 153–165

2024

[50] [50]

Weiyuan Xu, Juntao Luo, Tao Huang, Kaixin Sui, Jie Geng, Qijun Ma, Isami Akasaka, Xiaoxue Shi, Jing Tang, and Peng Cai. 2025. LogSage: An LLM-based framework for CI/CD failure detection and remediation with industrial validation. arXiv preprint arXiv:2506.03691(2025)

work page arXiv 2025

[51] [51]

Siyu Yu, Pinjia He, Ningjiang Chen, and Yifan Wu. 2023. Brain: Log parsing with bidirectional parallel tree.IEEE Transactions on Services Computing16, 5 (2023), 3224–3237

2023

[52] [52]

Siyu Yu, Yifan Wu, Ying Li, and Pinjia He. 2024. Unlocking the power of numbers: Log compression via numeric token parsing. InProceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering. 919–930

2024

[53] [53]

Ding Yuan, Haohui Mai, Weiwei Xiong, Lin Tan, Yuanyuan Zhou, and Shankar Pasupathy. 2010. Sherlog: error diagnosis by connecting clues from run-time logs. InProceedings of the fifteenth International Conference on Architectural support for programming languages and operating systems. 143–154

2010

[54] [54]

Chenbo Zhang, Wenying Xu, Jinbu Liu, Lu Zhang, Guiyang Liu, Jihong Guan, Qi Zhou, and Shuigeng Zhou. 2025. SemanticLog: Towards Effective and Efficient Large-Scale Semantic Log Parsing.IEEE Transactions on Software Engineering (2025)

2025

[55] [55]

Xu Zhang, Yong Xu, Qingwei Lin, Bo Qiao, Hongyu Zhang, Yingnong Dang, Chunyu Xie, Xinsheng Yang, Qian Cheng, Ze Li, et al. 2019. Robust log-based anomaly detection on unstable log data. InProceedings of the 2019 27th ACM joint meeting on European software engineering conference and symposium on the foundations of software engineering. 807–817

2019

[56] [56]

Xiang Zhou, Xin Peng, Tao Xie, Jun Sun, Chao Ji, Dewei Liu, Qilin Xiang, and Chuan He. 2019. Latent error prediction and fault localization for microservice applications by learning from system trace logs. InProceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (...

2019

[57] [57]

Jieming Zhu, Shilin He, Pinjia He, Jinyang Liu, and Michael R Lyu. 2023. Loghub: A large collection of system log datasets for ai-driven log analytics. In2023 IEEE 34th International Symposium on Software Reliability Engineering (ISSRE). IEEE, 355–366

2023

[58] [58]

Jieming Zhu, Shilin He, Jinyang Liu, Pinjia He, Qi Xie, Zibin Zheng, and Michael R Lyu. 2019. Tools and benchmarks for automated log parsing. In2019 IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP). IEEE, 121–130

2019