pith. sign in

arxiv: 2605.26005 · v1 · pith:Z2LYA454new · submitted 2026-05-25 · 💻 cs.SE

CelerLog: Fast Log Parsing via Dynamic Routing

Pith reviewed 2026-06-29 20:18 UTC · model grok-4.3

classification 💻 cs.SE
keywords log parsingdynamic routingLLM log analysisstatistical log parsinghybrid log parserlog analysis efficiencysemantic inferencecost reduction
0
0 comments X

The pith

CelerLog routes logs by statistical density so most avoid LLMs while accuracy holds.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes CelerLog, a log parser that first classifies incoming logs into dense groups showing strong statistical patterns and sparse groups that lack them. Dense logs are handled by a fast statistical processor while only sparse logs reach an LLM for semantic inference. This selective approach avoids running expensive LLM calls on the repetitive majority of logs. Experiments across 14 public datasets show the method leads in performance, runs 7.9x to 18.6x faster than full LLM parsers, and cuts token consumption by 80.2 to 94.1 percent. A sympathetic reader would care because log parsing supports automated system monitoring and current semantic methods remain too slow and costly at scale.

Core claim

CelerLog introduces a dynamic routing mechanism to classify logs into dense and sparse groups. Logs with strong statistical patterns are processed by an efficient statistical processor while sparse groups lacking such patterns are routed to an LLM. This hybrid strategy avoids unnecessary LLM invocations and achieves leading performance over state-of-the-art baselines on 14 public datasets, with speeds 7.9x to 18.6x faster than LLM methods and up to 1.5x faster than Drain, plus token reductions of 80.2-94.1 percent and LLM invocation reductions of 86.4-90.9 percent.

What carries the argument

The dynamic routing mechanism that partitions logs into dense (statistically patterned) and sparse groups for selective statistical or LLM processing.

If this is right

  • Most logs exhibit repetitive patterns that statistical methods can extract without semantic reasoning.
  • Only logs lacking statistical patterns require the slower, costlier LLM path.
  • Overall parsing latency and cost drop substantially while accuracy remains competitive with pure LLM or pure statistical baselines.
  • The same routing principle reduces both token consumption and the number of LLM calls by more than 80 percent on standard benchmarks.
  • The hybrid design outperforms both Drain and full LLM parsers on the 14 evaluated datasets.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the routing decision itself stays accurate across varied log sources, the same density-based split could apply to other tasks that mix patterned and novel data.
  • Lower LLM invocation rates could make high-volume log analysis practical on hardware with limited compute or budget.
  • Further tuning the classifier to reduce even small routing overhead would compound the observed speed gains.
  • The observed cost reductions suggest the method scales to production volumes where full LLM parsing is currently prohibitive.

Load-bearing premise

Incoming logs can be reliably and cheaply partitioned into dense and sparse groups such that classification overhead stays negligible and misrouting does not degrade end-to-end accuracy or cost savings.

What would settle it

A new log dataset on which the classifier routes a large fraction of logs to the wrong processor, producing either accuracy below current baselines or no measurable drop in LLM token use.

Figures

Figures reproduced from arXiv: 2605.26005 by Minxing Wang, Shiwen Shan, Yintong Huo, Yuxin Su, Zhiying Wu, Zibin Zheng.

Figure 1
Figure 1. Figure 1: The general paradigm of log parsing. effectiveness due to their lack of semantic awareness, struggling to handle complex log patterns accurately. (2) Semantic-based parsers, on the other hand, exploit semantic-aware models to distinguish dynamic parameters from static text. Recently, LLM-based parsers have emerged as a dominant solution in this category. By leverag￾ing the powerful natural language underst… view at source ↗
Figure 2
Figure 2. Figure 2: The parsing effectiveness-cost of log parsers. We [PITH_FULL_IMAGE:figures/full_fig_p002_2.png] view at source ↗
Figure 4
Figure 4. Figure 4: The overview of CelerLog. 2.2 Dense & Sparse Log Groups To address the above question, we investigate the statistical distri￾bution of log data. Our goal is to determine whether there exists a boundary that distinguishes logs suitable for syntax-based parsing from those requiring semantic inference. Inspiration & Observation. We hypothesize that if a log tem￾plate generates multiple unique messages with va… view at source ↗
Figure 5
Figure 5. Figure 5: An example of the anchor-based merging phase of the Router. [PITH_FULL_IMAGE:figures/full_fig_p004_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: An example of the Statistical Processor. [PITH_FULL_IMAGE:figures/full_fig_p005_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: The prompt of the LLM-based processor. as a list of exact variable strings. To ensure robustness, we pro￾cess the LLM’s output by verifying that the identified variables are indeed present in the original log message. We then mask these validated variable substrings with a placeholder (e.g., <*>) to gen￾erate the final template. If the LLM fails to identify any variables or produces an invalid format, we r… view at source ↗
Figure 8
Figure 8. Figure 8: Average LLM token consumption of LLM-based log parsers and CelerLog. CelerLog achieves the lowest token consumption. CelerLog LUNAR LogBatcher LILAC 0 50 100 150 200 250 300 350 400 # Avg. Invocation 33.786 370.905 283.810 248.690 [PITH_FULL_IMAGE:figures/full_fig_p007_8.png] view at source ↗
Figure 10
Figure 10. Figure 10: Performance and cost trade-offs of log parsers. [PITH_FULL_IMAGE:figures/full_fig_p008_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: The parsing results under different similarity [PITH_FULL_IMAGE:figures/full_fig_p009_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: The parsing results under different top- [PITH_FULL_IMAGE:figures/full_fig_p009_12.png] view at source ↗
read the original abstract

Log parsing is a fundamental step for automated log analysis, which transforms raw log messages into structured formats. Existing syntax-based parsers struggle with complex logs because they lack semantic reasoning ability. Emerging LLM-powered semantic parsers achieve high accuracy but suffer from prohibitive latency and token costs because they apply semantic inference across all logs. Our key observation is that not all logs necessitate complex semantic understanding: a vast majority of logs exhibit repetitive patterns that can be extracted via straightforward statistical analysis. Driven by this insight, we propose CelerLog, a fast and effective log parser. CelerLog introduces a dynamic routing mechanism to classify logs into dense and sparse groups. Logs with strong statistical patterns (dense groups) are processed by an efficient statistical processor, whereas the sparse groups lacking such patterns are routed to an LLM for semantic inference. This hybrid strategy avoids unnecessary LLM invocations. Extensive experiments on 14 public datasets show that CelerLog achieves leading performance over state-of-the-art baselines and is 7.9x to 18.6x faster than LLM methods and up to 1.5x faster than Drain. Additionally, it reduces costs by decreasing token consumption by 80.2% - 94.1% and LLM invocations by 86.4% - 90.9%.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper presents CelerLog, a hybrid log parser that uses a dynamic routing mechanism to send logs exhibiting strong statistical patterns (dense groups) to an efficient statistical processor and sparse logs lacking such patterns to an LLM for semantic inference. On 14 public datasets it claims leading performance over baselines, with speedups of 7.9x–18.6x versus LLM methods and up to 1.5x versus Drain, plus token-consumption reductions of 80.2%–94.1% and LLM-invocation reductions of 86.4%–90.9%.

Significance. If the routing step reliably partitions logs with negligible overhead and error, the hybrid design would materially improve the cost–accuracy trade-off for semantic log parsing, allowing LLM use to be restricted to a small minority of inputs while preserving overall accuracy.

major comments (2)
  1. [Abstract] Abstract: the headline speed and cost claims rest on the premise that the dynamic router correctly identifies the 'vast majority' of logs as dense and routes only true sparse cases to the LLM; yet the abstract supplies no classifier description (features, threshold, training), no routing-accuracy or misrouting-rate figures, and no ablation showing end-to-end accuracy or latency under routing mistakes. This is load-bearing: even modest misrouting would either erode the reported savings or degrade parsing quality.
  2. [Abstract] Abstract: only aggregate speedups and accuracy are stated across the 14 datasets; no per-dataset tables, error bars, or breakdowns of routing decisions are mentioned, preventing verification that the claimed gains are consistent rather than driven by a few favorable datasets.
minor comments (1)
  1. The abstract would benefit from a one-sentence description of the statistical processor employed for the dense group.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments on the abstract. We agree that the abstract would benefit from additional context on the routing mechanism and consistency of results. We will revise the abstract accordingly. Point-by-point responses are below.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the headline speed and cost claims rest on the premise that the dynamic router correctly identifies the 'vast majority' of logs as dense and routes only true sparse cases to the LLM; yet the abstract supplies no classifier description (features, threshold, training), no routing-accuracy or misrouting-rate figures, and no ablation showing end-to-end accuracy or latency under routing mistakes. This is load-bearing: even modest misrouting would either erode the reported savings or degrade parsing quality.

    Authors: We acknowledge that the provided abstract is concise and does not include a description of the classifier, routing accuracy metrics, or an ablation on misrouting effects. The full manuscript details the routing mechanism (features, threshold, and training) in Section 3. Routing accuracy and misrouting rates are reported in the experimental evaluation. We agree an explicit ablation on routing errors is warranted and will add one. We will revise the abstract to include a brief statement on routing performance. revision: yes

  2. Referee: [Abstract] Abstract: only aggregate speedups and accuracy are stated across the 14 datasets; no per-dataset tables, error bars, or breakdowns of routing decisions are mentioned, preventing verification that the claimed gains are consistent rather than driven by a few favorable datasets.

    Authors: We agree the abstract reports only aggregate figures. The manuscript contains per-dataset tables for accuracy, runtime, and cost metrics along with error bars and routing decision breakdowns in the experimental section. These show consistent gains. We will revise the abstract to note that improvements hold across all datasets. revision: yes

Circularity Check

0 steps flagged

No circularity; empirical system evaluated on external benchmarks

full rationale

The paper describes an engineering hybrid parser whose performance numbers (speedups, token reductions) are measured outcomes on 14 public datasets against external baselines. No equations, fitted parameters renamed as predictions, or self-citation chains appear in the supplied text. The routing mechanism is presented as a design choice whose accuracy is asserted via experiment rather than derived from prior self-work. This is the normal non-circular case for an applied systems paper.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract supplies no explicit free parameters, axioms, or invented entities; the routing threshold and group definitions are mentioned only conceptually.

pith-pipeline@v0.9.1-grok · 5769 in / 1020 out tokens · 25814 ms · 2026-06-29T20:18:33.516813+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

58 extracted references · 7 canonical work pages · 1 internal anchor

  1. [1]

    NLTK Project

    2001. NLTK Project. https://www.nltk.org/howto/wordnet.html. Oneline; Accessed: 2026-03-16

  2. [2]

    Industrial-Strength Natural Language Processing

    2016. Industrial-Strength Natural Language Processing. https://spacy.io/. Online; Accessed: 2026-03-16

  3. [3]

    OpenAI API

    2026. OpenAI API. https://openai.com/blog/openai-api. Online; Accessed: 2026-03-16

  4. [4]

    Shan Ali, Chaima Boufaied, Domenico Bianculli, Paula Branco, Lionel Briand, and Nathan Aschbacher. 2023. An Empirical Study on Log-based Anomaly Detection Using Machine Learning.arXiv preprint arXiv:2307.16714(2023)

  5. [5]

    O’Reilly Media, Inc

    Steven Bird, Ewan Klein, and Edward Loper. 2009.Natural language processing with Python: analyzing text with the natural language toolkit. " O’Reilly Media, Inc. "

  6. [6]

    Boyuan Chen, Jian Song, Peng Xu, Xing Hu, and Zhen Ming Jiang. 2018. An automated approach to estimating code coverage measures via execution logs. In Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering. 305–316

  7. [7]

    Zhuangbin Chen, Jinyang Liu, Wenwei Gu, Yuxin Su, and Michael R Lyu. 2021. Ex- perience report: Deep learning-based system log analysis for anomaly detection. arXiv preprint arXiv:2107.05908(2021)

  8. [8]

    Tianyu Cui, Ruowei Fu, Changchang Liu, Yuhe Ji, Wenwei Gu, Shenglin Zhang, Yongqian Sun, and Dan Pei. 2025. AetherLog: Log-based Root Cause Analysis by Integrating Large Language Models with Knowledge Graphs. In2025 IEEE 36th International Symposium on Software Reliability Engineering (ISSRE). IEEE, 49–60

  9. [9]

    Hetong Dai, Heng Li, Che-Shao Chen, Weiyi Shang, and Tse-Hsun Chen. 2020. Logram: Efficient Log Parsing Using 𝑛 n-Gram Dictionaries.IEEE Transactions on Software Engineering (TSE)48, 3 (2020), 879–892

  10. [10]

    Hetong Dai, Yiming Tang, Heng Li, and Weiyi Shang. 2023. PILAR: Studying and Mitigating the Influence of Configurations on Log Parsing. In2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE). IEEE, 818–829

  11. [11]

    Min Du, Feifei Li, Guineng Zheng, and Vivek Srikumar. 2017. Deeplog: Anomaly detection and diagnosis from system logs through deep learning. InProceedings of the 2017 ACM SIGSAC conference on computer and communications security. 1285–1298

  12. [12]

    Qiang Fu, Jian-Guang Lou, Yi Wang, and Jiang Li. 2009. Execution anomaly detection in distributed systems through unstructured log analysis. In2009 ninth IEEE international conference on data mining (ICDM). IEEE, 149–158

  13. [13]

    Hossein Hamooni, Biplob Debnath, Jianwu Xu, Hui Zhang, Guofei Jiang, and Abdullah Mueen. 2016. Logmine: Fast pattern recognition for log analytics. InProceedings of the 25th ACM International on Conference on Information and Knowledge Management (CIKM). 1573–1582

  14. [14]

    Minghua He, Tong Jia, Chiming Duan, Huaqian Cai, Ying Li, and Gang Huang

  15. [15]

    In2025 IEEE/ACM 47th International Conference on Soft- ware Engineering (ICSE)

    Weakly-supervised log-based anomaly detection with inexact labels via multi-instance learning. In2025 IEEE/ACM 47th International Conference on Soft- ware Engineering (ICSE). IEEE, 2918–2930

  16. [16]

    Pinjia He, Jieming Zhu, Shilin He, Jian Li, and Michael R Lyu. 2016. An evaluation study on log parsing and its use in log mining. In2016 46th annual IEEE/IFIP international conference on dependable systems and networks (DSN). IEEE, 654– 661

  17. [17]

    Pinjia He, Jieming Zhu, Zibin Zheng, and Michael R Lyu. 2017. Drain: An online log parsing approach with fixed depth tree. In2017 IEEE international conference on web services (ICWS). IEEE, 33–40

  18. [18]

    Junjie Huang, Zhihan Jiang, Zhuangbin Chen, and Michael Lyu. 2025. No more labelled examples? an unsupervised log parser with llms.Proceedings of the ACM on Software Engineering2, FSE (2025), 2406–2429

  19. [19]

    Yintong Huo, Yuxin Su, Cheryl Lee, and Michael R Lyu. 2023. Semparser: A semantic parser for log analytics. In2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE). IEEE, 881–893

  20. [20]

    Zhihan Jiang, Jinyang Liu, Zhuangbin Chen, Yichen Li, Junjie Huang, Yintong Huo, Pinjia He, Jiazhen Gu, and Michael R Lyu. 2024. Lilac: Log parsing using llms with adaptive parsing cache.Proceedings of the ACM on Software Engineering 1, FSE (2024), 137–160

  21. [21]

    Zhihan Jiang, Jinyang Liu, Junjie Huang, Yichen Li, Yintong Huo, Jiazhen Gu, Zhuangbin Chen, Jieming Zhu, and Michael R Lyu. 2024. A large-scale evaluation for log parsing techniques: How far are we?. InProceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis. 223–234

  22. [22]

    Zhihan Jiang, Jinyang Liu, Yichen Li, Haiyu Huang, Xiao He, Tieying Zhang, Jianjun Chen, Yi Li, Rui Shi, and Michael R Lyu. 2025. LogPilot: Intent-aware and Scalable Alert Diagnosis for Large-scale Online Service Systems.arXiv preprint arXiv:2509.25874(2025)

  23. [23]

    Zhen Ming Jiang, Ahmed E Hassan, Parminder Flora, and Gilbert Hamann. 2008. Abstracting execution logs to execution events for enterprise applications (short paper). In2008 The Eighth International Conference on Quality Software. IEEE, 181–186

  24. [24]

    Van-Hoang Le and Hongyu Zhang. 2021. Log-based anomaly detection with- out log parsing. In2021 36th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 492–504

  25. [25]

    Van-Hoang Le and Hongyu Zhang. 2023. Log Parsing with Prompt-based Few- shot Learning.arXiv preprint arXiv:2302.07435(2023)

  26. [26]

    Xiaoyun Li, Hongyu Zhang, Van-Hoang Le, and Pengfei Chen. 2024. Logshrink: Effective log compression by leveraging commonality and variability of log data. InProceedings of the 46th IEEE/ACM International Conference on Software Engineering. 1–12

  27. [27]

    Jinyang Liu, Jieming Zhu, Shilin He, Pinjia He, Zibin Zheng, and Michael R Lyu. 2019. Logzip: Extracting hidden structures via iterative clustering for log compression. In2019 34th IEEE/ACM International Conference on Automated Software Engineering (ASE). IEEE, 863–873

  28. [28]

    Yudong Liu, Xu Zhang, Shilin He, Hongyu Zhang, Liqun Li, Yu Kang, Yong Xu, Minghua Ma, Qingwei Lin, Yingnong Dang, et al. 2022. Uniparser: A unified log parser for heterogeneous log data. InProceedings of the ACM Web Conference 2022 (WWW). 1893–1901

  29. [29]

    Jian-Guang Lou, Qiang Fu, Shenqi Yang, Ye Xu, and Jiang Li. 2010. Mining invariants from console logs for system problem detection. In2010 USENIX annual technical conference (USENIX ATC 10)

  30. [30]

    Zeyang Ma, Dong Jae Kim, and Tse-Hsun Chen. 2024. Librelog: Accurate and efficient unsupervised log parsing using open-source large language models. arXiv preprint arXiv:2408.01585(2024)

  31. [31]

    Adetokunbo AO Makanju, A Nur Zincir-Heywood, and Evangelos E Milios. 2009. Clustering event logs using iterative partitioning. InProceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD). 1255–1264

  32. [32]

    Weibin Meng, Ying Liu, Yichen Zhu, Shenglin Zhang, Dan Pei, Yuqing Liu, Yihao Chen, Ruizhi Zhang, Shimin Tao, Pei Sun, et al. 2019. Loganomaly: Unsupervised detection of sequential and quantitative anomalies in unstructured logs.. InIjcai, Vol. 19. 4739–4745

  33. [33]

    Suphakit Niwattanakul, Jatsada Singthongchai, Ekkachai Naenudorn, and Su- pachanun Wanapu. 2013. Using of Jaccard coefficient for keywords similarity. InProceedings of the international multiconference of engineers and computer scientists, Vol. 1. 380–384

  34. [34]

    Paolo Notaro, Soroush Haeri, Jorge Cardoso, and Michael Gerndt. 2023. LogRule: Efficient Structured Log Mining for Root Cause Analysis.IEEE Transactions on Network and Service Management(2023)

  35. [35]

    David Ohana. 2020. A blog about Drain usage in IBM Cloud. https://developer.ibm.com/blogs/how-mining-log-templates-can-help-ai- ops-in-cloud-scale-data-centers/. Online; Accessed: 2026-03-16

  36. [36]

    OpenAI. [n. d.]. GPT-5.2. https://platform.openai.com/docs/models/gpt-5.2. One- line; Accessed: 2026-03-16

  37. [37]

    Daan Schipper, Maurício Aniche, and Arie van Deursen. 2019. Tracing back log data to its log statement: from research to practice. In2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR). IEEE, 545–549

  38. [38]

    Issam Sedki, Abdelwahab Hamou-Lhadj, Otmane Ait-Mohamed, and Mo- hammed A Shehab. 2022. An Effective Approach for Parsing Large Log Files. In2022 IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 1–12

  39. [39]

    Shiwen Shan, Yintong Huo, Yuxin Su, Yichen Li, Dan Li, and Zibin Zheng. 2024. Face it yourselves: An llm-based two-stage strategy to localize configuration errors via logs. InProceedings of the 33rd ACM SIGSOFT international symposium on software testing and analysis. 13–25

  40. [40]

    Keiichi Shima. 2016. Length matters: Clustering system log messages using length of words.arXiv preprint arXiv:1611.03213(2016)

  41. [41]

    Donghwan Shin, Zanis Ali Khan, Domenico Bianculli, and Lionel Briand. 2021. A theoretical framework for understanding the relationship between log pars- ing and anomaly detection. InInternational Conference on Runtime Verification. Springer, 277–287

  42. [42]

    Risto Vaarandi. 2003. A data clustering algorithm for mining patterns from event logs. InProceedings of the 3rd IEEE Workshop on IP Operations & Management (IPOM)(IEEE Cat. No. 03EX764). Ieee, 119–126

  43. [43]

    Risto Vaarandi and Mauno Pihelgas. 2015. Logcluster-a data clustering and pattern mining algorithm for event logs. In2015 11th International conference on network and service management (CNSM). IEEE, 1–7. Conference’17, July 2017, Washington, DC, USA Shiwen Shan ¶, Yintong Huo ‡, Minxing Wang‡, Zhiying Wu ¶, Yuxin Su ¶, Zibin Zheng ¶

  44. [44]

    Lingzhi Wang, Nengwen Zhao, Junjie Chen, Pinnong Li, Wenchi Zhang, and Kaixin Sui. 2020. Root-cause metric location for microservice systems via log anomaly detection. In2020 IEEE international conference on web services (ICWS). IEEE, 142–150

  45. [45]

    Teng Wang, Xiaodong Liu, Shanshan Li, Xiangke Liao, Wang Li, and Qing Liao

  46. [46]

    In2018 IEEE International Conference on Software Quality, Reliability and Security (QRS)

    MisconfDoctor: diagnosing misconfiguration via log-based configuration testing. In2018 IEEE International Conference on Software Quality, Reliability and Security (QRS). IEEE, 1–12

  47. [47]

    Thorsten Wittkopp, Philipp Wiesner, and Odej Kao. 2024. Logrca: Log-based root cause analysis for distributed services. InEuropean Conference on Parallel Processing. Springer, 362–376

  48. [48]

    Pei Xiao, Tong Jia, Chiming Duan, Minghua He, Weijie Hong, Xixuan Yang, Yihan Wu, Ying Li, and Gang Huang. 2025. Clslog: Collaborating large and small models for log-based anomaly detection. InProceedings of the 33rd ACM International Conference on the Foundations of Software Engineering. 686–690

  49. [49]

    Yi Xiao, Van-Hoang Le, and Hongyu Zhang. 2024. free: Towards more practical log parsing with large language models. InProceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering. 153–165

  50. [50]

    Weiyuan Xu, Juntao Luo, Tao Huang, Kaixin Sui, Jie Geng, Qijun Ma, Isami Akasaka, Xiaoxue Shi, Jing Tang, and Peng Cai. 2025. LogSage: An LLM-based framework for CI/CD failure detection and remediation with industrial validation. arXiv preprint arXiv:2506.03691(2025)

  51. [51]

    Siyu Yu, Pinjia He, Ningjiang Chen, and Yifan Wu. 2023. Brain: Log parsing with bidirectional parallel tree.IEEE Transactions on Services Computing16, 5 (2023), 3224–3237

  52. [52]

    Siyu Yu, Yifan Wu, Ying Li, and Pinjia He. 2024. Unlocking the power of numbers: Log compression via numeric token parsing. InProceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering. 919–930

  53. [53]

    Ding Yuan, Haohui Mai, Weiwei Xiong, Lin Tan, Yuanyuan Zhou, and Shankar Pasupathy. 2010. Sherlog: error diagnosis by connecting clues from run-time logs. InProceedings of the fifteenth International Conference on Architectural support for programming languages and operating systems. 143–154

  54. [54]

    Chenbo Zhang, Wenying Xu, Jinbu Liu, Lu Zhang, Guiyang Liu, Jihong Guan, Qi Zhou, and Shuigeng Zhou. 2025. SemanticLog: Towards Effective and Efficient Large-Scale Semantic Log Parsing.IEEE Transactions on Software Engineering (2025)

  55. [55]

    Xu Zhang, Yong Xu, Qingwei Lin, Bo Qiao, Hongyu Zhang, Yingnong Dang, Chunyu Xie, Xinsheng Yang, Qian Cheng, Ze Li, et al. 2019. Robust log-based anomaly detection on unstable log data. InProceedings of the 2019 27th ACM joint meeting on European software engineering conference and symposium on the foundations of software engineering. 807–817

  56. [56]

    Xiang Zhou, Xin Peng, Tao Xie, Jun Sun, Chao Ji, Dewei Liu, Qilin Xiang, and Chuan He. 2019. Latent error prediction and fault localization for microservice applications by learning from system trace logs. InProceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (...

  57. [57]

    Jieming Zhu, Shilin He, Pinjia He, Jinyang Liu, and Michael R Lyu. 2023. Loghub: A large collection of system log datasets for ai-driven log analytics. In2023 IEEE 34th International Symposium on Software Reliability Engineering (ISSRE). IEEE, 355–366

  58. [58]

    Jieming Zhu, Shilin He, Jinyang Liu, Pinjia He, Qi Xie, Zibin Zheng, and Michael R Lyu. 2019. Tools and benchmarks for automated log parsing. In2019 IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP). IEEE, 121–130