Towards Explorative IRBL: Combining Semantic Retrieval with LLM-driven Iterative Code Exploration
Pith reviewed 2026-05-19 01:59 UTC · model grok-4.3
The pith
GenLoc identifies more buggy files by combining semantic retrieval with LLM iterative exploration.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
GenLoc is a new technique for bug localization that merges semantic retrieval with LLM-driven iterative code-exploration functions. These functions let the model analyze the codebase step by step to find the source files responsible for a given bug report. When tested on three benchmarks with large Java datasets and the Python SWE-bench Lite, GenLoc outperforms traditional, deep learning, and other LLM-based methods and succeeds on bugs that the others miss.
What carries the argument
The key mechanism is the set of LLM-driven code-exploration functions that enable iterative gathering and analysis of code context starting from semantically similar files.
If this is right
- More bugs get localized correctly even when vocabulary does not match the bug report directly.
- The approach works across different programming languages and project sizes.
- LLMs can be used effectively for code tasks without being overwhelmed by entire repositories or limited to fixed candidates.
- Developers benefit from fewer false positives in the list of suspected files.
Where Pith is reading between the lines
- This method points to a broader principle that LLMs perform better on complex software tasks when they can actively query the codebase rather than receive static inputs.
- Similar iterative exploration could be applied to tasks like automated program repair or vulnerability detection.
- Testing on even larger codebases or with different LLMs would help confirm the robustness of the gains.
Load-bearing premise
That the LLM can use the exploration functions to collect enough relevant context without overlooking important files or adding misleading information.
What would settle it
If experiments on the Java and SWE-bench Lite benchmarks show that GenLoc does not achieve higher performance metrics than the compared methods or does not localize any additional bugs, the claim of substantial outperformance would not hold.
Figures
read the original abstract
Information Retrieval-based Bug Localization (IRBL) aims to identify buggy source files for a given bug report. Traditional and deep learning-based IRBL techniques often suffer from vocabulary mismatch and dependence on project-specific metadata. In contrast, recent Large Language Model (LLM)-based approaches struggle to provide appropriate context to the model: they either restrict analysis to a fixed set of candidate files, overwhelm the model with repository-wide information, or rely on explicit bug report cues to guide context collection. To address these issues, we propose GenLoc, a technique that combines semantic retrieval with LLM-driven code-exploration functions to iteratively analyze the code base and identify buggy files. We evaluate GenLoc on three complementary benchmarks, including large-scale and recent Java datasets as well as the Python based SWE-bench Lite dataset. Results demonstrate that GenLoc substantially outperforms traditional IRBL, deep learning-based approaches and recent LLM-based methods, while also localizing bugs that other techniques fail to detect.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes GenLoc, an IRBL technique that combines semantic retrieval with LLM-driven iterative code-exploration functions to gather context and identify buggy files. It addresses limitations of traditional IRBL (vocabulary mismatch), DL-based methods (project-specific metadata), and prior LLM approaches (fixed candidates, repository overload, or cue dependence). Evaluation on three benchmarks (large-scale/recent Java datasets plus SWE-bench Lite) claims substantial outperformance over baselines and the ability to localize bugs missed by other techniques.
Significance. If the results are robust, the work offers a practical advance in LLM-assisted bug localization for large codebases by making context collection iterative and function-driven rather than static or exhaustive. This could improve recall on complex bugs where fixed retrieval fails, with direct relevance to software maintenance tools.
major comments (2)
- The central performance claims on SWE-bench Lite and the Java benchmarks rest on the assumption that the LLM-driven iterative exploration reliably retrieves sufficient relevant files without systematic under-retrieval or noise accumulation. The method description provides no coverage guarantees, backtracking, or post-exploration filtering, leaving open the possibility that reported gains are partly due to fortunate retrieval rather than the technique itself.
- Evaluation section: while the abstract states 'substantial outperformance,' the manuscript supplies no per-bug breakdown, statistical significance tests, or ablation isolating the iterative exploration component from the semantic retrieval baseline. This makes it difficult to verify that the exploration step is load-bearing for the cross-technique and cross-benchmark superiority claims.
minor comments (2)
- Abstract: the claim of localizing 'bugs that other techniques fail to detect' would be stronger with a brief quantitative note (e.g., number of unique bugs or recall@N delta) rather than a qualitative statement.
- Notation and figures: ensure that the exploration functions are given explicit pseudocode or a clear interface definition so readers can reproduce the iterative loop.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive review. We address the major comments point by point below, indicating where revisions will be made to the manuscript.
read point-by-point responses
-
Referee: The central performance claims on SWE-bench Lite and the Java benchmarks rest on the assumption that the LLM-driven iterative exploration reliably retrieves sufficient relevant files without systematic under-retrieval or noise accumulation. The method description provides no coverage guarantees, backtracking, or post-exploration filtering, leaving open the possibility that reported gains are partly due to fortunate retrieval rather than the technique itself.
Authors: We agree that formal coverage guarantees, backtracking, or post-exploration filtering are not described in the current manuscript. The iterative process is intended to allow the LLM to progressively gather context via function calls guided by initial semantic retrieval, which empirically reduces irrelevant exploration in our experiments. To address the concern directly, we will add a dedicated limitations subsection discussing retrieval reliability risks and include new empirical results on average files explored per bug and retrieval success rates in the revised manuscript. revision: partial
-
Referee: Evaluation section: while the abstract states 'substantial outperformance,' the manuscript supplies no per-bug breakdown, statistical significance tests, or ablation isolating the iterative exploration component from the semantic retrieval baseline. This makes it difficult to verify that the exploration step is load-bearing for the cross-technique and cross-benchmark superiority claims.
Authors: We acknowledge that these analyses are absent from the current evaluation section and would strengthen the claims. In the revised manuscript we will add statistical significance tests (McNemar’s test for top-k localization and Wilcoxon signed-rank for MRR/MAP differences) across all benchmarks. We will also report a new ablation comparing the full GenLoc pipeline against the semantic-retrieval-only baseline. A per-bug success breakdown will be provided in supplementary material. revision: yes
Circularity Check
No circularity: empirical claims rest on external benchmarks
full rationale
The paper proposes GenLoc as a hybrid IRBL technique and supports its performance claims through direct evaluation on three independent benchmarks (including SWE-bench Lite). No equations, fitted parameters, or self-referential derivations appear in the provided text. The central results are comparisons against external baselines rather than quantities defined by the authors' own prior outputs or ansatzes. Self-citations, if present, are not load-bearing for the reported gains. The derivation chain is therefore self-contained against external data.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
GenLoc combines semantic retrieval with LLM-driven code-exploration functions... ReAct framework... five external functions (search_file, search_method, get_candidate_filenames, ...)
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
embedding-based retrieval... cosine similarity... GPT-4o mini... Accuracy@1 44.01% on Ye et al. dataset
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Forward citations
Cited by 2 Pith papers
-
BLAgent: Agentic RAG for File-Level Bug Localization
BLAgent achieves over 78% Top-1 accuracy on SWE-bench Lite for file-level bug localization using agentic RAG, at 18x lower cost than baselines, and boosts end-to-end APR success by over 20%.
-
SkillLens: Adaptive Multi-Granularity Skill Reuse for Cost-Efficient LLM Agents
SkillLens organizes skills into policies-strategies-procedures-primitives layers, retrieves via degree-corrected random walk, and uses a verifier for local adaptation, yielding up to 6.31 pp gains on MuLocbench and ra...
Reference graph
Works this paper leans on
-
[1]
[n. d.]. AWS Machine Learning Blog. https://aws.amazon.com/blogs/machine-learning/amazon-bedrock-knowledge- bases-now-supports-advanced-parsing-chunking-and-query-reformulation-giving-greater-control-of-accuracy-in- rag-based-applications/. Accessed: 2025-05-25
work page 2025
-
[2]
[n. d.]. Chroma. https://www.trychroma.com/. Accessed: 2025-05-25
work page 2025
-
[3]
[n. d.]. GPT-4o mini. https://platform.openai.com/docs/models/gpt-4o-mini. Accessed: 2025-05-25
work page 2025
-
[4]
[n. d.]. text-embedding-3-small. https://platform.openai.com/docs/models/text-embedding-3-small. Accessed: 2025- 05-25
work page 2025
-
[5]
[n. d.]. Tree-sitter. https://github.com/tree-sitter/tree-sitter. Accessed: 2025-04-17
work page 2025
-
[6]
Bui Thi Mai Anh and Nguyen Viet Luyen. 2021. An imbalanced deep learning model for bug localization. InProceedings of the 28th Asia-Pacific Software Engineering Conference Workshops. IEEE, 32–40
work page 2021
-
[7]
John Anvik, Lyndon Hiew, and Gail C Murphy. 2006. Who should fix this bug?. InProceedings of the 28th International Conference on Software Engineering. 361–370
work page 2006
-
[8]
Nicolas Bettenburg, Sascha Just, Adrian Schröter, Cathrin Weiss, Rahul Premraj, and Thomas Zimmermann. 2008. What makes a good bug report?. InProceedings of the 16th International Symposium on Foundations of Software Engineering. 308–318
work page 2008
-
[9]
Lili Bo, Wangjie Ji, Xiaobing Sun, Ting Zhang, Xiaoxue Wu, and Ying Wei. 2024. ChatBR: Automated assessment and improvement of bug report quality using ChatGPT. InProceedings of the 39th International Conference on Automated Software Engineering. 1472–1483
work page 2024
-
[10]
Junming Cao, Shouliang Yang, Wenhui Jiang, Hushuang Zeng, Beijun Shen, and Hao Zhong. 2020. Bugpecker: Locating faulty methods with deep learning on revision graphs. InProceedings of the 35th International Conference on Automated Software Engineering. 1214–1218
work page 2020
-
[11]
Partha Chakraborty, Mahmoud Alfadel, and Meiyappan Nagappan. 2024. Rlocator: Reinforcement learning for bug localization.IEEE Transactions on Software Engineering(2024)
work page 2024
- [12]
-
[13]
Agnieszka Ciborowska and Kostadin Damevski. 2022. Fast changeset-based bug localization with bert. InProceedings of the 44th International Conference on Software Engineering. 946–957
work page 2022
-
[14]
Yali Du and Zhongxing Yu. 2023. Pre-training code representation with semantic flow graph for effective bug localization. InProceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 579–591
work page 2023
-
[15]
Mikołaj Fejzer, Jakub Narębski, Piotr Przymus, and Krzysztof Stencel. 2021. Tracking buggy files: New efficient adaptive bug localization algorithm.IEEE Transactions on Software Engineering48, 7 (2021), 2557–2569
work page 2021
-
[16]
Jiaxuan Han, Cheng Huang, Siqi Sun, Zhonglin Liu, and Jiayong Liu. 2023. bjXnet: an improved bug localization model based on code property graph and attention mechanism.Automated Software Engineering30, 1 (2023), 12
work page 2023
- [17]
-
[18]
Shahid Iqbal, Rashid Naseem, Salman Jan, Sami Alshmrany, Muhammad Yasar, and Arshad Ali. 2020. Determining bug prioritization using feature reduction and clustering with classification.IEEE Access8 (2020), 215661–215678
work page 2020
-
[19]
Sungmin Kang, Gabin An, and Shin Yoo. 2024. A quantitative and qualitative evaluation of LLM-based explainable fault localization.Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering (2024), 1424–1446
work page 2024
-
[20]
Sungmin Kang, Juyeon Yoon, and Shin Yoo. 2023. Large language models are few-shot testers: Exploring llm-based general bug reproduction. InProceedings of the 45th International Conference on Software Engineering. IEEE, 2312–2323
work page 2023
-
[21]
Tushar Khot, Harsh Trivedi, Matthew Finlayson, Yao Fu, Kyle Richardson, Peter Clark, and Ashish Sabharwal. 2022. Decomposed prompting: A modular approach for solving complex tasks.arXiv preprint arXiv:2210.02406(2022)
work page internal anchor Pith review arXiv 2022
-
[22]
Dongsun Kim, Yida Tao, Sunghun Kim, and Andreas Zeller. 2013. Where should we fix this bug? a two-phase recommendation model.IEEE Transactions on Software Engineering39, 11 (2013), 1597–1610
work page 2013
-
[23]
Misoo Kim and Eunseok Lee. 2019. A novel approach to automatic query reformulation for ir-based bug localization. InProceedings of the 34th ACM/SIGAPP Symposium on Applied Computing. 1752–1759
work page 2019
-
[24]
Misoo Kim and Eunseok Lee. 2021. Are datasets for information retrieval-based bug localization techniques trustworthy? Impact analysis of bug types on IRBL.Empirical Software Engineering26 (2021), 1–66. , Vol. 1, No. 1, Article . Publication date: October 2018. 20 Trovato et al
work page 2021
-
[25]
Pavneet Singh Kochhar, Xin Xia, David Lo, and Shanping Li. 2016. Practitioners’ expectations on automated fault localization. InProceedings of the 25th International Symposium on Software Testing and Analysis. 165–176
work page 2016
-
[26]
Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Ruiqi Sun, Xin Zhou, Enzhi Wang, and Xiaohang Dong
- [27]
-
[28]
An Ngoc Lam, Anh Tuan Nguyen, Hoan Anh Nguyen, and Tien N Nguyen. 2015. Combining deep learning with information retrieval to localize buggy files for bug reports. InProceedings of the 30th International Conference on Automated Software Engineering. IEEE, 476–481
work page 2015
-
[29]
An Ngoc Lam, Anh Tuan Nguyen, Hoan Anh Nguyen, and Tien N Nguyen. 2017. Bug localization with combination of deep learning and information retrieval. InProceedings of the 25th International Conference on Program Comprehension. IEEE, 218–229
work page 2017
-
[30]
Jaekwon Lee, Dongsun Kim, Tegawendé F Bissyandé, Woosung Jung, and Yves Le Traon. 2018. Bench4bl: reproducibility study on the performance of ir-based bug localization. InProceedings of the 27th International Symposium on Software Testing and Analysis. 61–72
work page 2018
-
[31]
Jae Yong Lee, Sungmin Kang, Juyeon Yoon, and Shin Yoo. 2024. The github recent bugs dataset for evaluating llm-based debugging applications. InProceedings of the International Conference on Software Testing, Verification and Validation. IEEE, 442–444
work page 2024
-
[32]
Yue Li, Bohan Liu, Ting Zhang, Zhiqi Wang, David Lo, Lanxin Yang, Jun Lyu, and He Zhang. 2025. A Knowledge Enhanced Large Language Model for Bug Localization.Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering(2025), 1914–1936
work page 2025
-
[33]
Zhengliang Li, Zhiwei Jiang, Qiguo Huang, and Qing Gu. 2025. LLM-BL: Large Language Models are Zero-Shot Rankers for Bug Localization. InProceedings of the 33rd International Conference on Program Comprehension. IEEE Computer Society, 548–559
work page 2025
-
[34]
Hongliang Liang, Dengji Hang, and Xiangyu Li. 2022. Modeling function-level interactions for file-level bug localization. Empirical Software Engineering27, 7 (2022), 186
work page 2022
-
[35]
Guangliang Liu, Yang Lu, Ke Shi, Jingfei Chang, and Xing Wei. 2019. Mapping bug reports to relevant source code files based on the vector space model and word embedding.IEEE Access7 (2019), 78870–78881
work page 2019
-
[36]
Nelson F Liu, Kevin Lin, John Hewitt, Ashwin Paranjape, Michele Bevilacqua, Fabio Petroni, and Percy Liang. 2023. Lost in the middle: How language models use long contexts.arXiv preprint arXiv:2307.03172(2023)
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[37]
Zheng Liu, Yujia Zhou, Yutao Zhu, Jianxun Lian, Chaozhuo Li, Zhicheng Dou, Defu Lian, and Jian-Yun Nie. 2024. Information retrieval meets large language models. InCompanion Proceedings of the ACM Web Conference 2024. 1586–1589
work page 2024
-
[38]
Stacy K Lukins, Nicholas A Kraft, and Letha H Etzkorn. 2008. Source code retrieval for bug localization using latent dirichlet allocation. InProceedings of the 15th Working Conference on Reverse Engineering. IEEE, 155–164
work page 2008
-
[39]
Stacy K Lukins, Nicholas A Kraft, and Letha H Etzkorn. 2010. Bug localization using latent dirichlet allocation. Information and Software Technology52, 9 (2010), 972–990
work page 2010
-
[40]
Zhengmao Luo, Wenyao Wang, and Caichun Cen. 2022. Improving bug localization with effective contrastive learning representation.IEEE Access11 (2022), 32523–32533
work page 2022
-
[41]
Yingwei Ma, Qingping Yang, Rongyu Cao, Binhua Li, Fei Huang, and Yongbin Li. 2025. Alibaba lingmaagent: Improving automated issue resolution via comprehensive repository exploration. InProceedings of the 33rd ACM International Conference on the Foundations of Software Engineering. 238–249
work page 2025
-
[42]
Patrick E McKnight and Julius Najab. 2010. Mann-whitney U test.The Corsini encyclopedia of psychology(2010), 1–1
work page 2010
-
[43]
Anh Tuan Nguyen, Tung Thanh Nguyen, Jafar Al-Kofahi, Hung Viet Nguyen, and Tien N Nguyen. 2011. A topic-based approach for narrowing the search space of buggy files from a bug report. InProceedings of the 26th International Conference on Automated Software Engineering. IEEE, 263–272
work page 2011
-
[44]
Suphakit Niwattanakul, Jatsada Singthongchai, Ekkachai Naenudorn, and Supachanun Wanapu. 2013. Using of Jaccard coefficient for keywords similarity. InProceedings of the International Multiconference of Engineers and Computer Scientists, Vol. 1. 380–384
work page 2013
- [45]
-
[46]
Michael Pradel, Vijayaraghavan Murali, Rebecca Qian, Mateusz Machalica, Erik Meijer, and Satish Chandra. 2020. Scaffle: Bug localization on millions of files. InProceedings of the 29th ACM SIGSOFT International Symposium on Software Testing and Analysis. 225–236
work page 2020
-
[47]
Binhang Qi, Hailong Sun, Wei Yuan, Hongyu Zhang, and Xiangxin Meng. 2021. Dreamloc: A deep relevance matching- based framework for bug localization.IEEE Transactions on Reliability71, 1 (2021), 235–249
work page 2021
-
[48]
Yihao Qin, Shangwen Wang, Yan Lei, Zhuo Zhang, Bo Lin, Xin Peng, Jun Ma, Liqian Chen, and Xiaoguang Mao. 2025. Fault Localization from the Semantic Code Search Perspective. (2025). doi:10.1145/3757915
-
[49]
Yihao Qin, Shangwen Wang, Yiling Lou, Jinhao Dong, Kaixin Wang, Xiaoling Li, and Xiaoguang Mao. 2025. Soap FL: A Standard Operating Procedure for LLM-based Method-Level Fault Localization.IEEE Transactions on Software , Vol. 1, No. 1, Article . Publication date: October 2018. Leveraging Large Language Model for Information Retrieval-based Bug Localization...
work page 2025
- [50]
-
[51]
Mohammad Masudur Rahman and Chanchal K Roy. 2018. Improving ir-based bug localization with context-aware query reformulation. InProceedings of the 26th ACM joint meeting on European software Engineering Conference and Symposium on the Foundations of Software Engineering. 621–632
work page 2018
-
[52]
Matthew Renze. 2024. The effect of sampling temperature on problem solving in large language models. InFindings of the Association for Computational Linguistics: EMNLP 2024. 7346–7356
work page 2024
-
[53]
Haifeng Ruan, Yuntong Zhang, and Abhik Roychoudhury. 2025. SpecRover: Code Intent Extraction via LLMs. (2025), 963–974. https://doi.org/10.1109/ICSE55347.2025.00080
-
[54]
Ripon K Saha, Matthew Lease, Sarfraz Khurshid, and Dewayne E Perry. 2013. Improving bug localization using structured information retrieval. InProceedings of the 28th International Conference on Automated Software Engineering. IEEE, 345–355
work page 2013
-
[55]
Asif Mohammed Samir and Mohammad Masudur Rahman. 2025. Improved IR-Based Bug Localization with Intelligent Relevance Feedback. InProceedings of the 33rd International Conference on Program Comprehension. IEEE, 560–571
work page 2025
-
[56]
Bunyamin Sisman and Avinash C Kak. 2012. Incorporating version histories in information retrieval based bug localization. InProceedings of the 9th IEEE Working Conference on Mining Software Repositories. IEEE, 50–59
work page 2012
-
[57]
Mozhan Soltani, Felienne Hermans, and Thomas Bäck. 2020. The significance of bug report elements.Empirical Software Engineering25, 6 (2020), 5255–5294
work page 2020
-
[58]
Harald Steck, Chaitanya Ekanadham, and Nathan Kallus. 2024. Is cosine-similarity of embeddings really about similarity?. InCompanion Proceedings of the ACM Web Conference. 887–890
work page 2024
-
[59]
Wei Tao, Yucheng Zhou, Yanlin Wang, Wenqiang Zhang, Hongyu Zhang, and Yu Cheng. 2024. Magis: Llm-based multi-agent framework for github issue resolution.Advances in Neural Information Processing Systems37 (2024), 51963–51993
work page 2024
-
[60]
Stephen W Thomas, Meiyappan Nagappan, Dorothea Blostein, and Ahmed E Hassan. 2013. The impact of classifier configuration and classifier combination on bug localization.IEEE Transactions on Software Engineering39, 10 (2013), 1427–1443
work page 2013
-
[61]
Yao Tian, Ziyang Yue, Ruiyuan Zhang, Xi Zhao, Bolong Zheng, and Xiaofang Zhou. 2023. Approximate Nearest Neighbor Search in High Dimensional Vector Databases: Current Research and Future Directions.IEEE Data Eng. Bull. 46, 3 (2023), 39–54
work page 2023
-
[62]
Bei Wang, Ling Xu, Meng Yan, Chao Liu, and Ling Liu. 2020. Multi-dimension convolutional neural network for bug localization.IEEE Transactions on Services Computing15, 3 (2020), 1649–1663
work page 2020
-
[63]
Shaowei Wang and David Lo. 2014. Version history, similar report, and structure: Putting them together for improved bug localization. InProceedings of the 22nd International Conference on Program Comprehension. 53–63
work page 2014
-
[64]
Ming Wen, Rongxin Wu, and Shing-Chi Cheung. 2016. Locus: Locating bugs from software changes. InProceedings of the 31st International Conference on Automated Software Engineering. 262–273
work page 2016
-
[65]
Ratnadira Widyasari, Jia Wei Ang, Truong Giang Nguyen, Neil Sharma, and David Lo. 2024. Demystifying faulty code: Step-by-step reasoning for explainable fault localization. (2024), 568–579
work page 2024
-
[66]
Ratnadira Widyasari, Stefanus Agus Haryono, Ferdian Thung, Jieke Shi, Constance Tan, Fiona Wee, Jack Phan, and David Lo. 2022. On the influence of biases in bug localization: Evaluation and benchmark. InProceedings of the International Conference on Software Analysis, Evolution and Reengineering. IEEE, 128–139
work page 2022
-
[67]
Chu-Pan Wong, Yingfei Xiong, Hongyu Zhang, Dan Hao, Lu Zhang, and Hong Mei. 2014. Boosting bug-report-oriented fault localization with segmentation and stack-trace analysis. InProceedings of the International Conference on Software Maintenance and Evolution. IEEE, 181–190
work page 2014
-
[68]
Eric Wong, Ruizhi Gao, Yihao Li, Rui Abreu, and Franz Wotawa
W. Eric Wong, Ruizhi Gao, Yihao Li, Rui Abreu, and Franz Wotawa. 2016. A Survey on Software Fault Localization. IEEE Transactions on Software Engineering42, 8 (2016), 707–740
work page 2016
-
[69]
2023.Handbook of software fault localization: foundations and advances
W Eric Wong and TH Tse. 2023.Handbook of software fault localization: foundations and advances. John Wiley & Sons
work page 2023
- [70]
- [71]
-
[72]
Chunqiu Steven Xia, Yinlin Deng, Soren Dunn, and Lingming Zhang. 2025. Demystifying LLM-Based Software Engineering Agents.Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering (2025), 801–824
work page 2025
-
[73]
Yan Xiao, Jacky Keung, Kwabena E Bennin, and Qing Mi. 2019. Improving bug localization with word embedding and enhanced convolutional neural networks.Information and Software Technology105 (2019), 17–29. , Vol. 1, No. 1, Article . Publication date: October 2018. 22 Trovato et al
work page 2019
-
[74]
Chuyang Xu, Zhongxin Liu, Xiaoxue Ren, Gehao Zhang, Ming Liang, and David Lo. 2025. FlexFL: Flexible and Effective Fault Localization with Open-Source Large Language Models.IEEE Transactions on Software Engineering(2025)
work page 2025
-
[75]
Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, and Yuan Cao. 2023. React: Synergizing reasoning and acting in language models. InInternational Conference on Learning Representations
work page 2023
-
[76]
Xin Ye, Razvan Bunescu, and Chang Liu. 2014. Learning to rank relevant files for bug reports using domain knowledge. InProceedings of the 22nd ACM SIGSOFT International Symposium on Foundations of Software Engineering. 689–699
work page 2014
-
[77]
Xin Ye, Razvan Bunescu, and Chang Liu. 2015. Mapping bug reports to relevant files: A ranking model, a fine-grained benchmark, and feature evaluation.IEEE Transactions on Software Engineering42, 4 (2015), 379–402
work page 2015
-
[78]
Xin Ye, Hui Shen, Xiao Ma, Razvan Bunescu, and Chang Liu. 2016. From word embeddings to document similarities for improved information retrieval in software engineering. InProceedings of the 38th International Conference on Software Engineering. 404–415
work page 2016
-
[79]
Klaus Changsun Youm, June Ahn, and Eunseok Lee. 2017. Improved bug localization based on code change histories and bug reports.Information and Software Technology82 (2017), 177–192
work page 2017
-
[80]
Abubakar Zakari, Sai Peck Lee, Rui Abreu, Babiker Hussien Ahmed, and Rasheed Abubakar Rasheed. 2020. Multiple fault localization of software programs: A systematic literature review.Information and Software Technology124 (2020), 106312
work page 2020
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.