You are required to read and agree to the below before accessing a full-text version of an article in the IDE article repository.
The full-text document you are about to access is subject to national and international copyright laws. In most cases (but not necessarily all) the consequence is that personal use is allowed given that the copyright owner is duly acknowledged and respected. All other use (typically) require an explicit permission (often in writing) by the copyright owner.
For the reports in this repository we specifically note that
- the use of articles under IEEE copyright is governed by the IEEE copyright policy (available at http://www.ieee.org/web/publications/rights/copyrightpolicy.html)
- the use of articles under ACM copyright is governed by the ACM copyright policy (available at http://www.acm.org/pubs/copyright_policy/)
- technical reports and other articles issued by M‰lardalen University is free for personal use. For other use, the explicit consent of the authors is required
- in other cases, please contact the copyright owner for detailed information
By accepting I agree to acknowledge and respect the rights of the copyright owner of the document I am about to access.
If you are in doubt, feel free to contact webmaster@ide.mdh.se
Bug-Report–Driven Fault Localization: Industrial Benchmarking and Lessons Learned at ABB Robotics
Publication Type:
Conference/Workshop Paper
Venue:
International Conference on Evaluation and Assessment in Software Engineering 2026
Abstract
Software quality assurance remains a major challenge in industrial environments, where large-scale and long-lived systems inevitably accumulate defects. Identifying the location of a fault is often time-consuming and costly, particularly during maintenance phases when developers must rely primarily on textual bug reports rather than complete runtime or code-level context. In this study, we investigated whether artificial intelligence can support fault localization using only the natural-language content of bug reports. By relying exclusively on textual information, our approach requires no access to source code, execution traces, or static analysis artifacts, making it directly deployable within existing industrial maintenance workflows.We framed fault localization as a supervised text classification problem and evaluated three traditional machine learning models (Logistic Regression, Support Vector Machine, and Random Forest) and two fine-tuned transformer-based language models (RoBERTa-Base and Distil-RoBERTa). Our evaluation used proprietary data from ABB Robotics in Västerås, Sweden, comprising approximately five years of resolved industrial bug reports, each linked to its verified code fix. This setting allowed us to assess model effectiveness under realistic industrial constraints.Our results showed that traditional models using term frequency– inverse document frequency features consistently outperformed the fine-tuned language models on this dataset, while data augmentation substantially improved Random Forest performance. These findings challenge the assumption that transformer-based models universally outperform classical approaches in industrial contexts with domain-specific data. Overall, we demonstrated that historical bug reports can be systematically leveraged for text-based, artificial intelligence-assisted fault localization, providing a scalable, low-cost, and empirically grounded complement to traditional debugging practices in industry.
Bibtex
@inproceedings{Hall7386,
author = {Pernilla Hall and Anton Ununger and Riccardo Rubei and Alessio Bucaioni},
title = {Bug-Report–Driven Fault Localization: Industrial Benchmarking and Lessons Learned at ABB Robotics},
month = {June},
year = {2026},
booktitle = {International Conference on Evaluation and Assessment in Software Engineering 2026},
url = {http://www.es.mdu.se/publications/7386-}
}