Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/30820| Title: | Domain Tailored Large Language Models for Log Mask Prediction in Cellular Network Diagnostics |
| Authors: | Taheri, S Ihalage, A Mishra, P Coaker, S Muhammad, F Al-Raweshidy, H |
| Keywords: | telecommunications;machine learning;LLM;log analysis;network diagnostics |
| Issue Date: | 17-Feb-2025 |
| Publisher: | Institute of Electrical and Electronics Engineers (IEEE) |
| Citation: | Taheri, S. et al. (2025) 'Domain Tailored Large Language Models for Log Mask Prediction in Cellular Network Diagnostics', IEEE Transactions on Network and Service Management, 22 (3), pp. 2370 - 2381. doi: 10.1109/tnsm.2025.3541384. |
| Abstract: | Software logs generated by dedicated network testing hardware are often complex and bear minimal similarity to natural language, requiring the expertise of engineers to understand and capture defects recorded in these logs. This manual process is inefficient and expensive for both service providers and their clients. In this study, we demonstrate the transformative potential of Artificial Intelligence (AI), specifically through domain-tailoring of Large Language Models (LLMs) like RoBERTa, BigBird, and Flan-T5, to streamline the process of defect diagnostics. Particularly, we pre-train these models ground up on a real industrial telecommunications log corpus, and perform finetuning on a multi-label classification objective. This facilitates identifying a correct set of log points to be enabled for rapid detection of defects that arise during network testing. Despite encountering several challenges such as intricate text structures, heavily skewed label distribution, and inconsistencies in historical data labelling, our tailored LLMs achieve commendable performance on previously unseen defect cases, significantly reducing the turnaround times. This research not only serves as an exemplar for adapting LLMs in telecommunications industry for automated defect diagnostics, but also has wide implications for software log analysis across various industries. |
| URI: | https://bura.brunel.ac.uk/handle/2438/30820 |
| DOI: | https://doi.org/10.1109/tnsm.2025.3541384 |
| Other Identifiers: | ORCiD: Achintha Ihalage https://orcid.org/0000-0002-4250-187X ORCiD: Hamed Al-Raweshidy https://orcid.org/0000-0002-3702-8192 |
| Appears in Collections: | Dept of Electronic and Electrical Engineering Research Papers |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| FullText.pdf | For the purpose of open access, the author has applied a Creative Commons Attribution (CC BY) licence to any Author Accepted Manuscript version arising. | 7.04 MB | Adobe PDF | View/Open |
This item is licensed under a Creative Commons License