Please use this identifier to cite or link to this item:
http://bura.brunel.ac.uk/handle/2438/30820
Title: | Domain Tailored Large Language Models for Log Mask Prediction in Cellular Network Diagnostics |
Authors: | Taheri, S Ihalage, A Mishra, P Coaker, S Muhammad, F Al-Raweshidy, H |
Keywords: | telecommunications;machine learning;LLM;log analysis;network diagnostics |
Issue Date: | 17-Feb-2025 |
Publisher: | Institute of Electrical and Electronics Engineers (IEEE) |
Citation: | Taheri, S. et al. (2025) 'Domain Tailored Large Language Models for Log Mask Prediction in Cellular Network Diagnostics', IEEE Transactions on Network and Service Management, 0 (early access), pp. 1 - 13. doi: 10.1109/tnsm.2025.3541384. |
Abstract: | Software logs generated by dedicated network testing hardware are often complex and bear minimal similarity to natural language, requiring the expertise of engineers to understand and capture defects recorded in these logs. This manual process is inefficient and expensive for both service providers and their clients. In this study, we demonstrate the transformative potential of Artificial Intelligence (AI), specifically through domain-tailoring of Large Language Models (LLMs) like RoBERTa, BigBird, and Flan-T5, to streamline the process of defect diagnostics. Particularly, we pre-train these models ground up on a real industrial telecommunications log corpus, and perform finetuning on a multi-label classification objective. This facilitates identifying a correct set of log points to be enabled for rapid detection of defects that arise during network testing. Despite encountering several challenges such as intricate text structures, heavily skewed label distribution, and inconsistencies in historical data labelling, our tailored LLMs achieve commendable performance on previously unseen defect cases, significantly reducing the turnaround times. This research not only serves as an exemplar for adapting LLMs in telecommunications industry for automated defect diagnostics, but also has wide implications for software log analysis across various industries. |
URI: | https://bura.brunel.ac.uk/handle/2438/30820 |
DOI: | https://doi.org/10.1109/tnsm.2025.3541384 |
Other Identifiers: | ORCiD: Achintha Ihalage https://orcid.org/0000-0002-4250-187X ORCiD: Hamed Al-Raweshidy https://orcid.org/0000-0002-3702-8192 |
Appears in Collections: | Dept of Electronic and Electrical Engineering Research Papers |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
FullText.pdf | Copyright © 2025 Institute of Electrical and Electronics Engineers (IEEE). Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works (https://journals.ieeeauthorcenter.ieee.org/become-an-ieee-journal-author/publishing-ethics/guidelines-and-policies/post-publication-policies/). | 6.89 MB | Adobe PDF | View/Open |
Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.