Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/33522
Title: Ontology-Enriched Abstract Topic Modelling for Biomedical Trend Discovery: A MeSH-based LDA Framework with BI Visual Analytics
Authors: Altarawneh, A
Swift, S
Tucker, A
Arzoky, M
Keywords: biomedical research trends;MeSH enrichment;abstract-based analysis;topic modelling;latent Dirichlet Allocation;semantic ontology;business intelligence;trend detection
Issue Date: 12-May-2026
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Citation: Altarawneh, A. et al. (2026) 'Ontology-Enriched Abstract Topic Modelling for Biomedical Trend Discovery: A MeSH-based LDA Framework with BI Visual Analytics', 2026 2nd International Conference on Computational Intelligence Approaches and Applications (ICCIAA), Amman, Jordan, 12–14 May, pp. 1–8. doi: 10.1109/icciaa68481.2026.11544024.
Abstract: The explosive growth of biomedical literature increasingly makes it difficult to identify incipient topics and long-term developments. Citation-based indicators are slow to capture developments and often fail to represent influence. Whilst Latent Dirichlet Allocation (LDA) supports content-based trend discovery, its bag-of-words assumption can overlook synonymy, polysemy, and hierarchical relations that are central to biomedical terminology. Moreover, recent work provides limited direct evidence on how far MeSH-enriched abstract representations improve LDA-based trend analysis when compared against the original abstract text. To fill this gap, an ontology-driven methodology for the pre-topic modelling of article abstracts by enriching them with hierarchical Medical Subject Headings (MeSH) terms is introduced in this study. We evaluate four representations, namely original abstracts, MeSH parent categories, MeSH-matched terms with parents, and a fully enriched representation, using a decade-long Scopus corpus of 624,486 biomedical journal articles (2015–2024). Model performance is assessed using coherence, distinctiveness, perplexity, and stability. The fully enriched representation achieves the strongest overall results (coherence 0.606, distinctiveness 0.996, stability 0.725, perplexity 1782.040) relative to the original abstract baseline. Temporal topic analysis further reveals interpretable patterns, including the COVID-19 surge and longer-term structural shifts such as the rise of machine-learning diagnostics. These findings indicate that MeSH-guided abstract enrichment can strengthen topic quality and support more effective biomedical trend discovery through business-intelligence visual analytics.
URI: https://bura.brunel.ac.uk/handle/2438/33522
DOI: https://doi.org/10.1109/icciaa68481.2026.11544024
ISBN: 979-8-3315-5659-4
979-8-3315-5660-0
Other Identifiers: ORCiD: Stephen Swift https://orcid.org/0000-0001-8918-3365
ORCiD: Allan Tucker https://orcid.org/0000-0001-5105-3506
ORCiD: Mahir Arzoky https://orcid.org/0000-0002-2721-643X
Appears in Collections:Department of Computer Science Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © 2026 Institute of Electrical and Electronics Engineers (IEEE). Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works ( https://journals.ieeeauthorcenter.ieee.org/become-an-ieee-journal-author/publishing-ethics/guidelines-and-policies/post-publication-policies/ ).2.35 MBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.