Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/31126
Full metadata record
DC FieldValueLanguage
dc.contributor.authorAlmutairi, E-
dc.contributor.authorAbbod, M-
dc.contributor.authorHunaiti, Z-
dc.date.accessioned2025-05-03T10:04:03Z-
dc.date.available2025-05-03T10:04:03Z-
dc.date.issued2025-03-05-
dc.identifierORCiD: Maysam Abbod https://orcid.org/0000-0002-8515-7933-
dc.identifierORCiD: Ziad Hunaiti https://orcid.org/0000-0002-7048-2469-
dc.identifierArticle number 145-
dc.identifier.citationAlmutairi A., Abbod, M. and Hunaiti, Z. (2025) 'Prediction of Diabetes Using Statistical and Machine Learning Modelling Techniques', Algorithms, 18 (3), 145, pp. 1 - 21. doi: 10.3390/a18030145.en_US
dc.identifier.urihttps://bura.brunel.ac.uk/handle/2438/31126-
dc.descriptionData Availability Statement: The data can be shared upon request.en_US
dc.description.abstractStatistical and machine learning modelling techniques have been effectively used in the healthcare domain and the prediction of epidemiological chronic diseases such as diabetes, which is classified as an epidemic due to its high rates of global prevalence. These techniques are useful for the processes of description, prediction, and evaluation of various diseases, including diabetes. This paper models diabetes disease in Saudi Arabia using the most relevant risk factors, namely smoking, obesity, and physical inactivity for adults aged ≥25 years. The aim of this study is based on developing statistical and machine learning models for the purpose of studying the trends in incidence rates of diabetes over 15 years (1999–2013) and to obtain predictions for future levels of the disease up to 2025, to support health policy planning and resource allocation for controlling diabetes. Different models were developed, namely Multiple Linear Regression (MLR), Support Vector Regression (SVR), Bayesian Linear Regression (BLM), Adaptive Neuro-Fuzzy Inference model (ANFIS), and Artificial Neural Network (ANN). The performance of the developed models is evaluated using four statistical metrices: Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Mean Absolute Percentage Error (MAPE), and coefficient of determination R-squared. Based on the results, it can be observed that the overall performance for all proposed models was reasonably good; however, the best results were achieved by the ANFIS model with RMSE = 0.04 and R2 = 0.99 for men’s training data, and RMSE = 0.02 and R2 = 0.99 for women’s training data.en_US
dc.description.sponsorshipThis research received no external funding.en_US
dc.format.extent1 - 21-
dc.format.mediumElectronic-
dc.languageEnglish-
dc.language.isoen_USen_US
dc.publisherMDPIen_US
dc.relation.urihttps://creativecommons.org/licenses/by/4.0/-
dc.rightsAttribution 4.0 International-
dc.subjectmachine learningen_US
dc.subjectdiabetesen_US
dc.subjectregressionen_US
dc.subjectstatistical metricesen_US
dc.titlePrediction of Diabetes Using Statistical and Machine Learning Modelling Techniquesen_US
dc.typeArticleen_US
dc.date.dateAccepted2025-02-26-
dc.identifier.doihttps://doi.org/10.3390/a18030145-
dc.relation.isPartOfAlgorithms-
pubs.issue3-
pubs.publication-statusPublished online-
pubs.volume18-
dc.identifier.eissn1999-4893-
dc.rights.licensehttps://creativecommons.org/licenses/by/4.0/legalcode.en-
dcterms.dateAccepted2025-02-26-
dc.rights.holderThe authors-
Appears in Collections:Dept of Electronic and Electrical Engineering Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).3.15 MBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.