Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/31126
Title: Prediction of Diabetes Using Statistical and Machine Learning Modelling Techniques
Authors: Almutairi, E
Abbod, M
Hunaiti, Z
Keywords: machine learning;diabetes;regression;statistical metrices
Issue Date: 5-Mar-2025
Publisher: MDPI
Citation: Almutairi A., Abbod, M. and Hunaiti, Z. (2025) 'Prediction of Diabetes Using Statistical and Machine Learning Modelling Techniques', Algorithms, 18 (3), 145, pp. 1 - 21. doi: 10.3390/a18030145.
Abstract: Statistical and machine learning modelling techniques have been effectively used in the healthcare domain and the prediction of epidemiological chronic diseases such as diabetes, which is classified as an epidemic due to its high rates of global prevalence. These techniques are useful for the processes of description, prediction, and evaluation of various diseases, including diabetes. This paper models diabetes disease in Saudi Arabia using the most relevant risk factors, namely smoking, obesity, and physical inactivity for adults aged ≥25 years. The aim of this study is based on developing statistical and machine learning models for the purpose of studying the trends in incidence rates of diabetes over 15 years (1999–2013) and to obtain predictions for future levels of the disease up to 2025, to support health policy planning and resource allocation for controlling diabetes. Different models were developed, namely Multiple Linear Regression (MLR), Support Vector Regression (SVR), Bayesian Linear Regression (BLM), Adaptive Neuro-Fuzzy Inference model (ANFIS), and Artificial Neural Network (ANN). The performance of the developed models is evaluated using four statistical metrices: Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Mean Absolute Percentage Error (MAPE), and coefficient of determination R-squared. Based on the results, it can be observed that the overall performance for all proposed models was reasonably good; however, the best results were achieved by the ANFIS model with RMSE = 0.04 and R2 = 0.99 for men’s training data, and RMSE = 0.02 and R2 = 0.99 for women’s training data.
Description: Data Availability Statement: The data can be shared upon request.
URI: https://bura.brunel.ac.uk/handle/2438/31126
DOI: https://doi.org/10.3390/a18030145
Other Identifiers: ORCiD: Maysam Abbod https://orcid.org/0000-0002-8515-7933
ORCiD: Ziad Hunaiti https://orcid.org/0000-0002-7048-2469
Article number 145
Appears in Collections:Dept of Electronic and Electrical Engineering Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).3.15 MBAdobe PDFView/Open


Items in BURA are protected by copyright, with all rights reserved, unless otherwise indicated.