Please use this identifier to cite or link to this item: http://bura.brunel.ac.uk/handle/2438/26899
Title: Prediction Model of Type 2 Diabetes Mellitus for Oman Prediabetes Patients Using Artificial Neural Network and Six Machine Learning Classifiers
Authors: Al Sadi, K
Balachandran, W
Keywords: K-nearest neighbours (K-NN);support vector machine (SVM); naive Bayes (NB);decision tree;random forest (RF);linear discriminant analysis (LDA);artificial neural network (ANN);type 2 diabetes mellitus (T2DM);Pima Indian Diabetes (PID) dataset;machine learning (ML)
Issue Date: 11-Feb-2023
Publisher: MDPI
Citation: Al Sadi, K. and Balachandran, W. (2023) 'Prediction Model of Type 2 Diabetes Mellitus for Oman Prediabetes Patients Using Artificial Neural Network and Six Machine Learning Classifiers', Applied Sciences (Switzerland), 13 (4), 2344, pp. 1 - 22. doi: 10.3390/app13042344.
Abstract: Copyright © 2023 by the authors. The early diagnosis of type 2 diabetes mellitus (T2DM) will provide an early treatment intervention to control disease progression and minimise premature death. This paper presents artificial intelligence and machine learning prediction models for diagnosing T2DM in the Omani population more accurately and with less processing time using a specially created dataset. Six machine learning algorithms: K-nearest neighbours (K-NN), support vector machine (SVM), naive Bayes (NB), decision tree, random forest (RF), linear discriminant analysis (LDA), and artificial neural networks (ANN) were applied in MATLAB. All data used were clinical data collected manually from a prediabetes register and the Al Shifa health system of South Al Batinah Province in Oman. The results were compared with the most widely used Pima Indian Diabetes dataset. Eleven clinical features were taken into consideration for predicting T2DM. The random forest and decision tree models performed better than all the other algorithms, providing an accuracy of 98.38% for Oman data. When the same model and number of features were used, the accuracy obtained with the Oman dataset exceeded PID by 9.1%. The analysis showed that T2DM diagnosis efficiency increased with more features, which is of help in the case of many missing values.
Description: Data Availability Statement: The Espcialy created Omani Prediabetes dataset that support the findings of this study can be available from the corresponding author upon reasonable request. Pima Indian dataset can be found here: https://www.kaggle.com/kumargh/pimaindiansdiabetescsv (accessed on 15 November 2021).
URI: https://bura.brunel.ac.uk/handle/2438/26899
DOI: https://doi.org/10.3390/app13042344
Other Identifiers: ORCID iDs: Khoula Ali Al Sadi https://orcid.org/0000-0001-6077-4110; Wamadeva Balachandran https://orcid.org/0000-0002-4806-2257.
2344
Appears in Collections:Dept of Electronic and Electrical Engineering Research Papers

Files in This Item:
File Description SizeFormat 
FullText.pdfCopyright © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).7.61 MBAdobe PDFView/Open


This item is licensed under a Creative Commons License Creative Commons