Design an Algorithm based on Data Mining to Predict Diabetes

Rafiei, Navid

Volume 23, Issue 1 (5-2023) Iran J Diabetes Metab. 2023, 23(1): 53-67 | Back to browse issues page

Mendeley

Zotero

RefWorks

Rafiei N. Design an Algorithm based on Data Mining to Predict Diabetes. Iran J Diabetes Metab. 2023; 23 (1) :53-67
URL: http://ijdld.tums.ac.ir/article-1-6202-en.html

Design an Algorithm based on Data Mining to Predict Diabetes

Navid Rafiei ^*

Department of Industrial Engineering, Bandar Abbas Branch, Islamic Azad University, Bandar Abbas, Iran , n.rafiei@iau-tnb.ac.ir

Abstract: (2261 Views)

Background: Diabetes entails a great quantity of deaths each year and a great quantity of people living with the disease do not find out their health status early sufficient. In this paper, we advance a data mining-based model for prematurely diagnosis and prediction of diabetes.
Methods: Although K-means is simple and can be utilized for a vast diversity of data kinds, it is wholly sensitive to initial locations of cluster centers which specify the final cluster result, which either enables an efficiently and adequate clustered dataset for the logistic regression model, or presents a lesser amount of data as a result of wrong clustering of the main dataset, thereby restricting the proficiency of the logistic regression model. The main purpose of this study is was to specify procedures of ameliorating the k-means clustering and logistic regression accuracy consequence. Therefore, our algorithm comprises of principal component analysis technique, k-means technique and logistic regression model.
Results: The results obtained from this study show that the ability to obtain the result of K-means clustering accuracy is much higher than what other researchers have obtained in similar studies. Also, compared to the results obtained from other algorithms, the logistic regression model was implemented at an improved level in predicting the onset of diabetes. Another real advantage is that the proposed algorithm was able to successfully model a new dataset.
Conclusion: In general, the proposed approach can be effectively used in predicting and early diagnosis of diabetes.

Keywords: Diabetes, Prediction, Principal component analysis, K-means, Logistic regression

Full-Text [PDF 983 kb] (897 Downloads)

Type of Study: Applicable | Subject: General