DATA MINING CLASSIFICATION ALGORITHMS FOR DIABETES DATASET USING WEKA TOOL

Rahma Fitria, Desvina Yulisda, Mutammimul Ula

Abstract


Data mining explores large volumes of data to extract meaningful information. In public health, data mining makes a crucial contribution to predicting disease at an early stage. To detect a disease, patients usually need to undergo various tests. In the context of disease prediction, data mining techniques aim to reduce the number of tests that patients need to undergo and to increase the accuracy of detection. Diabetes currently affects many adults worldwide, so an effective and efficient detection mechanism is needed to reduce the number of adults with the disease. This report applies several data mining techniques to a diabetes dataset downloaded from the UCI Machine Learning Repository. Three classification algorithms, the Naïve Bayes classifier, the Multilayer Perceptron (MLP), and the Decision Tree (J48), were applied to this dataset. The results indicate that the Naïve Bayes classifier achieved the highest accuracy, 76.30%. Consequently, this algorithm is a good method for classifying and diagnosing diabetes on the studied dataset.
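To make the classification step concrete, the following is a minimal sketch of a Gaussian Naïve Bayes classifier and an accuracy calculation in plain Python. It is not the WEKA implementation used in the study. The function names, the two features (glucose and BMI), and the toy rows are all hypothetical and chosen only for illustration; they are not taken from the UCI dataset and do not reproduce the reported results.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(rows, labels):
    """Estimate a prior and per-feature (mean, variance) for each class."""
    by_class = defaultdict(list)
    for row, label in zip(rows, labels):
        by_class[label].append(row)
    model = {}
    for label, members in by_class.items():
        n = len(members)
        stats = []
        for column in zip(*members):
            mean = sum(column) / n
            # A small constant keeps the variance strictly positive.
            var = sum((x - mean) ** 2 for x in column) / n + 1e-9
            stats.append((mean, var))
        model[label] = (n / len(rows), stats)
    return model


def predict(model, row):
    """Return the class with the highest log-posterior for one row."""
    best_label, best_score = None, -math.inf
    for label, (prior, stats) in model.items():
        score = math.log(prior)
        for x, (mean, var) in zip(row, stats):
            # Log of the normal density, summed over features
            # (the "naive" conditional-independence assumption).
            score += -0.5 * math.log(2 * math.pi * var) - (x - mean) ** 2 / (2 * var)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def accuracy(model, rows, labels):
    """Fraction of rows whose predicted class matches the true label."""
    correct = sum(predict(model, r) == y for r, y in zip(rows, labels))
    return correct / len(rows)


# Synthetic toy rows (glucose, BMI); NOT from the study's dataset.
train_X = [(85, 22.0), (90, 24.5), (95, 23.1), (160, 33.0), (170, 35.2), (155, 31.8)]
train_y = ["negative", "negative", "negative", "positive", "positive", "positive"]
test_X = [(88, 23.0), (165, 34.0)]
test_y = ["negative", "positive"]

model = fit_gaussian_nb(train_X, train_y)
print(accuracy(model, test_X, test_y))
```

On a real dataset the same accuracy measure would normally be computed on held-out data or with cross-validation, so that the score reflects performance on records the classifier has not seen.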







DOI: https://doi.org/10.29103/sisfo.v5i2.6236



Copyright (c) 2021 Rahma Fitria, Desvina Yulisda, Mutammimul Ula

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

 


 
 

Universitas Malikussaleh
 
