DATA MINING CLASSIFICATION ALGORITHMS FOR DIABETES DATASET USING WEKA TOOL
Abstract
Data mining explores a huge amount of data to extract the information to be meaningful. In the field of public health, data mining hold a crucial contribution in predicting disease in early stage. In order to detect diseases, the patients need to conduct various tests. In the context of disease predicion, Data mining techniques aims to reduce the test that patients need to accomplish. Also the techniques is used to increase the accuracy rate of detection. Nowadays, diabetes attacks many adults in the world. Moreover, in order to reduce the number of adult having diabetes, an effective and efficient diabetes detection mechanism should be found. This report will apply some data mining techniques on diabetes dataset that has been downloaded at UCI Machine Learning Repository.Three kind of classification algorithm such as Naïve Bayes Classifier, Multilayer Perceptrons (MLP’s) and Desicion Tree (J.48) have been performed on this dataset. Obtained outcomes indicated that Naïve Bayes Classifier achieved the highest accuracy with 76,30%. As the result, this algorithm is a good method to classify and diagnose diabetes diseases on studying dataset.
Full Text:
PDFReferences
Alpan, K., & Ilgi, G. S. (2020). Classification of Diabetes Dataset with Data Mining Techniques by Using WEKA Approach. 4th International Symposium on Multidisciplinary Studies and Innovative Technologies, ISMSIT 2020 - Proceedings. https://doi.org/10.1109/ISMSIT50672.2020.9254720
American Diabetes Association. (2005). Diabetes Mellitus and Other Categories of Description of Diabetes. World Health, 28(Suppl 1), 224102. https://doi.org/10.2337/diacare.27.2007.S5
Centers for Disease Control and Prevention. (2017). Diabetes and Prediabetes and improve the health of all people with diabetes . Retrieved from www.cdc.gov/chronicdisease
Chaves, L., & Marques, G. (2021). applied sciences Data Mining Techniques for Early Diagnosis of Diabetes. Appl. Sci., 11(2218), 1–12. https://doi.org/doi.org/10.3390/app11052218
Flach, P. A. (2004). Naive Bayesian Classification of Structured Data. Machine Learning, 57(1), 233–269.
Han, J., Kamber, M., & Pei, J. (2012). Data Mining Concepts and Techniques (Third Edit). Waltham: Morgan Kaufmann.
Hasdyna, N., & Dinata, R. K. (2020). Analisis Matthew Correlation Coefficient pada K-Nearest Neighbor dalam Klasifikasi Ikan Hias. INFORMAL: Informatics Journal, 5(2), 57-64.
Iyer, A., Jeyalatha, S., & Sumbaly, R. (2015). Diagnosis of Diabetes Using Classification Mining Techniques. International Journal of Data Mining & Knowledge Management Process (IJDKP), 5(1), 1–14.
Kumar, V., Mishra, B. K., Mazzara, M., Thanhx, D. N. H., & Verma, A. (2019). Prediction of malignant & benign breast cancer: A data mining approach in healthcare applications. ArXiv, 1–8.
Rahman, R. M., & Afroz, F. (2013). Comparison of Various Classification Techniques Using Different Data Mining Tools for Diabetes Diagnosis. Journal of Software Engineering and Applications, 2013(March), 85–97.
Shuja, M., Mittal, S., & Zaman, M. (2020). Effective Prediction of Type II Diabetes Mellitus Using Data Mining Classifiers and SMOTE. In Advances in Computing and Intelligent Systems (pp. 195–211). https://doi.org/10.1007/978-981-15-0222-4_17
Thirumal, P. C., & Nagarajan, N. (2015). Utilization Of Data Mining Techniques For Diagnosis Of Diabetes Mellitus - A Case Study. ARPN Journal of Engineering and Applied Sciences, 10(1), 8–13.
Ula, M., Ulva, A. F., & Mauliza, M. (2021). Implementasi Machine Learning Dengan Model Case Based Reasoning Dalam Mendiagnosa Gizi Buruk Pada Anak”. Jurnal Informatika Kaputama (JIK), 5(2), 333-339.
DOI: https://doi.org/10.29103/sisfo.v5i2.6236
Article Metrics
Abstract Views : 885 timesPDF Downloaded : 258 times
Refbacks
- There are currently no refbacks.
Copyright (c) 2021 Rahma Fitria, Desvina Yulisda, Mutammimul Ula
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Universitas Malikussaleh |
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.