Health Data Analysis for In-Depth Understanding of Patterns, Prediction, and Disease Management: A Case Study on Diabetes Mellitus

Authors

  • Syaiful Bachri Mustamin Department of Information Technology, Faculty of Science Technology and Health, Institut Sains Teknologi dan Kesehatan (ISTEK) ‘Aisyiyah Kendari, Kendari, 93116, Southeast Sulawesi, Indonesia Author
  • Muhammad Atnang Department of Information Technology, Faculty of Science Technology and Health, Institut Sains Teknologi dan Kesehatan (ISTEK) ‘Aisyiyah Kendari, Kendari, 93116, Southeast Sulawesi, Indonesia Author
  • Sahriani Sahriani Department of Information Technology, Faculty of Science Technology and Health, Institut Sains Teknologi dan Kesehatan (ISTEK) ‘Aisyiyah Kendari, Kendari, 93116, Southeast Sulawesi, Indonesia Author
  • Baso Sulham Department of Computer Engineering, Faculty of Engineering and Computer Technology, Institut Teknologi dan Sains Muhammadiyah Kolaka, Kolaka Utara, 93911, Southeast Sulawesi, Indonesia Author
  • Samsidar Samsidar Department of Information Technology, Faculty of Science Technology and Health, Institut Sains Teknologi dan Kesehatan (ISTEK) ‘Aisyiyah Kendari, Kendari, 93116, Southeast Sulawesi, Indonesia Author

DOI:

https://doi.org/10.63441/ijsth.v2i1.39

Keywords:

Diabetes mellitus, health data analysis, complication risk prediction, personalized disease management

Abstract

The present study aims to address the intricate nature of diabetes mellitus by employing data analysis to gain profound insights into individual health patterns, predict risks of complications, and formulate personalized solutions for disease management. Data were sourced from diverse repositories, including the UCI Machine Learning Repository, Kaggle, and Data.gov, encompassing medical records, laboratory histories, and lifestyle data of diabetes patients. Preprocessing involved outlier detection, normalization, and handling data imbalances using the Synthetic Minority Over-sampling Technique (SMOTE). Principal Component Analysis (PCA) was utilized for feature extraction to facilitate a comprehensive understanding of health patterns. Predictive models, namely Random Forest, Support Vector Machine, and Neural Network, underwent rigorous training and validation. Concurrently, disease management solutions were crafted based on model recommendations. Research findings demonstrated commendable performance, particularly with the Neural Network model achieving an AUC-ROC of 0.92. This study's contribution is anticipated to usher in novel approaches in chronic disease management, particularly diabetes, by applying data science principles to enhance comprehension, prediction, and disease management, potentially elevating the quality of life for patients.

Downloads

Published

2025-06-28

Issue

Section

Articles

How to Cite

Health Data Analysis for In-Depth Understanding of Patterns, Prediction, and Disease Management: A Case Study on Diabetes Mellitus. (2025). International Journal of Science Technology and Health, 2(1), 30-39. https://doi.org/10.63441/ijsth.v2i1.39