Analisis Kinerja Random Forest untuk Prediksi Penyakit Hipertensi

Authors

  • Nabila Christian Putri Hermawan Universitas Pendidikan Nasional
  • Adie Wahyudi Oktavia Gama Universitas Pendidikan Nasional

DOI:

https://doi.org/10.55606/juisik.v6i1.2144

Keywords:

Classification, Disease Prediction, Hypertension, Machine Learning, Random Forest

Abstract

Hypertension is one of the major cardiovascular diseases that contributes significantly to global mortality and disability rates and is widely recognized as a silent killer due to its frequent absence of early symptoms. The complexity of hypertension risk factors including demographic, clinical, anthropometric, lifestyle, and medical history variables necessitates a machine learning based predictive approach capable of producing accurate and consistent classifications to support early disease detection. This study develops a Random Forest model to predict hypertension risk using the Hypertension Risk Prediction dataset obtained from Kaggle. Data processing was conducted systematically through data cleaning, handling of missing values, outlier treatment using the IQR-based capping method, exploratory data analysis, feature transformation through appropriate encoding techniques, stratified train test data splitting, and numerical feature scaling. The model was subsequently evaluated using accuracy, precision, recall, F1-score, and confusion matrix metrics on the test dataset. The evaluation results indicate that the Random Forest model achieved high classification performance, with an accuracy rate of 95.47%, accompanied by balanced performance across classes. These findings suggest that Random Forest has strong potential as an interpretable and effective hypertension prediction model to support early detection efforts and the prevention of cardiovascular complications.

References

Banerjee, S., Dunn, P., Conard, S., & Ng, R. (2023). Large language modeling and classical AI methods for the future of healthcare. Journal of Medicine, Surgery, and Public Health, 1, 100026. https://doi.org/10.1016/j.glmedi.2023.100026

Cheraghi, Z., Azmi-Naei, B., Cheraghi, P., & Doosti-Irani, A. (2025). The global prevalence of uncontrolled hypertension: A systematic review and meta-analysis. BMC Public Health, 25(1), 4259. https://doi.org/10.1186/s12889-025-25553-4

Chowdhury, M. Z. I., Naeem, I., Quan, H., Leung, A. A., Sikdar, K. C., O’Beirne, M., & Turin, T. C. (2022). Prediction of hypertension using traditional regression and machine learning models: A systematic review and meta-analysis. PLOS ONE, 17(4), e0266334. https://doi.org/10.1371/journal.pone.0266334

Danang, D., Haryani, H., Aini, Q., Ramahdan, F. A., & Edwards, J. (2025). Empowering digital literacy through blockchain-based Alphasign for secure and sustainable e-governance.

Danang, D., Prasetya, F. A., & Siswanto, E. (2026). Design of intelligent street lighting systems based on motion and ambient light sensors. Journal of Multidisciplinary Research and Technology, 2(1), 65–79.

Effati, S., Kamarzardi-Torghabe, A., Azizi-Froutaghe, F., Atighi, I., & Ghiasi-Hafez, S. (2024). Web application using machine learning to predict cardiovascular disease and hypertension in mine workers. Scientific Reports, 14(1), 31662. https://doi.org/10.1038/s41598-024-80919-9

Eini, P., Rezayee, M., Kassulke, M., & Tremblay, J. (2026). Efficacy and comparative performance of machine learning models for stroke risk prediction in hypertensive patients: A systematic review and meta-analysis. International Journal of Cardiology: Cardiovascular Risk and Prevention, 28, 200564. https://doi.org/10.1016/j.ijcrp.2025.200564

Eldem, A. (2025). A new hybrid learning model for early diagnosis of hypertension using IoMT technologies. Ain Shams Engineering Journal, 16(8), 103490. https://doi.org/10.1016/j.asej.2025.103490

Engda, A. A., Salau, A. O., & Ajala, O. (2025). Classical machine learning approaches for early hypertension risk prediction: A systematic review. Applied AI Letters, 6(3), e70005. https://doi.org/10.1002/ail2.70005

Guo, S., Ge, J.-X., Liu, S.-N., Zhou, J.-Y., Li, C., Chen, H.-J., Chen, L., Shen, Y.-Q., & Zhou, Q.-L. (2023). Development of a convenient and effective hypertension risk prediction model and exploration of the relationship between serum ferritin and hypertension risk: A study based on NHANES 2017–March 2020. Frontiers in Cardiovascular Medicine, 10, 1224795. https://doi.org/10.3389/fcvm.2023.1224795

Hwang, S. H., Lee, H., Lee, J. H., Lee, M., Koyanagi, A., Smith, L., Rhee, S. Y., Yon, D. K., & Lee, J. (2024). Machine learning–based prediction for incident hypertension based on regular health checkup data: Derivation and validation in two independent nationwide cohorts in South Korea and Japan. Journal of Medical Internet Research, 26, e52794. https://doi.org/10.2196/52794

Islam, M. M., Alam, M. J., Maniruzzaman, M., Ahmed, N. A. M. F., Ali, M. S., Rahman, M. J., & Roy, D. C. (2023). Predicting the risk of hypertension using machine learning algorithms: A cross-sectional study in Ethiopia. PLOS ONE, 18(8), e0289613. https://doi.org/10.1371/journal.pone.0289613

Lee, H., & Tsoi, P. (2025). Feature-enhanced machine learning for all-cause mortality prediction in healthcare data (Version 1). arXiv. https://doi.org/10.48550/arXiv.2503.21241

Mirzaye, A. W., Saadatfar, H., & Nematollahi, M. A. (2026). Enhancing hypertension risk diagnosis using a hybrid machine learning framework: Leveraging body composition data. BioMed Research International, 2026, 6335947. https://doi.org/10.1155/bmri/6335947

Montagna, S., Pengo, M. F., Ferretti, S., Borghi, C., Ferri, C., Grassi, G., Muiesan, M. L., & Parati, G. (2022). Machine learning in hypertension detection: A study on World Hypertension Day data. Journal of Medical Systems, 47(1), 1. https://doi.org/10.1007/s10916-022-01900-5

Mroz, T., Griffin, M., Cartabuke, R., Laffin, L., Russo-Alvarez, G., Thomas, G., Smedira, N., Meese, T., Shost, M., & Habboub, G. (2024). Predicting hypertension control using machine learning. PLOS ONE, 19(3), e0299932. https://doi.org/10.1371/journal.pone.0299932

Murad, S. H., Tayfor, N. B., Mahmood, N. H., & Arman, L. (2025). Hybrid genetic algorithms-driven optimization of machine learning models for heart disease prediction. MethodsX, 15, 103510. https://doi.org/10.1016/j.mex.2025.103510

Mutale, B., Withanage, N. C., Mishra, P. K., Shen, J., Abdelrahman, K., & Fnais, M. S. (2024). A performance evaluation of random forest, artificial neural network, and support vector machine learning algorithms to predict spatio-temporal land use–land cover dynamics: A case from Lusaka and Colombo. Frontiers in Environmental Science, 12, 1431645. https://doi.org/10.3389/fenvs.2024.1431645

Naik, A., Nalepa, J., Wijata, A. M., Mahon, J., Mistry, D., Knowles, A. T., Dawson, E. A., Lip, G. Y. H., Olier, I., & Ortega-Martorell, S. (2025). Artificial intelligence and digital twins for the personalised prediction of hypertension risk. Computers in Biology and Medicine, 196, 110718. https://doi.org/10.1016/j.compbiomed.2025.110718

Narasimhan, G., & Victor, A. (2025). A hybrid approach with metaheuristic optimization and random forest in improving heart disease prediction. Scientific Reports, 15(1), 10971. https://doi.org/10.1038/s41598-024-73867-x

Naskinova, I., Kolev, M., Karova, D., & Milev, M. (2026). Machine learning-based blood pressure prediction using cardiovascular disease data: A comprehensive comparative study. Electronics, 15(2), 312. https://doi.org/10.3390/electronics15020312

Nematollahi, M. A., Jahangiri, S., Asadollahi, A., Salimi, M., Dehghan, A., Mashayekh, M., Roshanzamir, M., Gholamabbas, G., Alizadehsani, R., Bazrafshan, M., Bazrafshan, H., Bazrafshan Drissi, H., & Shariful Islam, S. M. (2023). Body composition predicts hypertension using machine learning methods: A cohort study. Scientific Reports, 13(1), 6885. https://doi.org/10.1038/s41598-023-34127-6

Reel, P. S., Reel, S., Van Kralingen, J. C., Langton, K., Lang, K., Erlic, Z., Larsen, C. K., Amar, L., Pamporaki, C., Mulatero, P., Blanchard, A., Kabat, M., Robertson, S., MacKenzie, S. M., Taylor, A. E., Peitzsch, M., Ceccato, F., Scaroni, C., Reincke, M., & Jefferson, E. (2022). Machine learning for classification of hypertension subtypes using multi-omics: A multi-centre, retrospective, data-driven study. eBioMedicine, 84, 104276. https://doi.org/10.1016/j.ebiom.2022.104276

Ribino, P., Di Napoli, C., Paragliola, G., & Serino, L. (2024). Hyper-parameter optimization through reinforcement learning for survival prediction of patients with heart failure. Procedia Computer Science, 239, 1754–1761. https://doi.org/10.1016/j.procs.2024.06.354

Sakka, Y., Qarashai, D., & Altarawneh, A. (2023). Predicting hypertension using machine learning: A case study at Petra University. International Journal of Advanced Computer Science and Applications, 14(3). https://doi.org/10.14569/IJACSA.2023.0140368

Schjerven, F. E., Ingeström, E. M. L., Steinsland, I., & Lindseth, F. (2024). Development of risk models of incident hypertension using machine learning on the HUNT study data. Scientific Reports, 14(1), 5609. https://doi.org/10.1038/s41598-024-56170-7

Seo, J.-W., Lee, S., & Yim, M. H. (2024). Machine learning approach for predicting hypertension based on body composition in South Korean adults. Bioengineering, 11(9), 921. https://doi.org/10.3390/bioengineering11090921

Septian, E., Khaefi, M. R., Athoillah, A., Aisyah, D. N., Hardhantyo, M., Rahman, F. M., & Manikam, L. (2025). Prediction of personalised hypertension using machine learning in Indonesian population. Journal of Medical Systems, 49(1), 137. https://doi.org/10.1007/s10916-025-02253-5

Vera-Ponce, V. J., Zuzunaga-Montoya, F. E., Ballena-Caicedo, J., Gutierrez De Carrillo, C. I., León-Figueroa, D. A., & Valladares-Garrido, M. J. (2025). Predictive variables and diagnostic performance of cross-sectional models for hypertension detection: A systematic review. Frontiers in Cardiovascular Medicine, 12, 1713531. https://doi.org/10.3389/fcvm.2025.1713531

Wu, Y., Xin, B., Wan, Q., Ren, Y., & Jiang, W. (2024). Risk factors and prediction models for cardiovascular complications of hypertension in older adults with machine learning: A cross-sectional study. Heliyon, 10(6), e27941. https://doi.org/10.1016/j.heliyon.2024.e27941

Yang, J., Wang, H., Liu, P., Lu, Y., Yao, M., & Yan, H. (2024). Prediction of hypertension risk based on multiple feature fusion. Journal of Biomedical Informatics, 157, 104701. https://doi.org/10.1016/j.jbi.2024.104701

Yi, J., Wang, L., Song, J., Liu, Y., Liu, J., Zhang, H., Lu, J., & Zheng, X. (2024). Development of a machine learning-based model for predicting individual responses to antihypertensive treatments. Nutrition, Metabolism and Cardiovascular Diseases. Advance online publication. https://doi.org/10.1016/j.numecd.2024.02.014

Downloads

Published

2026-03-04

How to Cite

Nabila Christian Putri Hermawan, & Adie Wahyudi Oktavia Gama. (2026). Analisis Kinerja Random Forest untuk Prediksi Penyakit Hipertensi. Jurnal Ilmiah Sistem Informasi Dan Ilmu Komputer, 6(1), 356–371. https://doi.org/10.55606/juisik.v6i1.2144