Analisis Kinerja Random Forest untuk Prediksi Penyakit Hipertensi
DOI:
https://doi.org/10.55606/juisik.v6i1.2144Keywords:
Classification, Disease Prediction, Hypertension, Machine Learning, Random ForestAbstract
Hypertension is one of the major cardiovascular diseases that contributes significantly to global mortality and disability rates and is widely recognized as a silent killer due to its frequent absence of early symptoms. The complexity of hypertension risk factors including demographic, clinical, anthropometric, lifestyle, and medical history variables necessitates a machine learning based predictive approach capable of producing accurate and consistent classifications to support early disease detection. This study develops a Random Forest model to predict hypertension risk using the Hypertension Risk Prediction dataset obtained from Kaggle. Data processing was conducted systematically through data cleaning, handling of missing values, outlier treatment using the IQR-based capping method, exploratory data analysis, feature transformation through appropriate encoding techniques, stratified train test data splitting, and numerical feature scaling. The model was subsequently evaluated using accuracy, precision, recall, F1-score, and confusion matrix metrics on the test dataset. The evaluation results indicate that the Random Forest model achieved high classification performance, with an accuracy rate of 95.47%, accompanied by balanced performance across classes. These findings suggest that Random Forest has strong potential as an interpretable and effective hypertension prediction model to support early detection efforts and the prevention of cardiovascular complications.
References
Banerjee, S., Dunn, P., Conard, S., & Ng, R. (2023). Large language modeling and classical AI methods for the future of healthcare. Journal of Medicine, Surgery, and Public Health, 1, 100026. https://doi.org/10.1016/j.glmedi.2023.100026
Cheraghi, Z., Azmi-Naei, B., Cheraghi, P., & Doosti-Irani, A. (2025). The global prevalence of uncontrolled hypertension: A systematic review and meta-analysis. BMC Public Health, 25(1), 4259. https://doi.org/10.1186/s12889-025-25553-4
Chowdhury, M. Z. I., Naeem, I., Quan, H., Leung, A. A., Sikdar, K. C., O’Beirne, M., & Turin, T. C. (2022). Prediction of hypertension using traditional regression and machine learning models: A systematic review and meta-analysis. PLOS ONE, 17(4), e0266334. https://doi.org/10.1371/journal.pone.0266334
Danang, D., Haryani, H., Aini, Q., Ramahdan, F. A., & Edwards, J. (2025). Empowering digital literacy through blockchain-based Alphasign for secure and sustainable e-governance.
Danang, D., Prasetya, F. A., & Siswanto, E. (2026). Design of intelligent street lighting systems based on motion and ambient light sensors. Journal of Multidisciplinary Research and Technology, 2(1), 65–79.
Effati, S., Kamarzardi-Torghabe, A., Azizi-Froutaghe, F., Atighi, I., & Ghiasi-Hafez, S. (2024). Web application using machine learning to predict cardiovascular disease and hypertension in mine workers. Scientific Reports, 14(1), 31662. https://doi.org/10.1038/s41598-024-80919-9
Eini, P., Rezayee, M., Kassulke, M., & Tremblay, J. (2026). Efficacy and comparative performance of machine learning models for stroke risk prediction in hypertensive patients: A systematic review and meta-analysis. International Journal of Cardiology: Cardiovascular Risk and Prevention, 28, 200564. https://doi.org/10.1016/j.ijcrp.2025.200564
Eldem, A. (2025). A new hybrid learning model for early diagnosis of hypertension using IoMT technologies. Ain Shams Engineering Journal, 16(8), 103490. https://doi.org/10.1016/j.asej.2025.103490
Engda, A. A., Salau, A. O., & Ajala, O. (2025). Classical machine learning approaches for early hypertension risk prediction: A systematic review. Applied AI Letters, 6(3), e70005. https://doi.org/10.1002/ail2.70005
Guo, S., Ge, J.-X., Liu, S.-N., Zhou, J.-Y., Li, C., Chen, H.-J., Chen, L., Shen, Y.-Q., & Zhou, Q.-L. (2023). Development of a convenient and effective hypertension risk prediction model and exploration of the relationship between serum ferritin and hypertension risk: A study based on NHANES 2017–March 2020. Frontiers in Cardiovascular Medicine, 10, 1224795. https://doi.org/10.3389/fcvm.2023.1224795
Hwang, S. H., Lee, H., Lee, J. H., Lee, M., Koyanagi, A., Smith, L., Rhee, S. Y., Yon, D. K., & Lee, J. (2024). Machine learning–based prediction for incident hypertension based on regular health checkup data: Derivation and validation in two independent nationwide cohorts in South Korea and Japan. Journal of Medical Internet Research, 26, e52794. https://doi.org/10.2196/52794
Islam, M. M., Alam, M. J., Maniruzzaman, M., Ahmed, N. A. M. F., Ali, M. S., Rahman, M. J., & Roy, D. C. (2023). Predicting the risk of hypertension using machine learning algorithms: A cross-sectional study in Ethiopia. PLOS ONE, 18(8), e0289613. https://doi.org/10.1371/journal.pone.0289613
Lee, H., & Tsoi, P. (2025). Feature-enhanced machine learning for all-cause mortality prediction in healthcare data (Version 1). arXiv. https://doi.org/10.48550/arXiv.2503.21241
Mirzaye, A. W., Saadatfar, H., & Nematollahi, M. A. (2026). Enhancing hypertension risk diagnosis using a hybrid machine learning framework: Leveraging body composition data. BioMed Research International, 2026, 6335947. https://doi.org/10.1155/bmri/6335947
Montagna, S., Pengo, M. F., Ferretti, S., Borghi, C., Ferri, C., Grassi, G., Muiesan, M. L., & Parati, G. (2022). Machine learning in hypertension detection: A study on World Hypertension Day data. Journal of Medical Systems, 47(1), 1. https://doi.org/10.1007/s10916-022-01900-5
Mroz, T., Griffin, M., Cartabuke, R., Laffin, L., Russo-Alvarez, G., Thomas, G., Smedira, N., Meese, T., Shost, M., & Habboub, G. (2024). Predicting hypertension control using machine learning. PLOS ONE, 19(3), e0299932. https://doi.org/10.1371/journal.pone.0299932
Murad, S. H., Tayfor, N. B., Mahmood, N. H., & Arman, L. (2025). Hybrid genetic algorithms-driven optimization of machine learning models for heart disease prediction. MethodsX, 15, 103510. https://doi.org/10.1016/j.mex.2025.103510
Mutale, B., Withanage, N. C., Mishra, P. K., Shen, J., Abdelrahman, K., & Fnais, M. S. (2024). A performance evaluation of random forest, artificial neural network, and support vector machine learning algorithms to predict spatio-temporal land use–land cover dynamics: A case from Lusaka and Colombo. Frontiers in Environmental Science, 12, 1431645. https://doi.org/10.3389/fenvs.2024.1431645
Naik, A., Nalepa, J., Wijata, A. M., Mahon, J., Mistry, D., Knowles, A. T., Dawson, E. A., Lip, G. Y. H., Olier, I., & Ortega-Martorell, S. (2025). Artificial intelligence and digital twins for the personalised prediction of hypertension risk. Computers in Biology and Medicine, 196, 110718. https://doi.org/10.1016/j.compbiomed.2025.110718
Narasimhan, G., & Victor, A. (2025). A hybrid approach with metaheuristic optimization and random forest in improving heart disease prediction. Scientific Reports, 15(1), 10971. https://doi.org/10.1038/s41598-024-73867-x
Naskinova, I., Kolev, M., Karova, D., & Milev, M. (2026). Machine learning-based blood pressure prediction using cardiovascular disease data: A comprehensive comparative study. Electronics, 15(2), 312. https://doi.org/10.3390/electronics15020312
Nematollahi, M. A., Jahangiri, S., Asadollahi, A., Salimi, M., Dehghan, A., Mashayekh, M., Roshanzamir, M., Gholamabbas, G., Alizadehsani, R., Bazrafshan, M., Bazrafshan, H., Bazrafshan Drissi, H., & Shariful Islam, S. M. (2023). Body composition predicts hypertension using machine learning methods: A cohort study. Scientific Reports, 13(1), 6885. https://doi.org/10.1038/s41598-023-34127-6
Reel, P. S., Reel, S., Van Kralingen, J. C., Langton, K., Lang, K., Erlic, Z., Larsen, C. K., Amar, L., Pamporaki, C., Mulatero, P., Blanchard, A., Kabat, M., Robertson, S., MacKenzie, S. M., Taylor, A. E., Peitzsch, M., Ceccato, F., Scaroni, C., Reincke, M., & Jefferson, E. (2022). Machine learning for classification of hypertension subtypes using multi-omics: A multi-centre, retrospective, data-driven study. eBioMedicine, 84, 104276. https://doi.org/10.1016/j.ebiom.2022.104276
Ribino, P., Di Napoli, C., Paragliola, G., & Serino, L. (2024). Hyper-parameter optimization through reinforcement learning for survival prediction of patients with heart failure. Procedia Computer Science, 239, 1754–1761. https://doi.org/10.1016/j.procs.2024.06.354
Sakka, Y., Qarashai, D., & Altarawneh, A. (2023). Predicting hypertension using machine learning: A case study at Petra University. International Journal of Advanced Computer Science and Applications, 14(3). https://doi.org/10.14569/IJACSA.2023.0140368
Schjerven, F. E., Ingeström, E. M. L., Steinsland, I., & Lindseth, F. (2024). Development of risk models of incident hypertension using machine learning on the HUNT study data. Scientific Reports, 14(1), 5609. https://doi.org/10.1038/s41598-024-56170-7
Seo, J.-W., Lee, S., & Yim, M. H. (2024). Machine learning approach for predicting hypertension based on body composition in South Korean adults. Bioengineering, 11(9), 921. https://doi.org/10.3390/bioengineering11090921
Septian, E., Khaefi, M. R., Athoillah, A., Aisyah, D. N., Hardhantyo, M., Rahman, F. M., & Manikam, L. (2025). Prediction of personalised hypertension using machine learning in Indonesian population. Journal of Medical Systems, 49(1), 137. https://doi.org/10.1007/s10916-025-02253-5
Vera-Ponce, V. J., Zuzunaga-Montoya, F. E., Ballena-Caicedo, J., Gutierrez De Carrillo, C. I., León-Figueroa, D. A., & Valladares-Garrido, M. J. (2025). Predictive variables and diagnostic performance of cross-sectional models for hypertension detection: A systematic review. Frontiers in Cardiovascular Medicine, 12, 1713531. https://doi.org/10.3389/fcvm.2025.1713531
Wu, Y., Xin, B., Wan, Q., Ren, Y., & Jiang, W. (2024). Risk factors and prediction models for cardiovascular complications of hypertension in older adults with machine learning: A cross-sectional study. Heliyon, 10(6), e27941. https://doi.org/10.1016/j.heliyon.2024.e27941
Yang, J., Wang, H., Liu, P., Lu, Y., Yao, M., & Yan, H. (2024). Prediction of hypertension risk based on multiple feature fusion. Journal of Biomedical Informatics, 157, 104701. https://doi.org/10.1016/j.jbi.2024.104701
Yi, J., Wang, L., Song, J., Liu, Y., Liu, J., Zhang, H., Lu, J., & Zheng, X. (2024). Development of a machine learning-based model for predicting individual responses to antihypertensive treatments. Nutrition, Metabolism and Cardiovascular Diseases. Advance online publication. https://doi.org/10.1016/j.numecd.2024.02.014
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Jurnal ilmiah Sistem Informasi dan Ilmu Komputer

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.







