An Explainable AI-Based Decision Support System for Teaching and Classifying Hair Loss Types
DOI:
https://doi.org/10.46328/ijtes.8226Keywords:
Decision support system, Explainable Artificial Intelligence, Hair loss classification, Leakage-resistant pipeline, Multi-class classificationAbstract
Hair loss is a multifactorial condition that requires accurate classification to support reliable and personalized decision-making. However, many existing machine learning approaches suffer from data leakage and limited interpretability, reducing their robustness and practical applicability in decision support systems. This study proposes a leakage-resistant machine learning framework for multi-class hair loss classification, integrating explainable artificial intelligence to enhance transparency and reliability. The framework employs a unified preprocessing pipeline within nested cross-validation to prevent information leakage, while SMOTEENN is used to address class imbalance. Several algorithms, including Decision Tree, Random Forest, K-Nearest Neighbors, Logistic Regression, and Extreme Gradient Boosting, are evaluated within this pipeline. Experimental results indicate that Extreme Gradient Boosting achieves the best performance, with an accuracy of 0.8300, F1-score of 0.7908, and AUC of 0.9305 in nested cross-validation. Evaluation on a holdout dataset demonstrates stable generalization, achieving an accuracy of 0.8250, F1-score of 0.7517, and AUC of 0.9182. Furthermore, the integration of explainable artificial intelligence enables interpretable predictions that can be utilized in a rule-based decision support system. Overall, the proposed framework provides a robust, transparent, and leakage-resistant solution for reliable machine learning-based classification in practical applications.
References
Abbas, Q., Jeong, W., & Lee, S. W. (2025). Explainable AI in Clinical Decision Support Systems: A Meta-Analysis of Methods, Applications, and Usability Challenges. Healthcare, 13(17), 2154. https://doi.org/10.3390/healthcare13172154
Aboulmira, A., Hrimech, H., & Lachgar, M. (2024). Skin Diseases Classification with Machine Learning and Deep Learning Techniques: A Systematic Review. International Journal of Advanced Computer Science and Applications, 15(10). https://doi.org/10.14569/IJACSA.2024.01510118
Agrawal, R., Gupta, T., Gupta, S., Chauhan, S., Patel, P., & Hamdare, S. (2025). Fostering trust and interpretability: integrating explainable AI (XAI) with machine learning for enhanced disease prediction and decision transparency. Diagnostic Pathology, 20(1), 105. https://doi.org/10.1186/s13000-025-01686-3
Akilandasowmya, G., Nirmaladevi, G., Suganthi, SU., & Aishwariya, A. (2024). Skin cancer diagnosis: Leveraging deep hidden features and ensemble classifiers for early detection and classification. Biomedical Signal Processing and Control, 88, 105306. https://doi.org/10.1016/j.bspc.2023.105306
Almustafa, K. M. (2025). Predictive modeling and optimization in dermatology: Machine learning for skin disease classification. Computers in Biology and Medicine, 189, 109946. https://doi.org/10.1016/j.compbiomed.2025.109946
Apicella, A., Isgrò, F., & Prevete, R. (2025). Don’t push the button! Exploring data leakage risks in machine learning and transfer learning. Artificial Intelligence Review, 58(11), 339. https://doi.org/10.1007/s10462-025-11326-3
Arif Hidayat, & Sutedi, S. (2025). Klasifikasi Rambut Rontok Menggunakan Metode Naive Bayes. SATESI: Jurnal Sains Teknologi Dan Sistem Informasi, 5(2), 183–189. https://doi.org/10.54259/satesi.v5i2.5612
Aukerman, E. L., & Jafferany, M. (2023). The psychological consequences of androgenetic alopecia: A systematic review. Journal of Cosmetic Dermatology, 22(1), 89–95. https://doi.org/10.1111/jocd.14983
Bartz-Beielstein, T., & Zaefferer, M. (2023). Models. In Hyperparameter Tuning for Machine and Deep Learning with R (pp. 27–69). Springer Nature Singapore. https://doi.org/10.1007/978-981-19-5170-1_3
Boghosian, T., Mendez, H., Sayegh, M., Rabionet, A., Beer, J., & Tosti, A. (2026). The Intersection of Sleep and Hair Loss: A Systematic Review. Dermatology and Therapy, 16(2), 937–952. https://doi.org/10.1007/s13555-025-01641-6
Borda, L. J., & Wikramanayake, T. C. (2015). Seborrheic Dermatitis and Dandruff: A Comprehensive Review. Journal of Clinical and Investigative Dermatology, 3(2). https://doi.org/10.13188/2373-1044.1000019
Chaddad, A., Peng, J., Xu, J., & Bouridane, A. (2023). Survey of Explainable AI Techniques in Healthcare. Sensors, 23(2), 634. https://doi.org/10.3390/s23020634
Chien Yin, G. O., Siong-See, J. L., & Wang, E. C. E. (2021). Telogen Effluvium – a review of the science and current obstacles. Journal of Dermatological Science, 101(3), 156–163. https://doi.org/10.1016/j.jdermsci.2021.01.007
Dhanka, S., Kumar, A., Maini, S., Kumar, N., Singh, J., Khan, M., Abbas, M., & Ksibi, A. (2026). Padding interpolation, median imputation, RobustScalar, and particle swarm optimization with heterogeneous classifiers: a robust combination for effective heart disease diagnosis. Frontiers in Medicine, 12. https://doi.org/10.3389/fmed.2025.1721740
Fatani, M. I. A., Alkhalifah, A., Alruwaili, A. F. S., Alharbi, A. H. S., Alharithy, R., Khardaly, A. M., Almudaiheem, H. Y., Al-Jedai, A., & Eshmawi, M. T. Y. (2023). Diagnosis and Management of Alopecia Areata: A Saudi Expert Consensus Statement (2023). Dermatology and Therapy, 13(10), 2129–2151. https://doi.org/10.1007/s13555-023-00991-3
Gavazzoni Dias, M. F. (2015). Hair cosmetics: An overview. International Journal of Trichology, 7(1), 2. https://doi.org/10.4103/0974-7753.153450
Hairani, H., Widiyaningtyas, T., & Dwi Prasetya, D. (2024). Addressing Class Imbalance of Health Data: A Systematic Literature Review on Modified Synthetic Minority Oversampling Technique (SMOTE) Strategies. JOIV : International Journal on Informatics Visualization, 8(3), 1310. https://doi.org/10.62527/joiv.8.3.2283
Han, Y., & Joe, I. (2024). Enhancing Machine Learning Models Through PCA, SMOTE-ENN, and Stochastic Weighted Averaging. Applied Sciences, 14(21), 9772. https://doi.org/10.3390/app14219772
Hur, S., Lee, Y., Park, J., Jeon, Y. J., Cho, J. H., Cho, D., Lim, D., Hwang, W., Cha, W. C., & Yoo, J. (2025). Comparison of SHAP and clinician friendly explanations reveals effects on clinical decision behaviour. Npj Digital Medicine, 8(1), 578. https://doi.org/10.1038/s41746-025-01958-8
Ichwani, A., Kesuma, R. I., Setiawan, A., Wicaksono, I. E., & Hanifah, R. (2026). Preventing Data Leakage in Classification via Integrated Machine Learning Pipelines: Preprocessing, Feature Transformation, and Hyperparameter Tuning. Jurnal Teknik Informatika (Jutif), 7(1), 391–410. https://doi.org/10.52436/1.jutif.2026.7.1.5490
Jeong, K., Mallard, A. R., Coombe, L., & Ward, J. (2023). Artificial intelligence and prediction of cardiometabolic disease: Systematic review of model performance and potential benefits in indigenous populations. Artificial Intelligence in Medicine, 139, 102534. https://doi.org/10.1016/j.artmed.2023.102534
Kapoor, I., & Mishra, A. (2018). Automated Classification Method for Early Diagnosis of Alopecia Using Machine Learning. Procedia Computer Science, 132, 437–443. https://doi.org/10.1016/j.procs.2018.05.157
Li, M., Sun, H., Huang, Y., & Chen, H. (2024). Shapley value: from cooperative game to explainable artificial intelligence. Autonomous Intelligent Systems, 4(1), 2. https://doi.org/10.1007/s43684-023-00060-8
Liu, S., Du, H., & Feng, M. (2020). Robust Predictive Models in Clinical Data—Random Forest and Support Vector Machines. In Leveraging Data Science for Global Health (pp. 219–228). Springer International Publishing. https://doi.org/10.1007/978-3-030-47994-7_13
Lusito, S., Pugnana, A., & Guidotti, R. (2024). Solving imbalanced learning with outlier detection and features reduction. Machine Learning, 113(8), 5273–5330. https://doi.org/10.1007/s10994-023-06448-0
Ly, N., Paiewonsky, B., Fruechte, S., Goldfarb, N., Hordinsky, M. K., Bakker, C., Sadick, N., Arruda, S., & Farah, R. S. (2025). Caffeine Supplementation and Hair: A Systematic Review. Journal of Drugs in Dermatology, 24(11), 1070–1074. https://doi.org/10.36849/JDD.8902
Maloh, J., Engel, T., Natarelli, N., Nong, Y., Zufall, A., & Sivamani, R. K. (2023). Systematic Review of Psychological Interventions for Quality of Life, Mental Health, and Hair Growth in Alopecia Areata and Scarring Alopecia. Journal of Clinical Medicine, 12(3), 964. https://doi.org/10.3390/jcm12030964
Mienye, I. D., Swart, T. G., Obaido, G., Jordan, M., & Ilono, P. (2025). Deep Convolutional Neural Networks in Medical Image Analysis: A Review. Information, 16(3), 195. https://doi.org/10.3390/info16030195
Nohara, Y., Matsumoto, K., Soejima, H., & Nakashima, N. (2022). Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Computer Methods and Programs in Biomedicine, 214, 106584. https://doi.org/10.1016/j.cmpb.2021.106584
Omar, E. D., Mat, H., Abd Karim, A. Z., Sanaudi, R., Ibrahim, F., Omar, M. A., Ismail, M. Z. H., Jayaraj, V., & Goh, B. L. (2024). Comparative Analysis of Logistic Regression, Gradient Boosted Trees, SVM, and Random Forest Algorithms for Prediction of Acute Kidney Injury Requiring Dialysis After Cardiac Surgery. International Journal of Nephrology and Renovascular Disease, Volume 17, 197–204. https://doi.org/10.2147/IJNRD.S461028
Parikh, A. K., Tan, I. J., Wolfe, S. M., & Cohen, B. A. (2024). Advances in Topical Therapies for Clinically Relevant and Prevalent Forms of Alopecia. Life, 14(12), 1577. https://doi.org/10.3390/life14121577
Pensa, R. G., Crombach, A., Peignier, S., & Rigotti, C. (2025). Explaining Random Forest and XGBoost with Shallow Decision Trees by Co-clustering Feature Importance. Machine Learning, 114(12), 287. https://doi.org/10.1007/s10994-025-06932-9
Saad Hussein, A., Li, T., Yohannese, C. W., & Bashir, K. (2019). A-SMOTE: A New Preprocessing Approach for Highly Imbalanced Datasets by Improving SMOTE. International Journal of Computational Intelligence Systems, 12(2), 1412. https://doi.org/10.2991/ijcis.d.191114.002
Salmi, M., Atif, D., Oliva, D., Abraham, A., & Ventura, S. (2024). Handling imbalanced medical datasets: review of a decade of research. Artificial Intelligence Review, 57(10), 273. https://doi.org/10.1007/s10462-024-10884-2
Sarhan, A. M., Ali, H. A., Yasser, S., Gobara, M., Kandil, A. A., Sherif, G., & Moustafa, E. (2025). Achieving high-accuracy skin cancer classification with deep learning optimized by ant colony algorithm. Journal of Electrical Systems and Information Technology, 12(1), 49. https://doi.org/10.1186/s43067-025-00243-8
Sasse, L., Nicolaisen-Sobesky, E., Dukart, J., Eickhoff, S. B., Götz, M., Hamdan, S., Komeyer, V., Kulkarni, A., Lahnakoski, J. M., Love, B. C., Raimondo, F., & Patil, K. R. (2025). Overview of leakage scenarios in supervised machine learning. Journal of Big Data, 12(1), 135. https://doi.org/10.1186/s40537-025-01193-8
Siami, M. I., & Azis, H. (2025). Predicting Hair Loss with Machine Learning: A Multi-Factor Analysis. International Journal of Artificial Intelligence in Medical Issues, 3(1), 60–68. https://doi.org/10.56705/ijaimi.v3i1.360
Sirish Kumar, M., Reddy, P. L. K., Dinesh Reddy, G., Kumar, A. S., & Nagendra, P. (2025). Predictive Modeling of Hair Fall using Random Forest Algorithms. Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part II. https://doi.org/10.4108/eai.28-4-2025.2358120
Sujon, K. M., Hassan, R., Choi, K., & Samad, M. A. (2025). Accuracy, precision, recall, f1-score, or MCC? empirical evidence from advanced statistics, ML, and XAI for evaluating business predictive models. Journal of Big Data, 12(1), 268. https://doi.org/10.1186/s40537-025-01313-4
Taha, K. (2025). Machine learning in biomedical and health big data: a comprehensive survey with empirical and experimental insights. Journal of Big Data, 12(1), 61. https://doi.org/10.1186/s40537-025-01108-7
Teoh, T. T. (2023). Convolutional Neural Networks for Medical Applications. Springer Nature Singapore. https://doi.org/10.1007/978-981-19-8814-1
Venkatesh, J., Vijayalakshmi, R. A. K., Krishnan, D., & Partheeban, P. (2026). EfficientNet-based soft computing techniques for dermatological condition detection. Discover Artificial Intelligence. https://doi.org/10.1007/s44163-026-01147-w
Vinutha, H. P., Poornima, B., & Sagar, B. M. (2018). Detection of Outliers Using Interquartile Range Technique from Intrusion Dataset (pp. 511–518). https://doi.org/10.1007/978-981-10-7563-6_53
Wainer, J., & Cawley, G. (2021). Nested cross-validation when selecting classifiers is overzealous for most practical applications. Expert Systems with Applications, 182, 115222. https://doi.org/10.1016/j.eswa.2021.115222
Yoraeni, A., & Rakhmah, S. N. (2025). Penerapan Algoritma Naive Bayes untuk Prediksi Kerontokan Rambut. Jurnal Bumigora Information Technology (BITe), 7(1), 63–70. https://doi.org/10.30812/bite.v7i1.5201
Downloads
Published
Issue
Section
License
Copyright (c) 2026 International Journal of Technology in Education and Science

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

