An Explainable AI-Based Decision Support System for Teaching and Classifying Hair Loss Types

Authors

  • Trisnani Widowati Universitas Negeri Semarang
  • Ade Novi Nurul Ihsani Universitas Negeri Semarang
  • Anik Maghfiroh Universitas Negeri Semarang
  • Clarita Aprilliani Universitas Negeri Semarang
  • Septian Eko Prasetyo Universitas Negeri Semarang

DOI:

https://doi.org/10.46328/ijtes.8226

Keywords:

Decision support system, Explainable Artificial Intelligence, Hair loss classification, Leakage-resistant pipeline, Multi-class classification

Abstract

Hair loss is a multifactorial condition that requires accurate classification to support reliable and personalized decision-making. However, many existing machine learning approaches suffer from data leakage and limited interpretability, reducing their robustness and practical applicability in decision support systems. This study proposes a leakage-resistant machine learning framework for multi-class hair loss classification, integrating explainable artificial intelligence to enhance transparency and reliability. The framework employs a unified preprocessing pipeline within nested cross-validation to prevent information leakage, while SMOTEENN is used to address class imbalance. Several algorithms, including Decision Tree, Random Forest, K-Nearest Neighbors, Logistic Regression, and Extreme Gradient Boosting, are evaluated within this pipeline. Experimental results indicate that Extreme Gradient Boosting achieves the best performance, with an accuracy of 0.8300, F1-score of 0.7908, and AUC of 0.9305 in nested cross-validation. Evaluation on a holdout dataset demonstrates stable generalization, achieving an accuracy of 0.8250, F1-score of 0.7517, and AUC of 0.9182. Furthermore, the integration of explainable artificial intelligence enables interpretable predictions that can be utilized in a rule-based decision support system. Overall, the proposed framework provides a robust, transparent, and leakage-resistant solution for reliable machine learning-based classification in practical applications.

References

Abbas, Q., Jeong, W., & Lee, S. W. (2025). Explainable AI in Clinical Decision Support Systems: A Meta-Analysis of Methods, Applications, and Usability Challenges. Healthcare, 13(17), 2154. https://doi.org/10.3390/healthcare13172154

Aboulmira, A., Hrimech, H., & Lachgar, M. (2024). Skin Diseases Classification with Machine Learning and Deep Learning Techniques: A Systematic Review. International Journal of Advanced Computer Science and Applications, 15(10). https://doi.org/10.14569/IJACSA.2024.01510118

Agrawal, R., Gupta, T., Gupta, S., Chauhan, S., Patel, P., & Hamdare, S. (2025). Fostering trust and interpretability: integrating explainable AI (XAI) with machine learning for enhanced disease prediction and decision transparency. Diagnostic Pathology, 20(1), 105. https://doi.org/10.1186/s13000-025-01686-3

Akilandasowmya, G., Nirmaladevi, G., Suganthi, SU., & Aishwariya, A. (2024). Skin cancer diagnosis: Leveraging deep hidden features and ensemble classifiers for early detection and classification. Biomedical Signal Processing and Control, 88, 105306. https://doi.org/10.1016/j.bspc.2023.105306

Almustafa, K. M. (2025). Predictive modeling and optimization in dermatology: Machine learning for skin disease classification. Computers in Biology and Medicine, 189, 109946. https://doi.org/10.1016/j.compbiomed.2025.109946

Apicella, A., Isgrò, F., & Prevete, R. (2025). Don’t push the button! Exploring data leakage risks in machine learning and transfer learning. Artificial Intelligence Review, 58(11), 339. https://doi.org/10.1007/s10462-025-11326-3

Arif Hidayat, & Sutedi, S. (2025). Klasifikasi Rambut Rontok Menggunakan Metode Naive Bayes. SATESI: Jurnal Sains Teknologi Dan Sistem Informasi, 5(2), 183–189. https://doi.org/10.54259/satesi.v5i2.5612

Aukerman, E. L., & Jafferany, M. (2023). The psychological consequences of androgenetic alopecia: A systematic review. Journal of Cosmetic Dermatology, 22(1), 89–95. https://doi.org/10.1111/jocd.14983

Bartz-Beielstein, T., & Zaefferer, M. (2023). Models. In Hyperparameter Tuning for Machine and Deep Learning with R (pp. 27–69). Springer Nature Singapore. https://doi.org/10.1007/978-981-19-5170-1_3

Boghosian, T., Mendez, H., Sayegh, M., Rabionet, A., Beer, J., & Tosti, A. (2026). The Intersection of Sleep and Hair Loss: A Systematic Review. Dermatology and Therapy, 16(2), 937–952. https://doi.org/10.1007/s13555-025-01641-6

Borda, L. J., & Wikramanayake, T. C. (2015). Seborrheic Dermatitis and Dandruff: A Comprehensive Review. Journal of Clinical and Investigative Dermatology, 3(2). https://doi.org/10.13188/2373-1044.1000019

Chaddad, A., Peng, J., Xu, J., & Bouridane, A. (2023). Survey of Explainable AI Techniques in Healthcare. Sensors, 23(2), 634. https://doi.org/10.3390/s23020634

Chien Yin, G. O., Siong-See, J. L., & Wang, E. C. E. (2021). Telogen Effluvium – a review of the science and current obstacles. Journal of Dermatological Science, 101(3), 156–163. https://doi.org/10.1016/j.jdermsci.2021.01.007

Dhanka, S., Kumar, A., Maini, S., Kumar, N., Singh, J., Khan, M., Abbas, M., & Ksibi, A. (2026). Padding interpolation, median imputation, RobustScalar, and particle swarm optimization with heterogeneous classifiers: a robust combination for effective heart disease diagnosis. Frontiers in Medicine, 12. https://doi.org/10.3389/fmed.2025.1721740

Fatani, M. I. A., Alkhalifah, A., Alruwaili, A. F. S., Alharbi, A. H. S., Alharithy, R., Khardaly, A. M., Almudaiheem, H. Y., Al-Jedai, A., & Eshmawi, M. T. Y. (2023). Diagnosis and Management of Alopecia Areata: A Saudi Expert Consensus Statement (2023). Dermatology and Therapy, 13(10), 2129–2151. https://doi.org/10.1007/s13555-023-00991-3

Gavazzoni Dias, M. F. (2015). Hair cosmetics: An overview. International Journal of Trichology, 7(1), 2. https://doi.org/10.4103/0974-7753.153450

Hairani, H., Widiyaningtyas, T., & Dwi Prasetya, D. (2024). Addressing Class Imbalance of Health Data: A Systematic Literature Review on Modified Synthetic Minority Oversampling Technique (SMOTE) Strategies. JOIV : International Journal on Informatics Visualization, 8(3), 1310. https://doi.org/10.62527/joiv.8.3.2283

Han, Y., & Joe, I. (2024). Enhancing Machine Learning Models Through PCA, SMOTE-ENN, and Stochastic Weighted Averaging. Applied Sciences, 14(21), 9772. https://doi.org/10.3390/app14219772

Hur, S., Lee, Y., Park, J., Jeon, Y. J., Cho, J. H., Cho, D., Lim, D., Hwang, W., Cha, W. C., & Yoo, J. (2025). Comparison of SHAP and clinician friendly explanations reveals effects on clinical decision behaviour. Npj Digital Medicine, 8(1), 578. https://doi.org/10.1038/s41746-025-01958-8

Ichwani, A., Kesuma, R. I., Setiawan, A., Wicaksono, I. E., & Hanifah, R. (2026). Preventing Data Leakage in Classification via Integrated Machine Learning Pipelines: Preprocessing, Feature Transformation, and Hyperparameter Tuning. Jurnal Teknik Informatika (Jutif), 7(1), 391–410. https://doi.org/10.52436/1.jutif.2026.7.1.5490

Jeong, K., Mallard, A. R., Coombe, L., & Ward, J. (2023). Artificial intelligence and prediction of cardiometabolic disease: Systematic review of model performance and potential benefits in indigenous populations. Artificial Intelligence in Medicine, 139, 102534. https://doi.org/10.1016/j.artmed.2023.102534

Kapoor, I., & Mishra, A. (2018). Automated Classification Method for Early Diagnosis of Alopecia Using Machine Learning. Procedia Computer Science, 132, 437–443. https://doi.org/10.1016/j.procs.2018.05.157

Li, M., Sun, H., Huang, Y., & Chen, H. (2024). Shapley value: from cooperative game to explainable artificial intelligence. Autonomous Intelligent Systems, 4(1), 2. https://doi.org/10.1007/s43684-023-00060-8

Liu, S., Du, H., & Feng, M. (2020). Robust Predictive Models in Clinical Data—Random Forest and Support Vector Machines. In Leveraging Data Science for Global Health (pp. 219–228). Springer International Publishing. https://doi.org/10.1007/978-3-030-47994-7_13

Lusito, S., Pugnana, A., & Guidotti, R. (2024). Solving imbalanced learning with outlier detection and features reduction. Machine Learning, 113(8), 5273–5330. https://doi.org/10.1007/s10994-023-06448-0

Ly, N., Paiewonsky, B., Fruechte, S., Goldfarb, N., Hordinsky, M. K., Bakker, C., Sadick, N., Arruda, S., & Farah, R. S. (2025). Caffeine Supplementation and Hair: A Systematic Review. Journal of Drugs in Dermatology, 24(11), 1070–1074. https://doi.org/10.36849/JDD.8902

Maloh, J., Engel, T., Natarelli, N., Nong, Y., Zufall, A., & Sivamani, R. K. (2023). Systematic Review of Psychological Interventions for Quality of Life, Mental Health, and Hair Growth in Alopecia Areata and Scarring Alopecia. Journal of Clinical Medicine, 12(3), 964. https://doi.org/10.3390/jcm12030964

Mienye, I. D., Swart, T. G., Obaido, G., Jordan, M., & Ilono, P. (2025). Deep Convolutional Neural Networks in Medical Image Analysis: A Review. Information, 16(3), 195. https://doi.org/10.3390/info16030195

Nohara, Y., Matsumoto, K., Soejima, H., & Nakashima, N. (2022). Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Computer Methods and Programs in Biomedicine, 214, 106584. https://doi.org/10.1016/j.cmpb.2021.106584

Omar, E. D., Mat, H., Abd Karim, A. Z., Sanaudi, R., Ibrahim, F., Omar, M. A., Ismail, M. Z. H., Jayaraj, V., & Goh, B. L. (2024). Comparative Analysis of Logistic Regression, Gradient Boosted Trees, SVM, and Random Forest Algorithms for Prediction of Acute Kidney Injury Requiring Dialysis After Cardiac Surgery. International Journal of Nephrology and Renovascular Disease, Volume 17, 197–204. https://doi.org/10.2147/IJNRD.S461028

Parikh, A. K., Tan, I. J., Wolfe, S. M., & Cohen, B. A. (2024). Advances in Topical Therapies for Clinically Relevant and Prevalent Forms of Alopecia. Life, 14(12), 1577. https://doi.org/10.3390/life14121577

Pensa, R. G., Crombach, A., Peignier, S., & Rigotti, C. (2025). Explaining Random Forest and XGBoost with Shallow Decision Trees by Co-clustering Feature Importance. Machine Learning, 114(12), 287. https://doi.org/10.1007/s10994-025-06932-9

Saad Hussein, A., Li, T., Yohannese, C. W., & Bashir, K. (2019). A-SMOTE: A New Preprocessing Approach for Highly Imbalanced Datasets by Improving SMOTE. International Journal of Computational Intelligence Systems, 12(2), 1412. https://doi.org/10.2991/ijcis.d.191114.002

Salmi, M., Atif, D., Oliva, D., Abraham, A., & Ventura, S. (2024). Handling imbalanced medical datasets: review of a decade of research. Artificial Intelligence Review, 57(10), 273. https://doi.org/10.1007/s10462-024-10884-2

Sarhan, A. M., Ali, H. A., Yasser, S., Gobara, M., Kandil, A. A., Sherif, G., & Moustafa, E. (2025). Achieving high-accuracy skin cancer classification with deep learning optimized by ant colony algorithm. Journal of Electrical Systems and Information Technology, 12(1), 49. https://doi.org/10.1186/s43067-025-00243-8

Sasse, L., Nicolaisen-Sobesky, E., Dukart, J., Eickhoff, S. B., Götz, M., Hamdan, S., Komeyer, V., Kulkarni, A., Lahnakoski, J. M., Love, B. C., Raimondo, F., & Patil, K. R. (2025). Overview of leakage scenarios in supervised machine learning. Journal of Big Data, 12(1), 135. https://doi.org/10.1186/s40537-025-01193-8

Siami, M. I., & Azis, H. (2025). Predicting Hair Loss with Machine Learning: A Multi-Factor Analysis. International Journal of Artificial Intelligence in Medical Issues, 3(1), 60–68. https://doi.org/10.56705/ijaimi.v3i1.360

Sirish Kumar, M., Reddy, P. L. K., Dinesh Reddy, G., Kumar, A. S., & Nagendra, P. (2025). Predictive Modeling of Hair Fall using Random Forest Algorithms. Proceedings of the 4th International Conference on Information Technology, Civil Innovation, Science, and Management, ICITSM 2025, 28-29 April 2025, Tiruchengode, Tamil Nadu, India, Part II. https://doi.org/10.4108/eai.28-4-2025.2358120

Sujon, K. M., Hassan, R., Choi, K., & Samad, M. A. (2025). Accuracy, precision, recall, f1-score, or MCC? empirical evidence from advanced statistics, ML, and XAI for evaluating business predictive models. Journal of Big Data, 12(1), 268. https://doi.org/10.1186/s40537-025-01313-4

Taha, K. (2025). Machine learning in biomedical and health big data: a comprehensive survey with empirical and experimental insights. Journal of Big Data, 12(1), 61. https://doi.org/10.1186/s40537-025-01108-7

Teoh, T. T. (2023). Convolutional Neural Networks for Medical Applications. Springer Nature Singapore. https://doi.org/10.1007/978-981-19-8814-1

Venkatesh, J., Vijayalakshmi, R. A. K., Krishnan, D., & Partheeban, P. (2026). EfficientNet-based soft computing techniques for dermatological condition detection. Discover Artificial Intelligence. https://doi.org/10.1007/s44163-026-01147-w

Vinutha, H. P., Poornima, B., & Sagar, B. M. (2018). Detection of Outliers Using Interquartile Range Technique from Intrusion Dataset (pp. 511–518). https://doi.org/10.1007/978-981-10-7563-6_53

Wainer, J., & Cawley, G. (2021). Nested cross-validation when selecting classifiers is overzealous for most practical applications. Expert Systems with Applications, 182, 115222. https://doi.org/10.1016/j.eswa.2021.115222

Yoraeni, A., & Rakhmah, S. N. (2025). Penerapan Algoritma Naive Bayes untuk Prediksi Kerontokan Rambut. Jurnal Bumigora Information Technology (BITe), 7(1), 63–70. https://doi.org/10.30812/bite.v7i1.5201

Downloads

Published

2026-05-11

Issue

Section

Articles

How to Cite

An Explainable AI-Based Decision Support System for Teaching and Classifying Hair Loss Types. (2026). International Journal of Technology in Education and Science, 641-663. https://doi.org/10.46328/ijtes.8226