A Machine Learning Framework for Anticipating Ferritin Deficiency Utilizing Clinical Biomarkers and Gradient Boosting Models

Authors

  • Zahraa Kadhim AlSendi University Of Kerbala / College of Computer Science and Information Technology

DOI:

https://doi.org/10.29304/jqcsm.2026.18.22698

Keywords:

Machine Learning, XGBoost, LightGBM, CatBoost

Abstract

Ferritin deficiency is a public health problem that causes. It means people are iron deficient and that can have serious health consequences. We need to catch people at risk early and make sure that these problems do not happen.

This study takes these insights and uses a machine learning framework to predict whether an individual will have ferritin levels. We use data including who people are, what their blood contains and their diets. We extracted this information from NHANES data.

We ran three different machine learning systems. XGBoost, LightGBM and CatBoost. A sample of 1318 people were tested. The main problem was that there were no subjects with ferritin deficiency in the group. We used person measures such as area under the curve and accuracy to take a look on how the systems worked. CatBoost did the best of the bunch. It was good at detecting whom had deficiency and whom not. The area under curve of CatBoost system was 0.946 which’s very good. It also scored with a recall of 64.77% and an F1-score of 0.60 So that it was not bad at identifying deficiency. CatBoost system reduced the number of negatives by 38% compared to basic system. LightGBM system did well too. The Catboost system performed better than the others in identifying individuals with ferritin deficiency.

The results indicate that machine learning systems can help us identification of the people who are at risk of health problems. To make sense of results, we also need to view them in the context of something that some health problemsre not common

Downloads

Download data is not yet available.

References

Efros, O., Soffer, S., Mudrik, A., Robinson, R., Kenet, G., Nadkarni, G.N., and Klang, E.: ‘Predictive machine-learning model for screening iron deficiency without anaemia: a retrospective cohort study’, BMJ Open, 2025, 15, (8), pp. e097016

Kurstjens, S., de Bel, T., van der Horst, A., Kusters, R., Krabbe, J., and van Balveren, J.: ‘Automated prediction of low ferritin concentrations using a machine learning algorithm’, Clin Chem Lab Med, 2022, 60, (12), pp. 1921-1928

Rojanaphan, P.: ‘Automated Nutrient Deficiency Detection and Recommendation Systems Using Deep Learning in Nutrition Science’, International Journal of Scientific Research and Management (IJSRM), 2024, 12, (11), pp. 1746-1763

Das, A., Bai, C.H., Chang, J.S., Huang, Y.L., Wang, F.F., Hsu, C.Y., Chen, Y.C., and Chao, J.C.: ‘A ferritin-related dietary pattern is positively associated with iron status but negatively associated with vitamin D status in pregnant women: a cross-sectional study’, Eur J Nutr, 2024, 64, (1), pp. 30

Revesai, Z., and Kogeda, O.P.: ‘Knowledge-Guided Neural Networks for Transparent Micronutrient Deficiency Detection’, in Editor (Ed.)^(Eds.): ‘Book Knowledge-Guided Neural Networks for Transparent Micronutrient Deficiency Detection’ (2025, edn.), pp. 1-6

A. Ali Heydari, Naghmeh Rezaei, Javier L. Prieto†, Shwetak N. Patel, Ahmed A. Metwally: ‘Lifestyle-Informed Personalized Blood Biomarker Prediction via Novel Representation Learning ’, 2024, (arXiv:2407.07277v1 [cs.LG] 9 Jul 2024)

Sawicki, C., Haslam, D., and Bhupathiraju, S.: ‘Utilising the precision nutrition toolkit in the path towards precision medicine’, Proc Nutr Soc, 2023, 82, (3), pp. 359-369

Obermeyer, Z., and Emanuel, E.J.: ‘Predicting the Future — Big Data, Machine Learning, and Clinical Medicine’, N Engl J Med, 2016, 375, (13), pp. 1216–1219

] Rajkomar, A., Dean, J., and Kohane, I.: ‘Machine Learning in Medicine’, N Engl J Med, 2019, 380, (14), pp. 1347–135810

Guyon, I., and Elisseeff, A.: ‘An Introduction to Variable and Feature Selection’, J Mach Learn Res, 2003, 3, pp. 1157–1182

X. Yao, T. Li, and J. J. Liu, “Application of Machine Learning Model in Fraud Identification: A Comparative Study of CatBoost, XGBoost and LightGBM,” Mar. 2025, doi: 10.20944/preprints202503.1199.v1.

M. R. Johnson, E. S. A. Sanders, D. Miller, S. Ramírez, and D. R. Adams, “Comparative Study of Efficient Machine Learning Models for Real-Time Fraud Detection: CatBoost, XGBoost and LightGBM,” Sep. 2025, doi: 10.21203/rs.3.rs-7539803/v1.

Bentéjac, C., Csörgő, A., & Martínez-Muñoz, G. (2021). A comparative analysis of gradient boosting algorithms. Artificial Intelligence Review, 54(3), 1937–1967. https://doi.org/10.1007/S10462-020-09896-5

Д. С. Пономарев, “Сопоставление библиотек для создания моделей машинного обучения на основе методов градиентного бустинга,” Современные инновации, системы и технологии, vol. 5, no. 2, pp. 3001–3006, Apr. 2025, doi: 10.47813/2782-2818-2025-5-2-3001-3006.

R. D. Cançado, L. A. C. Leite, and M. Múñoz, “Defining Global Thresholds for Serum Ferritin: A Challenging Mission in Establishing the Iron Deficiency Diagnosis in This Era of Striving for Health Equity,” Diagnostics, vol. 15, no. 3, p. 289, Jan. 2025, doi: 10.3390/diagnostics15030289.

Downloads

Published

2026-06-27

How to Cite

AlSendi, Z. K. (2026). A Machine Learning Framework for Anticipating Ferritin Deficiency Utilizing Clinical Biomarkers and Gradient Boosting Models. Journal of Al-Qadisiyah for Computer Science and Mathematics, 18(2), Comp 220–230. https://doi.org/10.29304/jqcsm.2026.18.22698

Issue

Section

Computer Articles