Statistical and machine learning analysis of Potato damage: modeling count data with GLM and predictive algorithms
DOI:
https://doi.org/10.64497/jssci.27Keywords:
Predictive algorithms,, Overdispersion, Potato damage, Model comparison, Zero-inflated model(ZIM)Abstract
This study evaluates statistical and machine learning approaches for modeling potato damage count data, comparing Poisson regression, Negative Binomial regression, Random Forest, and XGBoost. Initial analysis revealed significant overdispersion (dispersion parameter = 4.95) in the Poisson model, prompting the use of Negative Binomial regression, which demonstrated superior fit (AIC reduction from 6093 to 4326) despite higher prediction errors (RMSE = 8.28). Machine learning models outperformed traditional approaches, with Random Forest achieving the lowest prediction errors (RMSE = 3.13, MAE = 1.97), suggesting better capture of complex data patterns. Key factors influencing damage included genotype (G3 increasing damage by 30%, G5 reducing it by 27%), energy type (E2 increasing damage by 46%), and rodent exposure (R5-R8 showing protective effects). While Negative Binomial regression provided more reliable statistical inference for count data, machine learning offered superior predictive accuracy. These findings highlight the importance of selecting modeling approaches based on study objectives—traditional methods for understanding factor effects and machine learning for optimal prediction. The results provide actionable insights for potato storage and handling practices while demonstrating the value of multiple analytical approaches in agricultural research.
Downloads
References
[1] Food and Agriculture Organization (FAO). . (2022). The state of food and agriculture 2022: Leveraging automation in agriculture for transformative change.. . Rome: FAO, https://doi.org/10.4060/cb9479en. DOI: https://doi.org/10.4060/cb9479en
[2] Al-Mallahi, A., Kataoka, T., & Okamoto, H. . (2010). Detection of potato tubers using an ultraviolet imaging technique. ,. Computers and Electronics in Agriculture, 73(1), 59-67. https://doi.org/10.1016/j.compag.2010.04.004 DOI: https://doi.org/10.1016/j.compag.2010.04.004
[3] Bollen, A. F., Dela Rue, B. T., & Coat, F. W. . (2001). A comparative study of the mechanical behaviour of potato tubers under compression, impact and shear loading. Journal of Agricultural Engineering Research, 78(3), 291-297. https://doi.org/10.1006/jaer.2000.0643 DOI: https://doi.org/10.1006/jaer.2000.0642
[4] Peters, R. (1996). Damage and damage prevention. Potato harvest and storage, 16-18. Potato harvest and storage, 16-18.
[5] Cameron, A. C., & Trivedi, P. K. (2013). Regression analysis of count data (2nd ed.). . . Cambridge University Press, https://doi.org/10.1017/CBO9781139013567. DOI: https://doi.org/10.1017/CBO9781139013567
[6] Hilbe, J. M. (2011). Negative binomial regression. . Cambridge University Press. https://doi.org/10.1017/CBO9780511973420 DOI: https://doi.org/10.1017/CBO9780511973420
[7] Payne, R. W., Harding, S. A., Murray, D. A., Soutar, D. M., Baird, D. B., & Welham, S. J. . (2022). The guide to GenStat release 21, part 2: . Statistics. VSN International.
[8] Biau, G., & Scornet, E. . (2016). A random forest guided tour. Test, . Test, 25(2), 197-227. https://doi.org/10.1007/s11749-016-0481-7 DOI: https://doi.org/10.1007/s11749-016-0481-7
[9] Shahhosseini, M., Hu, G., & Archontoulis, S. V. (2023). Forecasting corn yield with machine learning ensembles. . Frontiers in Plant Science,, 14, 1120185. https://doi.org/10.3389/fpls.2023.1120185.
[10] Yarkoni, T., & Westfall, J. . (2017). Choosing prediction over explanation in psychology: Lessons from machine learning. Perspectives on Psychological Science, 12(6), 1100-1122. https://doi.org/10.1177/1745691617693393 DOI: https://doi.org/10.1177/1745691617693393
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 O.P. Adebayo, I. Ahmed, M.I. Garba, K.T. Oyeleke

This work is licensed under a Creative Commons Attribution 4.0 International License.
- Abstract 162
- PDF 118

