The modified sine distribution and machine learning models for enhancing crude oil production prediction
DOI:
https://doi.org/10.64497/jssci.3Keywords:
Sine distribution, transformation technique, quantile function, machine learning, crude oil productionAbstract
Economic data modeling and forecasting are crucial for facilitating decision-making in the industrial and financial realms. This paper describes a dual-objective study focusing on modeling and forecasting economic data. The Modified Sine (MoS) distribution is introduced in the modeling phase, which was constructed utilizing an innovative transformation technique. This technique incorporates an additional parameter, enhancing the distribution's flexibility. The MoS distribution exhibits diverse density shapes; its hazard function is versatile, displaying decreasing and inverted bathtub patterns. These features make the MoS distribution highly suitable for analyzing economic and industrial datasets. Key statistical properties have been identified, and parameters are evaluated using maximum likelihood estimation (MLE). The MoS distribution's performance is validated by applying it to crude oil production data, where it outperforms established models in goodness-of-fit metrics. Machine learning models are employed to forecast economic trends in the prediction phase. Ridge, Support Vector Regression (SVR), Linear Regression, Lasso, and Random Forest are some of the models employed. They are assessed using measures like mean absolute error (MAE), mean absolute percentage error (MAPE), mean squared error (MSE), and root mean squared error (RMSE). Recursive Feature Elimination (RFE) is used to identify significant predictors, reducing model complexity and overfitting. Random Forest achieves the lowest error rates before and after feature selection, consistently demonstrating the best predictive performance. The study reveals that feature selection significantly enhances model accuracy, with Random Forest delivering the most precise predictions. This integrated approach highlights the MoS distribution as a reliable instrument for statistical computing.
Downloads
References
[1] C. Omekara, O. Okereke, K. Ire, and C. Okamgba, “ARIMA modeling of Nigeria crude oil production,” Journal of Energy Technologies and Policy, vol. 5, pp. 1–5, 2015.
[2] C. Okororie, K. Egwim, C. Eke, and D. Onuoha, “Buys-ballot modeling of Nigerian domestic crude oil production,” Journal of Economic and Business, vol. 4, no. 7, pp. 13–19, 2012.
[3] C. P. Obite, A. Chukwu, D. C. Bartholomew, U. I. Nwosu, and G. E. Esiaba, “Classical and machine learning modeling of crude oil production in Nigeria: Identification of an eminent model for application,” Energy Reports, vol. 7, pp. 3497–3505, 2021. DOI: https://doi.org/10.1016/j.egyr.2021.06.005
[4] A. Usman, A. I. Ishaq, M. Tasi’U, A. Suleiman, and U. A. Abdullahi, “Inverse Power Log-Toppleone distribution and its competitors for estimating the exchange rate datasets,” 2023 International Conference on Data Analytics for Business and Industry (ICDABI), pp. 639–643, 2023. DOI: https://doi.org/10.1109/ICDABI60145.2023.10629436
[5] A, A, Suleiman, H. Daud, N. S. S. Singh, M. Othman, A. I. Ishaq, and R. Sokkalingam, “A novel odd beta prime-logistic distribution: Desirable mathematical properties and applications to engineering and environmental data,” Sustainability, vol. 15, no. 13, pp. 10239, 2023. DOI: https://doi.org/10.3390/su151310239
[6] A. I. Ishaq, U. Panitanarak, A. A. Abiodun, A. A. Suleiman, and H. Daud, “The Generalized odd Maxwell-Kumaraswamy distribution: Its properties and applications,” Contemporary Mathematics, vol. 5, pp. 711-742, 2024. DOI: https://doi.org/10.37256/cm.5120242888
[7] H. Daud, A. A. Suleiman, A. I. Ishaq, N. Alsadat, M. Elgarhy, A. Usman, P. Wiratchotisatian, U. A. Ubale, and Y. Liping, “A new extension of the Gumbel distribution with biomedical data analysis,” Journal of Radiation Research and Applied Sciences, vol. 17, no. 4, pp. 101055, 2024. DOI: https://doi.org/10.1016/j.jrras.2024.101055
[8] A. A. Suleiman, H. Daud, A. I. Ishaq, M. Othman, H. M. Alshanbari, and S. N. Alaziz, “A novel extended Kumaraswamy distribution and its application to COVID19 data,” Engineering Reports, p. e12967, 2023. DOI: https://doi.org/10.1002/eng2.12967
[9] A. A. Suleiman, H. Daud, N. S. S. Singh, A. I. Ishaq, and M. Othman, “A new odd beta prime-burr X distribution with applications to petroleum rock sample data and COVID-19 mortality rate,” Data, vol. 8, no. 9, pp. 143, 2023. DOI: https://doi.org/10.3390/data8090143
[10] Ishaq, A., Abiodun, A., Suleiman, A., Usman, A., Mohammed, A. and Tasiu, M. (2023). Modelling Nigerian Inflation Rates from January 2003 to June 2023 Using Newly Developed Inverse Power Chi-Square Distribution. 2023 International Conference on Data Analytics for Business and Industry (ICDABI), 644–651. DOI: https://doi.org/10.1109/ICDABI60145.2023.10629442
[11] A. Hassan, I. Dar, and M. Lone, “A novel family of generating distributions based on trigonometric function with an application to exponential distribution,” Journal of Scientific Research, vol. 65, no. 5, pp. 172–179, 2021. DOI: https://doi.org/10.37398/JSR.2021.650519
[12] I. Alkhairy, M. Nagy, A. H. Muse, and E. Hussam, “The Arctan-X family of distributions: Properties, simulation, and applications to actuarial sciences,” Complexity, vol. 1, pp. 4689010, 2021. DOI: https://doi.org/10.1155/2021/4689010
[13] S. Benchiha, L. P. Sapkota, A. Al Mutairi, V. Kumar, R. H. Khashab, A. M. Gemeay, M. Elgarhy, and S. G. Nassr, “A new sine family of generalized distributions: Statistical inference with applications,” Math. Comput. Appl., vol. 28, no. 4, 83, 2023. DOI: https://doi.org/10.3390/mca28040083
[14] Z. Mahmood, T. M. Jawa, N. Sayed-Ahmed, E. M. Khalil, A. H. Muse, and A. H. Tolba, “An extended cosine generalized family of distributions for reliability modeling: Characteristics and applications with simulation study,” Mathematical Problems in Engineering, vol. 1, pp. 3634698, 2022. DOI: https://doi.org/10.1155/2022/3634698
[15] H. M. Alshanbari, G. S. Rao, J. T. Seong, S. Salem, and S. K. Khosa, “A new cosine-originated probability distribution with symmetrical and asymmetrical behaviors: Repetitive acceptance sampling with reliability application,” Symmetry, vol. 15, no. 12, pp. 2187, 2023. DOI: https://doi.org/10.3390/sym15122187
[16] A. Ahmad, A. A. Rather, A. M. Gemeay, M. Nagy, L. P. Sapkota, and A. Mansi, “Novel sin-G class of distributions with an illustration of Lomax distribution: Properties and data analysis,” AIP Advances, vol. 14, no. 3, pp. 5.0180263, 2024. DOI: https://doi.org/10.1063/5.0180263
17] A. Ahmad, F. M. Alghamdi, A. Ahmad, O. Albalawi, A. A. Zaagan, M. Zakarya, E. M. Almetwally, G. T. and Mekiso, “New Arctan Generator Family of Distributions with an Example of Frechet Distribution: Simulation and Analysis to Strength of Glass and Carbon Fiber Data,” Alexandria Engineering Journal, 100: 42–52, 2024. DOI: https://doi.org/10.1016/j.aej.2024.05.021
[18] Y. A. Tashkandy, A. Elshenawy, G. Mekiso Tekle, M. E. Bakr, and O. S. Balogun, “A novel distributional approach based on the trigonometric function with medical data sets,” Alexandria Engineering Journal, vol. 96, pp. 303–322, 2024. DOI: https://doi.org/10.1016/j.aej.2024.03.076
[19] D. Zwillinger, V. Moll, I. S. Gradshteyn, and I. M. Ryzhik, “7-Definite integrals of special functions. in table of integrals, series, and products,” 8th ed. Boston: Academic Press, 776–865, 2014. DOI: https://doi.org/10.1016/B978-0-12-384933-5.00007-2
[20] U. A. Abdullahi, A. A. Suleiman, A. I. Ishaq, A. Usman, and A. Suleiman, “The Maxwell-exponential distribution: Theory and application to lifetime data,” Journal of Statistical Modeling and Analytics (JOSMA), vol. 3, 2021. DOI: https://doi.org/10.22452/josma.vol3no2.4
[21] A. S. Mohammed, B. Abba, I. Abdullahi, Y. Zakari, A. I. Ishaq, “On the characterization and applications of a three-parameter improved Weibull-Weibull distribution,” Reliability: Theory & Applications, 1 (77), pp. 551-563, 2024.
[22] A. A. Abiodun, A. I. Ishaq, O. I. Adeniyi, I. V. Omekam, J. Popoola, O. M. Oladuti, and E. O. Eunice Job, “MAXWELL-GOMPERTZ distribution: Properties and applications,” Reliability: Theory & Applications, vol. 2, no. 78, pp. 495-508, 2024.
[23] T. Proietti, and D. J. Pedregal, “Seasonality in high frequency time series,” Econometrics and Statistics, vol. 27, 62-82, 2023. DOI: https://doi.org/10.1016/j.ecosta.2022.02.001
[24] Y. Wang, and Y. Li, “Mapping the ratoon rice suitability region in China using random forest and recursive feature elimination modeling,” Field Crops Research, vol. 301, pp. 109016, 2023. DOI: https://doi.org/10.1016/j.fcr.2023.109016
[25] G. James, D. Witten, T. Hastie, R. Tibshirani, and J. Taylor, “Linear regression. In An introduction to statistical learning: With applications in python,” pp. 69-134, 2023. DOI: https://doi.org/10.1007/978-3-031-38747-0_3
[26] K. A. Addey, and J. B. D. Jatoe, “Implications of crop yield distributions for multiperil crop insurance rating in Ghana: A lasso model application,” Agricultural Finance Review, vol. 84, pp. 246-265, 2024. DOI: https://doi.org/10.1108/AFR-05-2024-0078
[27] H. Yıldırım, and M. R. Özkale, “A combination of ridge and Liu regressions for extreme learning machine,” Soft Computing, vol. 27, no. 5, pp. 2493-2508, 2023. DOI: https://doi.org/10.1007/s00500-022-07745-x
[28] H. Gâlmeanu, and R. Andonie, “Weighted Incremental–Decremental Support Vector Machines for concept drift with shifting window,” Neural Networks, vol. 152, pp. 528-541, 2022. DOI: https://doi.org/10.1016/j.neunet.2022.05.018
[29] C. Sattarhoff, and T. Lux, “Forecasting the variability of stock index returns with the multifractal random walk model for realized volatilities,” International Journal of Forecasting, vol. 39, no. 4, 1678-1697, 2023. DOI: https://doi.org/10.1016/j.ijforecast.2022.08.009
[30] D. Chicco, M. J. Warrens, and G. Jurman, “The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation,” Peerj Computer Science, vol. 7, e623, 2021. DOI: https://doi.org/10.7717/peerj-cs.623
Downloads
Published
Versions
- 2025-06-29 (2)
- 2025-06-29 (1)
How to Cite
Issue
Section
License
Copyright (c) 2025 Uthumporn Panitanarak; Aliyu Ismail Ishaq; Abubakar Usman, Ibrahim Abubakar Sadiq, Department of Statistics, Ahmadu Bello University, Zaria. Nigeria

This work is licensed under a Creative Commons Attribution 4.0 International License.
- Abstract 690
- PDF 176

