Deriving the Fisher information matrix for a multinomial distribution with non-proportional odds in a baseline category logit model

Authors

  • Ibrahim Yusuf Ibrahim Department of Mathematical Sciences, Bayero University, Kano, Nigeria https://orcid.org/0009-0006-8460-8524
  • Abubakar Yahaya Department of Statistics, Ahmadu Bello University, Zaria, Nigeria
  • Tasi’u Musa Department of Statistics, Ahmadu Bello University, Zaria, Nigeria
  • Yahaya Zakari Department of Statistics, Ahmadu Bello University, Zaria, Nigeria

DOI:

https://doi.org/10.64497/jssci.103

Keywords:

Fisher Information Matrix, Baseline-Category, Score Function, Maximum Likelihood Function, Non-Proportional Odds-Assumptions, Desing Point and Design space

Abstract

This paper derives the Fisher Information Matrix (FIM) using Maximum Likelihood Estimation (MLE) for a multinomial distribution with four categories under a baseline category logit model incorporating non-proportional odds assumptions. The baseline category logit model is a type of generalized linear model used for categorical response variables, where one category is treated as a baseline, and the log-odds of the other categories relative to the baseline are modelled as linear functions of the covariates. In this study, we consider a multinomial distribution with four outcome categories, and assume the odds are not proportional across the covariates, allowing the relationship between covariates and outcomes to vary by category. We derive the log-likelihood function for this model, then compute the first and second derivatives with respect to the model parameters to obtain the score function and the observed information matrix. The Fisher Information Matrix is then derived as the expected value of the observed information matrix. This derivation is essential for understanding the precision of parameter estimates obtained via MLE, as the FIM provides a measure of the information available in the data about the parameters. Additionally, the FIM is crucial for constructing confidence intervals and hypothesis tests for the parameters. Our results have broad applications in various fields such as epidemiology, social sciences, and marketing, where multinomial response models with complex covariate structures are commonly used. The theoretical development is complemented by a practical example demonstrating the implementation of these methods using simulated.

Downloads

Download data is not yet available.

References

[1] D. W. Hosmer, S. Lemeshow, and R. X. Sturdivant, Applied Logistic Regression, 3rd ed. Hoboken, NJ: Wiley, 2013. doi: 10.1002/9781118548387.

[2] A. Agresti, Analysis of Ordinal Categorical Data, 2nd ed. Hoboken, NJ: Wiley, 2010. doi: 10.1002/9780470594001.

[3] Y. Pawitan, In All Likelihood: Statistical Modelling and Inference Using Likelihood. Oxford: Oxford Univ. Press, 2001.

[4] J. A. Anderson, “Regression and ordered categorical variables,” J. R. Stat. Soc. Ser. B (Methodological), vol. 46, no. 1, pp. 1–30, 1984.

[5] A. Agresti, Categorical Data Analysis, 2nd ed. Hoboken, NJ: Wiley, 2002. doi: 10.1002/0471249688.

[6] R. A. Fisher, “On the mathematical foundations of theoretical statistics,” Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character, vol. 222, pp. 309–368, 1922. doi: 10.1098/rsta.1922.0009.

[7] H. Cramér, Mathematical Methods of Statistics. Princeton, NJ: Princeton Univ. Press, 1946.

[8] B. Efron and D. V. Hinkley, “Assessing the accuracy of the maximum likelihood estimator: Observed versus expected Fisher information,” Biometrika, vol. 65, no. 3, pp. 457–487, 1978. doi: 10.1093/biomet/65.3.457.

[9] X. Bu, D. Majumdar, and J. Yang, “D-optimal designs for multinomial logistic models,” Annals of Statistics, vol. 48, no. 2, pp. 983–1001, 2020. doi: 10.1214/19-AOS1830.

[10] B. Peterson and F. E. Harrell, “Partial proportional odds models for ordinal response variables,” Appl. Stat., vol. 39, no. 2, pp. 205–217, 1990. doi: 10.2307/2347760.

[11] D. Hedeker, “A mixed-effects multinomial logistic regression model,” Statistics in Medicine, vol. 22, no. 9, pp. 1433–1446, 2003. doi: 10.1002/sim.1522.

[12] A. C. Atkinson, C. G. B. Demetrio, and S. Zocchi, “Optimum dose levels when males and females differ in response,” J. Appl. Stat., vol. 44, no. 2, pp. 213–226, 1995. doi: 10.2307/2986197.

[13] H. Dette and K. Schorning, “Complete classes of designs for nonlinear regression models and principal representations of moment spaces,” Annals of Statistics, vol. 41, no. 3, pp. 1260–1267, 2013. doi: 10.1214/12-AOS1080.

[14] R. H. Myers, D. C. Montgomery, and C. G. Vining, Generalized Linear Models with Applications in Engineering and Science, 2nd ed. Hoboken, NJ: Wiley, 2002.

[1] D. W. Hosmer, S. Lemeshow, and R. X. Sturdivant, Applied Logistic Regression, 3rd ed. Hoboken, NJ: Wiley, 2013. doi: 10.1002/9781118548387. DOI: https://doi.org/10.1002/9781118548387

[2] A. Agresti, Analysis of Ordinal Categorical Data, 2nd ed. Hoboken, NJ: Wiley, 2010. doi: 10.1002/9780470594001. DOI: https://doi.org/10.1002/9780470594001

[3] Y. Pawitan, In All Likelihood: Statistical Modelling and Inference Using Likelihood. Oxford: Oxford Univ. Press, 2001. DOI: https://doi.org/10.1093/oso/9780198507659.001.0001

[4] J. A. Anderson, “Regression and ordered categorical variables,” J. R. Stat. Soc. Ser. B (Methodological), vol. 46, no. 1, pp. 1–30, 1984. DOI: https://doi.org/10.1111/j.2517-6161.1984.tb01270.x

[5] A. Agresti, Categorical Data Analysis, 2nd ed. Hoboken, NJ: Wiley, 2002. doi: 10.1002/0471249688. DOI: https://doi.org/10.1002/0471249688

[6] R. A. Fisher, “On the mathematical foundations of theoretical statistics,” Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character, vol. 222, pp. 309–368, 1922. doi: 10.1098/rsta.1922.0009. DOI: https://doi.org/10.1098/rsta.1922.0009

[7] H. Cramér, Mathematical Methods of Statistics. Princeton, NJ: Princeton Univ. Press, 1946. DOI: https://doi.org/10.1515/9781400883868

[8] B. Efron and D. V. Hinkley, “Assessing the accuracy of the maximum likelihood estimator: Observed versus expected Fisher information,” Biometrika, vol. 65, no. 3, pp. 457–487, 1978. doi: 10.1093/biomet/65.3.457. DOI: https://doi.org/10.2307/2335893

[9] X. Bu, D. Majumdar, and J. Yang, “D-optimal designs for multinomial logistic models,” Annals of Statistics, vol. 48, no. 2, pp. 983–1001, 2020. doi: 10.1214/19-AOS1830. DOI: https://doi.org/10.1214/19-AOS1834

[10] B. Peterson and F. E. Harrell, “Partial proportional odds models for ordinal response variables,” Appl. Stat., vol. 39, no. 2, pp. 205–217, 1990. doi: 10.2307/2347760. DOI: https://doi.org/10.2307/2347760

[11] D. Hedeker, “A mixed-effects multinomial logistic regression model,” Statistics in Medicine, vol. 22, no. 9, pp. 1433–1446, 2003. doi: 10.1002/sim.1522. DOI: https://doi.org/10.1002/sim.1522

[12] A. C. Atkinson, C. G. B. Demetrio, and S. Zocchi, “Optimum dose levels when males and females differ in response,” J. Appl. Stat., vol. 44, no. 2, pp. 213–226, 1995. doi: 10.2307/2986197. DOI: https://doi.org/10.2307/2986346

[13] H. Dette and K. Schorning, “Complete classes of designs for nonlinear regression models and principal representations of moment spaces,” Annals of Statistics, vol. 41, no. 3, pp. 1260–1267, 2013. doi: 10.1214/12-AOS1080. DOI: https://doi.org/10.1214/13-AOS1108

[14] R. H. Myers, D. C. Montgomery, and C. G. Vining, Generalized Linear Models with Applications in Engineering and Science, 2nd ed. Hoboken, NJ: Wiley, 2002.

[15] S. P. Singh, M. Siuli, and R. Harsh, “Min-max crossover designs for two treatments in binary and Poisson crossover trials,” Statistical Papers, vol. 62, pp. 53–74, 2021. doi: 10.1007/s00362-018-01141-0. DOI: https://doi.org/10.1007/s11222-021-10029-3

Downloads

Published

2025-09-13

How to Cite

Ibrahim, I. Y., Yahaya, A., Musa, T., & Zakari, Y. (2025). Deriving the Fisher information matrix for a multinomial distribution with non-proportional odds in a baseline category logit model. Journal of Statistical Sciences and Computational Intelligence, 1(3), 215–226. https://doi.org/10.64497/jssci.103
Views
  • Abstract 553
  • PDF 190

Similar Articles

1 2 3 4 > >> 

You may also start an advanced similarity search for this article.