[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ Skip to main content
Log in

The EM algorithm with gradient function update for discrete mixtures with known (fixed) number of components

  • Published:
Statistics and Computing Aims and scope Submit manuscript

Abstract

The paper is focussing on some recent developments in nonparametric mixture distributions. It discusses nonparametric maximum likelihood estimation of the mixing distribution and will emphasize gradient type results, especially in terms of global results and global convergence of algorithms such as vertex direction or vertex exchange method. However, the NPMLE (or the algorithms constructing it) provides also an estimate of the number of components of the mixing distribution which might be not desirable for theoretical reasons or might be not allowed from the physical interpretation of the mixture model. When the number of components is fixed in advance, the before mentioned algorithms can not be used and globally convergent algorithms do not exist up to now. Instead, the EM algorithm is often used to find maximum likelihood estimates. However, in this case multiple maxima are often occuring. An example from a meta-analyis of vitamin A and childhood mortality is used to illustrate the considerable, inferential importance of identifying the correct global likelihood. To improve the behavior of the EM algorithm we suggest a combination of gradient function steps and EM steps to achieve global convergence leading to the EM algorithm with gradient function update (EMGFU). This algorithms retains the number of components to be exactly k and typically converges to the global maximum. The behavior of the algorithm is highlighted at hand of several examples.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  • Böhning D. 1982. Convergence of Simar's algorithm for finding the maximum likelihood estimate of a compound Poisson process. Annals of Statistics 10: 1006–1008.

    Google Scholar 

  • Böhning D. 2000. Computer-Assisted Analysis of Mixtures and Applications. Meta-Analysis, Disease Mapping and Others. Chapman & Hall/CRC, Boca Raton.

    Google Scholar 

  • Carlin B.P. and Louis T.A. 2000. Bayes and Empirical Bayes Methods for Data Analysis. Chapman & Hall, London.

    Google Scholar 

  • Celeux G. 2001. Different points of view for choosing the number of components in a mixture model. In: Govaert G., Janssen J., and Limnios N. (Eds.), Proceedings of the 10th International Symposium on Applied Stochastic Models and Data Analysis, June 12-15. Compiegne, pp. 21–28.

  • Dempster A.P., Laird N.M., and Rubin D.B. 1977. Maximum likelihood estimation from incomplete data via the EM algorithm (with Discussion). Journal of the Royal Statistical Society B 39: 1–38.

    Google Scholar 

  • Fawzi W.W., Chalmers T.C., Herrera M.G., and Mosteller F. 1993. Vitamin A supplementation and child mortality. A meta-analysis. Journal of the American Medical Association 269: 898–903.

    Google Scholar 

  • Hasselblad V. 1969. Estimation of finite mixtures of distributions from the exponential family. Journal of the American Statistical Association 64: 1459–1471.

    Google Scholar 

  • Laird N.M. 1978. Nonparametric maximum likelihood estimation of a mixing distribution. Journal of the American Statistical Association 73: 805–811.

    Google Scholar 

  • Leroux B.G. 1992. Consistent estimation of a mixing distribution. Annals of Statistics 20: 1350–1360.

    Google Scholar 

  • Lim, T.-O., Bakri R., Morad Z., and Hamid M.A. 2001. Bimodality in blood glucose distribution: Is it universal? Preprint of the Clinical Research Centre, c/o Department of Nephrology, Kualalumpur Hospital, Kualalumpur, Malaysia.

  • Lindsay B.G. 1983. The geometry of mixture likelihoods, Part I: A general theory. Annals of Statistics 11: 783–792.

    Google Scholar 

  • Mclachlan G. and Peel D. 2000. Finite Mixture Models. Wiley, New York.

    Google Scholar 

  • Seidel W., Mosler K., and Alker M. 2000. A cautionary note on likelihood ratio tests in mixture models. Annals of the Institute of Statistical Mathematics 52: 481–487.

    Google Scholar 

  • Simar L. 1976. Maximum likelihood estimation of a compound Poisson process. Annals of Statistics 4: 1200–1209.

    Google Scholar 

  • Thyrion P. 1960. Contribution à l'étude du bonus pour non sinsitre en assurance automobile. Astin Bulletin 1: 142–162.

    Google Scholar 

  • Titterington D.M., Smith A.F.M., and Makov U.E. 1985. Statistical Analysis of Finite Mixture Distributions. Wiley, New York.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Böhning, D. The EM algorithm with gradient function update for discrete mixtures with known (fixed) number of components. Statistics and Computing 13, 257–265 (2003). https://doi.org/10.1023/A:1024222817645

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1024222817645

Navigation