Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1390156.1390230acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmlConference Proceedingsconference-collections
research-article

An asymptotic analysis of generative, discriminative, and pseudolikelihood estimators

Published:05 July 2008Publication History

ABSTRACT

Statistical and computational concerns have motivated parameter estimators based on various forms of likelihood, e.g., joint, conditional, and pseudolikelihood. In this paper, we present a unified framework for studying these estimators, which allows us to compare their relative (statistical) efficiencies. Our asymptotic analysis suggests that modeling more of the data tends to reduce variance, but at the cost of being more sensitive to model misspecification. We present experiments validating our analysis.

References

  1. Besag, J. (1975). The analysis of non-lattice data. The Statistician, 24, 179--195.Google ScholarGoogle ScholarCross RefCross Ref
  2. Bouchard, G., & Triggs, B. (2004). The trade-off between generative and discriminative classifiers. International Conference on Computational Statistics (pp. 721--728).Google ScholarGoogle Scholar
  3. Lafferty, J., McCallum, A., & Pereira, F. (2001). Conditional random fields: Probabilistic models for segmenting and labeling data. International Conference on Machine Learning (ICML). Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Lasserre, J. A., Bishop, C. M., & Minka, T. P. (2006). Principled hybrids of generative and discriminative models. Computer Vision and Pattern Recognition (CVPR) (pp. 87--94). Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Liang, P., Klein, D., & Jordan, M. I. (2008). Agreement-based learning. Advances in Neural Information Processing Systems (NIPS).Google ScholarGoogle Scholar
  6. Lindsay, B. (1988). Composite likelihood methods. Contemporary Mathematics, 80, 221--239.Google ScholarGoogle ScholarCross RefCross Ref
  7. McCallum, A., Pal, C., Druck, G., & Wang, X. (2006). Multi-conditional learning: Generative/discriminative training for clustering and classification. Association for the Advancement of Artificial Intelligence (AAAI). Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Ng, A. Y., & Jordan, M. I. (2002). On discriminative vs. generative classifiers: A comparison of logistic regression and naive Bayes. Advances in Neural Information Processing Systems (NIPS).Google ScholarGoogle Scholar
  9. Sutton, C., & McCallum, A. (2005). Piecewise training of undirected models. Uncertainty in Artificial Intelligence (UAI).Google ScholarGoogle Scholar
  10. van der Vaart, A. W. (1998). Asymptotic Statistics. Cambridge University Press.Google ScholarGoogle Scholar
  11. Varin, C. (2008). On composite marginal likelihoods. Advances in Statistical Analysis, 92, 1--28.Google ScholarGoogle ScholarCross RefCross Ref
  12. Wainwright, M. (2006). Estimating the "wrong" graphical model: Benefits in the computation-limited setting. Journal of Machine Learning Research, 7, 1829--1859. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Wainwright, M., Jaakkola, T., & Willsky, A. (2003). Treereweighted belief propagation algorithms and approximate ML estimation by pseudo-moment matching. Artificial Intelligence and Statistics (AISTATS).Google ScholarGoogle Scholar
  14. Wainwright, M., & Jordan, M. I. (2003). Graphical models, exponential families, and variational inference (Technical Report). Department of Statistics, University of California at Berkeley.Google ScholarGoogle Scholar

Index Terms

  1. An asymptotic analysis of generative, discriminative, and pseudolikelihood estimators

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            ICML '08: Proceedings of the 25th international conference on Machine learning
            July 2008
            1310 pages
            ISBN:9781605582054
            DOI:10.1145/1390156

            Copyright © 2008 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 5 July 2008

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate140of548submissions,26%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader