Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1873951.1874118acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Explicit and implicit concept-based video retrieval with bipartite graph propagation model

Authors Info & Claims
Published:25 October 2010Publication History

ABSTRACT

The major scientific problem for content-based video retrieval is the semantic gap. Generally speaking, there are two appropriate ways to bridge the semantic gap: the first one is from human perspective (top-down) and the other one is from computer perspective (bottom-up). The top-down method defines a concept lexicon from human perspective, trains the detector for each concept based on supervised learning, and then indexes the corpus with concept detectors. Since each concept has an explicit semantic meaning, we call this concept as an explicit concept. The bottom-up approach directly discovers the underlying latent topics from video corpus by machine perspective using an unsupervised learning. The video corpus is indexed subsequently by these latent topics. As opposite to explicit concepts, we name latent topics as implicit concepts. Given the explicit concept set is pre-defined and independent of the corpus, it is impossible to completely describe corpus and users' queries. On the other hand, the implicit concepts are dynamic and dependent on the corpus, which is able to fully describe corpus and users' queries. Therefore, combining explicit and implicit concepts could be a promising way to bridge the semantic gap effectively. In this paper, a Bipartite Graph Propagation Model (BGPM) is applied to automatically balance influences from explicit and implicit concepts. Concept nodes with strong connections to queries are reinforced no matter explicit or implicit. Demonstrated by the experiments on TREVID 2008 video dataset, BGPM successfully fuses explicit and implicit concepts to achieve a significant improvement on 48 search tasks.

References

  1. C. G.M. Snoek and M. Worring. Concept-Based Video Retrieval. Foundations and Trends in Information Retrieval: 2(4), 215--322, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. G. Hauptmann, et al. Video Retrieval Based on Semantic Concepts. Proceedings of the IEEE : 96(4), 602--622, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  3. A. G. Hauptmann, R. Yan, W. H. Lin, and H. Wactlar. Can high-level concepts fill the semantic gap in video retrieval? IEEE Transactions on Multimedia: 9(5), 958--966, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Y.G. Jiang, C.W. Ngo, and S.F. Chang. Semantic Context Transfer across Heterogeneous Sources for Domain Adaptive.Google ScholarGoogle Scholar
  5. X.G. Rui, M.J. Li, Z.W. Li, W.Y. Ma, Bipartite graph reinforcement model for web image annotation, In. Proc. of ACM Multimedia, 585--594, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. Blei, A. Ng, and M. Jordan. Latent dirichlet allocation. Journal of Machine Learning Research, 3, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. R. Zhao, et al. Narrowing the semantic gap-improved text-based web document retrieval using visual features. IEEE Transactions on Multimedia: 4(2), 189--200, 2002 Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Cao, Y.D. Zhang, B.L. Feng, X.F Hua, L. Bao, X. Zhang and J.T. Li. MCG-ICT-CAS TRECVID2008 Search Task Report. In. Proc. of TRECVID Workshop, 2008.Google ScholarGoogle Scholar
  9. R. Yan, A. G. Hauptmann. Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources. In Proc. of ACM SIGIR, 324--331, 2006 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Y.G. Jiang, A. Yanagawa, S.F. Chang, and C.W. Ngo. CU-VIREO374: Fusing Columbia374 and VIREO374 for Large Scale Semantic Concept Detection, Columbia University ADVENT Technical Report #223-2008-1, Aug. 2008.Google ScholarGoogle Scholar

Index Terms

  1. Explicit and implicit concept-based video retrieval with bipartite graph propagation model

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MM '10: Proceedings of the 18th ACM international conference on Multimedia
      October 2010
      1836 pages
      ISBN:9781605589336
      DOI:10.1145/1873951

      Copyright © 2010 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 25 October 2010

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      Overall Acceptance Rate995of4,171submissions,24%

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader