Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3077136.3080681acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

Deep Multimodal Embedding Model for Fine-grained Sketch-based Image Retrieval

Published:07 August 2017Publication History

ABSTRACT

Fine-grained Sketch-based Image Retrieval (Fine-grained SBIR), which uses hand-drawn sketches to search the target object images, has been an emerging topic over the last few years. The difficulties of this task not only come from the ambiguous and abstract characteristics of sketches with less useful information, but also the cross-modal gap at both visual and semantic level. However, images on the web are always exhibited with multimodal contents. In this paper, we consider Fine-grained SBIR as a cross-modal retrieval problem and propose a deep multimodal embedding model that exploits all the beneficial multimodal information sources in sketches and images. In our experiment with large quantity of public data, we show that the proposed method outperforms the state-of-the-art methods for Fine-grained SBIR.

References

  1. A. Chalechale, G. Naghdy, and P. Premaratne. Sketch-based shape retrieval using length and curvature of 2d digital contours. In IWCIA, pages 474--487, 2004.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Y. Wang, M. Yu, and Q. Jia. Query by sketch: An asymmetric sketch-vs-image retrieval system. CISP, 3:1368--1372, 2011. Google ScholarGoogle ScholarCross RefCross Ref
  3. Y. Li, T. M. Hospedales, Y.-Z. Song, and S. Gong. Fine-grained sketch-based image retrieval by matching deformable part models. In BMVC, 2014.Google ScholarGoogle Scholar
  4. P. Xu, Q. Yin, Y. Qi, Y-Z. Song, Z. Ma and L. Wang. Instance-Level Coupled Subspace Learning for Fine-Grained Sketch-Based Image Retrieval. In ECCV, pages 19--34, 2016. Google ScholarGoogle ScholarCross RefCross Ref
  5. Q. Yu, F. Liu, Y-Z Song, T. Xiang, T. M. Hospedales, and C-C Loy. Sketch me that shoe. In CVPR, pages 799--807, 2016 Google ScholarGoogle ScholarCross RefCross Ref
  6. P. Sangkloy, N. Burnell, C. Ham, and J. Hays. The sketchy database: learning to retrieve badly drawn bunnies. ACM TOG, 35(4):119, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. N. Dalal and T. Bill. Histograms of oriented gradients for human detection. In CVPR, pages 886--893, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. Going deeper with convolutions. In CVPR, pages 1--9, 2015. Google ScholarGoogle ScholarCross RefCross Ref
  9. R. Kiros, Y. Zhu, R. R. Salakhutdinov, R. Zemel, R. Urtasun, A. Torralba, and S. Fidler. Skip-thought vectors. In NIPS, pages 3294--3302, 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Y. Zhu, R. Kiros, R. S. Zemel, R. Salakhutdinov, R. Urtasun, A. Torralba, and S. Fidler. Aligning books and movies: Towards story-like visual explanations by watching movies and reading books. In ICCV, pages 19--27, 2015.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. Eitz, R. Richter, T. Boubekeur, et al. Sketch-based shape retrieval. ACM Trans. Graph. 31(4): 31:1--31:10, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Deep Multimodal Embedding Model for Fine-grained Sketch-based Image Retrieval

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval
      August 2017
      1476 pages
      ISBN:9781450350228
      DOI:10.1145/3077136

      Copyright © 2017 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 7 August 2017

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      SIGIR '17 Paper Acceptance Rate78of362submissions,22%Overall Acceptance Rate792of3,983submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader