In this paper, we propose a novel method to mine the commonsense knowledge shared between the video and text modalities for video-text retrieval, ...
Jun 28, 2022 · In this paper, we propose a novel method to mine the commonsense knowledge shared between the video and text modalities for video-text retrieval ...
This paper makes the first attempt to model the visual consensus by mining the visual concepts from videos and exploiting their co-occurrence patterns ...
A curated list of deep learning resources for video-text retrieval. - danieljf24/awesome-video-text-retrieval. ... AAAI22] Visual Consensus Modeling for Video- ...
Apr 7, 2024 · Figure 1: Existing video retrieval works focus on improving representation learning ability by learning from benchmarks that have many ...
Dec 6, 2023 · With the text data at hand, we learn and update the shared sparse space in a supervised manner using the proposed similarity and alignment ...
Visual Consensus Modeling for Video-Text Retrieval · Shuqiang Cao, Bairui Wang ... This paper makes the first attempt to model the visual consensus by mining the ...
Oct 10, 2022 · To fill this gap, in this work, we propose a novel visual-linguistic aligning model named HiSE for VTR, which improves the cross-modal ...