Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
×
In this paper, we propose a novel method to mine the com- monsense knowledge shared between the video and text modalities for video-text retrieval, ...
Jun 28, 2022 · In this paper, we propose a novel method to mine the commonsense knowledge shared between the video and text modalities for video-text retrieval ...
Abstract: In this paper, we propose a novel method to mine the commonsense knowledge shared between the video and text modalities for video-text retrieval, ...
People also ask
This paper makes the first attempt to model the visual consensus by mining the visual concepts from videos and exploiting their co-occurrence patterns ...
In this paper, we propose a novel method to mine the commonsense knowledge shared between the video and text modalities for video-text retrieval, ...
A curated list of deep learning resources for video-text retrieval. - danieljf24/awesome-video-text-retrieval. ... AAAI22] Visual Consensus Modeling for Video- ...
Apr 7, 2024 · Figure 1: Existing video retrieval works focus on improving representation learning ability by learning from benchmarks that have many ...
Dec 6, 2023 · With the text data at hand, we learn and up- date the shared sparse space in a supervised manner using the proposed similarity and align- ment ...
Visual Consensus Modeling for Video-Text Retrieval · Shuqiang CaoBairui Wang ... This paper makes the first attempt to model the visual consensus by mining the ...
Oct 10, 2022 · To fill this gap, in this work, we propose a novel visual-linguistic aligning model named HiSE for VTR, which improves the cross-modal ...