Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
×
May 13, 2023 · Our MASCOT performs state-of-the-art performance on four major text-video retrieval benchmarks, including MSR-VTT, LSMDC, ActivityNet, and ...
Oct 27, 2023 · In this paper, we present MAsk for Semantics COmpleTion (MASCOT) based on semantic-based masked modeling. Specifically, after applying attention ...
This paper proposes Informed Semantics Completion to recover masked semantics information by aligning the masked content with the unmasked visual regions ...
Oct 30, 2023 · Retrieval. Conference Paper. Mask to Reconstruct: Cooperative Semantics Completion for Video-text Retrieval. October 2023. DOI:10.1145 ...
Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval ... masks, we propose Informed Semantics Completion to recover masked semantics ...
Clip2video: Mastering video-text retrieval via image clip. H Fang, P Xiong, L ... Mask to Reconstruct: Cooperative Semantics Completion for Video-text Retrieval.
In this paper, we present Mask for Semantics Completion (MASCOT) based on semantic-based masked modeling. Specifically, after applying attention-based video ...
Mar 21, 2023 · Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval ... Semantics Completion to recover masked semantics information ...
In this paper, we present Mask for Semantics Completion (MASCOT) based on semantic-based masked modeling. Specifically, after applying attention-based video ...
Mask to Reconstruct: Cooperative Semantics Completion for Video-text Retrieval ... Completion to recover masked semantics information by aligning the masked ...