Mask to Reconstruct: Cooperative Semantics Completion for Video-text Retrieval.

Mask to reconstruct: Cooperative Semantics Completion for Video ...

May 13, 2023 · Our MASCOT achieves state-of-the-art performance on four major text-video retrieval benchmarks, including MSR-VTT, LSMDC, ActivityNet, and ...

Cooperative Semantics Completion for Video-text Retrieval

dl.acm.org › doi

Oct 27, 2023 · In this paper, we present MAsk for Semantics COmpleTion (MASCOT) based on semantic-based masked modeling. Specifically, after applying attention ...

Mask to Reconstruct: Cooperative Semantics Completion for Video ...

www.semanticscholar.org › paper › Mas...

This paper proposes Informed Semantics Completion to recover masked semantics information by aligning the masked content with the unmasked visual regions ...

Cooperative Semantics Completion for Video-text Retrieval

www.researchgate.net › ... › Retrieval

Oct 30, 2023 · Retrieval. Conference Paper. Mask to Reconstruct: Cooperative Semantics Completion for Video-text Retrieval. October 2023. DOI:10.1145 ...

Video-Text Retrieval | Papers With Code

paperswithcode.com › task › codeless

Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval ... masks, we propose Informed Semantics Completion to recover masked semantics ...

Han Fang - Google Scholar

scholar.google.com.hk › citations

Clip2video: Mastering video-text retrieval via image clip. H Fang, P Xiong, L ... Mask to Reconstruct: Cooperative Semantics Completion for Video-text Retrieval.

similar - arxiv-sanity

arxiv-sanity-lite.com › ...

In this paper, we present Mask for Semantics Completion (MASCOT) based on semantic-based masked modeling. Specifically, after applying attention-based video ...

Zhifei Yang - Papers With Code

paperswithcode.com › author › zhifei-yang

Mar 21, 2023 · Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval ... Semantics Completion to recover masked semantics information ...

Chao Ban - CatalyzeX

www.catalyzex.com › author

In this paper, we present Mask for Semantics Completion (MASCOT) based on semantic-based masked modeling. Specifically, after applying attention-based video ...

HunYuan_tvr for Text-Video Retrivial | Semantic Scholar

www.semanticscholar.org › paper › Hun...

Mask to Reconstruct: Cooperative Semantics Completion for Video-text Retrieval ... Completion to recover masked semantics information by aligning the masked ...