mSLAM: Massively multilingual joint pre-training for speech and text.

AllImages Videos Books Maps News Shopping

mSLAM: Massively multilingual joint pre-training for speech and text - arXiv

Feb 3, 2022 · Our speech translation model demonstrates zero-shot text translation without seeing any text translation data, providing evidence for cross- ...

Scholarly articles for mSLAM: Massively multilingual joint pre-training for speech and text.

scholar.google.com › citations

… : Massively multilingual joint pre-training for speech …
Bapna · Cited by 92

[PDF] arXiv:2202.01374v1 [cs.CL] 3 Feb 2022

arxiv.org › pdf

Feb 3, 2022 · We present mSLAM, a multilingual Speech and. LAnguage Model that learns cross-lingual cross- modal representations of speech and text by pre ...

Speech Translation - Mohamed Anwar

anwarvic.github.io › speech-translation

mSLAM is the multilingual version of SLAM which has been pre-trained on speech data from 51 51 languages and text data from 101 101 languages. mSLAM was ...

Multilingual Speech-Text Pretraining We pre-train a ... - ResearchGate

www.researchgate.net › figure › Multilin...

We present mSLAM, a multilingual Speech and LAnguage Model that learns cross-lingual cross-modal representations of speech and text by pre-training jointly on ...

[PDF] Joint Pre-Training with Speech and Bilingual Text for Direct Speech ...

www.semanticscholar.org › paper › Joint...

A Speech2S model is proposed, which is jointly pre-trained with unpaired speech and bilingual text data for direct speech-to-speech translation tasks, ...

Alexis Conneau on X: " mSLAM: Massively multilingual joint pre ...

twitter.com › alex_conneau › status

Feb 8, 2022 · New paper: "mSLAM: Massively Multilingual Joint Pre-training for Speech and Text" mSLAM is our new 2B-param speech-text model in 100+ languages ...

People also search for

slam: a unified encoder for speech and language modeling via speech-text joint pre-training

maestro: matched speech text representations through modality matching

mSLAM GitHub

Mu2SLAM

Mu 2 SLAM: multitask, multilingual speech and language models

dl.acm.org › doi

Jul 23, 2023 · We present Mu2SLAM, a multilingual sequence-to-sequence model pre-trained jointly on unlabeled speech, unlabeled text and supervised data ...

Joint Pre-Training with Speech and Bilingual Text for Direct ... - DeepAI

deepai.org › publication › joint-pre-traini...

Oct 31, 2022 · mSLAM: Massively multilingual joint pre-training for speech and text. We present mSLAM, a multilingual Speech and LAnguage Model that learns c..

Joint Pre-Training with Speech and Bilingual Text for Direct Speech to ...

www.researchgate.net › publication › 36...

Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation ... mslam: Massively multilingual joint pre-training for speech and text.

Ankur Bapna on LinkedIn: New paper: "mSLAM

www.linkedin.com › posts › ankur-bapn...

Feb 8, 2022 · New paper: "mSLAM: Massively Multilingual Joint Pre-training for Speech and Text" mSLAM is our new 2B-param speech-text model in 100+ languages ...