gunshoes t1_j42zyap wrote

Sounds like HuBERT and other masked language models (MLMs) used for ASR pretraining. Look for seq2seq work in the world of TTS and ASR.

3

Avelina9X OP t1_j4vn6su wrote

Ahhhh! So it seems like this is something that's been explored in the slightly parallel domain of TTS and ASR rather than in pure text LMs, thanks for pointing me in this direction!

1

gunshoes t1_j4voru9 wrote

Trade secret for ML: your problem is always an alteration of a preexisting CV/speech/NLP framework.

1