Submitted by Avelina9X t3_109yuvi in MachineLearning
gunshoes t1_j42zyap wrote
Sounds like HuBERT and other MLMs used for ASR pretraining. Look for seq2seq work in the world of TTS and ASR.
Avelina9X OP t1_j4vn6su wrote
Ahhhh! So it seems like this is something that's been explored in the slightly parallel domain of TTS and ASR rather than in pure text LMs. Thanks for pointing me in this direction!
gunshoes t1_j4voru9 wrote
Trade secret for ML: your problem is always an alteration of a preexisting CV/speech/NLP framework.