pvp239
pvp239 OP t1_j2z3obd wrote
Reply to comment by Franck_Dernoncourt in [P] 🗣️ Speechbox - A new library to *unnormalize* your speech. by pvp239
Haven't done a in-depth test yet, do you have some links to other existing codebases? :-)
​
Here are some tests: https://huggingface.co/datasets?other=speechbox_punc
pvp239 OP t1_j2xoukt wrote
Reply to comment by sloganking in [P] 🗣️ Speechbox - A new library to *unnormalize* your speech. by pvp239
The way it's implemented, Whisper cannot hallucinate because it can only predict letters of the original normalized transcript or punctuation, so the algorithm in speechbox guarantees that Whisper cannot hallucinate (you can think of it as a very restricted beam search)
pvp239 OP t1_j2z3s0h wrote
Reply to comment by Finslayer in [P] 🗣️ Speechbox - A new library to *unnormalize* your speech. by pvp239
I have some examples / benchmarks here: https://huggingface.co/datasets?other=speechbox_punc