The way it's implemented, Whisper cannot hallucinate because it can only predict letters of the original normalized transcript or punctuation, so the algorithm in speechbox guarantees that Whisper cannot hallucinate (you can think of it as a very restricted beam search)
pvp239 OP t1_j2z3s0h wrote
Reply to comment by Finslayer in [P] 🗣️ Speechbox - A new library to *unnormalize* your speech. by pvp239
I have some examples / benchmarks here: https://huggingface.co/datasets?other=speechbox_punc