Submitted by jplhughes t3_11prxd9 in MachineLearning
[removed]
Excellent demo on your page, I just used it on a YT video featuring a non-native English speaker. There was only a slight error in punctuation due to an ambiguously long pause in the speech.
Is this a purely commercial product or will there be an open source release?
What is the difference between Standard at $1.25/hr and Enhanced at $1.90/hr?
Seems like there's a very generous free tier, then super cheap after that.
Seems super cheap to me tbf - no problem with paying for stuff like this.
Research pub or Gtfo
"Hey, we made this commercial tool that is better than open source!"
This is incredible
Release the model. It wants to be free.
Pretty sure commercial product only. Speechmatics has never open-sourced any of their models.
Any way to get the encoded speech features?
Can confirm it is better than Whisper. It doesn't randomly go off the rails either, but I don't wanna have to pay 😅
Are there WER figures for other languages, like on the GitHub page for Whisper? I want to compare the performance in other languages.
Why is this tagged [R]? This is a commercial project at best. Where's the paper, where's the code? Can we use it today on our PC like Whisper? This really isn't 'research'.
Does it support Ukrainian and Russian?
[removed]
Wav2vec2 is still SOTA. As long as this isn’t open source, it’s kinda useless lmao
>25% improvement over Whisper
>Not open source
>doubt.jpeg
Typical, basing your research on open source projects and then making a commercial product on top of other people's work. Great achievement.
Yes, that's probably just cherry-picked marketing.
So is this post kind of a hidden advertisement or what?
my guess is model size
I tested it using Japanese and it seems like it misses punctuation for the most part. But, overall, seems to be doing a good job getting the words.
Which metric are you basing this on? I'm not deep in ASR, but in the Whisper paper it is compared to wav2vec 2.0, and Whisper is better in most categories.
It’s fine, open source SOTA will make them irrelevant sooner rather than later
Removed after LOTS of reports. See rules #3 and #8 in the sidebar.
rshah4 t1_jbzvzsl wrote
Is it open source?