Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

According to [1] on the accompanying blog post, this brings the Whisper 44.3 WER down to 18.7, although it’s unclear to me how much better this is at primarily English speech recognition. I’d love to see a full comparison of accuracy improvements as well as a proper writeup of how much more power it takes to run this in production or on mobile vs something like whisper.

[1]: https://scontent-sjc3-1.xx.fbcdn.net/v/t39.8562-6/346801894_...



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: