/Tech2h ago

Researchers Fine-Tune Whisper for SOTA Speech Recognition in 89 Languages

2420290

#985

Original post

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen#985inTech

What they did? Cleaned data, took a big open speech model (whisper) changed the tokenizer and fine-tuned per data.

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

89 languages sota speech models. There's plenty of speech data it appears, so the simplest fine-tuning plus tokenization on public data just improves everything substantially. And it's not even with any tricks or all the data... #conll #acl

4:36 PM · Jul 3, 2026 · 56 Views

Sentiment

Users express optimism about BuzzASR's speech recognition advances enabling much better models through collaborative data cleaning and larger-scale training.

Pos

100.0%

Neg

0.0%

1 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS234LIKES4RETWEETS2REPLIES1

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

2h23440

Leshem (Legend) Choshen 🤖🤗 @ACL @ICML@LChoshen

Call for help: there's a lot more data that needs cleaning, if we do we can train on a scale larger of models and create much much better models. Ones that are likely to be the best in tons of languages for years. Let's help them do that. Reply for suggestions or questions

2h24