Jargonic Sets New SOTA for Japanese ASR

1317 · 2025-05-07T13:59:00 1746626340

SOTA: not used in the article but probably State Of The Art

ASR: Automatic Speech Recognition, speech-to-text

lenerdenator · 2025-05-07T14:13:37 1746627217

And here I was, as a ham radio operator, excited to read something about Summits On The Air.

shuffles dejectedly back to shack

rfv6723 · 2025-05-07T13:46:29 1746625589

Why no comparition to gpt-4o-transcribe？

If you don't compare to latest model on the market, how can you claim it's SOTA?

According to OpenAI, gpt-4o-transcribe has much better performance than whisper-large-v2.

albertzeyer · 2025-05-07T14:03:52 1746626632

Are there any details on what they changed to improve over other existing models?

（评论） (comments)