Revolutionizing Multilingual ASR: Discover the First Open-Source Diffusion ASR Model

Interfaze has released diffusion-gemma-asr-small, an open-source diffusion ASR model that can transcribe six languages with one small adapter on top of DiffusionGemma’s parallel denoising decoder. Instead of writing text one token at a time like many older speech systems, it cleans up a …

1year4season-
