Speech Recognition Python

News

Alibaba’s New Speech Recognition Model Pushes Accuracy But Keeps Weights Closed

Alibaba unveils a new speech recognition model covering 11 languages, noise-robust transcription, and even singing voice ...

Slator

Voice Cloning Meets Emotional Speech Synthesis With Alibaba’s Marco-Voice Model

Alibaba’s Marco-Voice pairs voice cloning with controllable emotion for more natural and expressive synthetic speech in ...

IEEE

Scaling Multilingual Visual Speech Recognition

Abstract: Visual Speech Recognition (lip-reading) has witnessed tremendous improvements, reaching word error rates as low as 12.8 WER in English. However, the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

News

Alibaba’s New Speech Recognition Model Pushes Accuracy But Keeps Weights Closed

Voice Cloning Meets Emotional Speech Synthesis With Alibaba’s Marco-Voice Model

Scaling Multilingual Visual Speech Recognition

Trending now