menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

Kyutai Rel...
source image

Marktechpost

1M

read

95

img
dot

Kyutai Releases Hibiki: A 2.7B Real-Time Speech-to-Speech and Speech-to-Text Translation with Near-Human Quality and Voice Transfer

  • Kyutai has developed Hibiki, a 2.7 billion-parameter decoder-only model for real-time speech-to-speech and speech-to-text translation.
  • Hibiki operates at a 12.5Hz framerate with a 2.2kbps bitrate and supports French-to-English translation while preserving voice characteristics.
  • The model employs contextual alignment and a neural audio codec for efficient translation generation and dynamic adjustment of translation delays.
  • Hibiki demonstrates strong performance in translation quality, speaker fidelity, and maintains a competitive latency, offering practical benefits for real-time speech translation.

Read Full Article

like

5 Likes

For uninterrupted reading, download the app