Kyutai has developed Hibiki, a 2.7 billion-parameter decoder-only model for real-time speech-to-speech and speech-to-text translation.Hibiki operates at a 12.5Hz framerate with a 2.2kbps bitrate and supports French-to-English translation while preserving voice characteristics.The model employs contextual alignment and a neural audio codec for efficient translation generation and dynamic adjustment of translation delays.Hibiki demonstrates strong performance in translation quality, speaker fidelity, and maintains a competitive latency, offering practical benefits for real-time speech translation.