NVIDIA AI has open-sourced two models: Canary 1B Flash and Canary 180M Flash for multilingual speech recognition and translation.Both models utilize an encoder-decoder architecture with task-specific tokens and have scalable designs.Canary 1B Flash achieves high performance with low word error rates and BLEU scores on various datasets.The models support word-level and segment-level timestamping, enabling offline processing and on-device deployment.