menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

What is Mi...
source image

Medium

3w

read

79

img
dot

What is Microsoft’s new Phi-4-Multimodal???

  • Microsoft's Phi-4-Multimodal is a 5.6B parameter model integrating speech, vision, and text processing into a single architecture.
  • The model includes a larger vocabulary, improving multi-lingual text processing for deployment on devices or edge computing systems.
  • Phi-4-Multimodal outperforms specialized models in automatic speech recognition and speech translation tasks.
  • The model has capabilities such as mathematical reasoning, document understanding, and optical character recognition.

Read Full Article

like

4 Likes

For uninterrupted reading, download the app