Microsoft has launched Phi-4-multimodal and Phi-4-mini, small language models (SLMs).Phi-4-multimodal integrates speech, vision, and text processing, enabling natural and context-aware interactions.The Phi-4 multimodal model surpasses Google Gemini and is comparable to OpenAI’s GPT-4o.Phi-4-mini is a text-based model suitable for reasoning, coding, and long-context tasks.