Infinigence AI has released Megrez-3B-Omni, a 3-billion-parameter on-device multimodal large language model (LLM).
Megrez-3B-Omni is designed to analyze text, audio, and image inputs simultaneously, emphasizing on-device functionality.
The model incorporates technical features that enhance its performance across modalities, achieving high accuracy in language processing and speech understanding.
Megrez-3B-Omni demonstrates strong results in image understanding, text analysis, and speech processing, with on-device functionality for lower latency and improved privacy.