Patronus AI launched the industry’s first MLLM-as-a-Judge to evaluate AI systems interpreting images and generating text.The technology helps detect and address hallucinations and reliability issues in multimodal AI applications.Etsy is using Patronus AI's Judge-Image to verify caption accuracy for product images on its marketplace.Patronus built Judge-Image using Google’s Gemini model after comparing it with alternatives like OpenAI’s GPT-4V.The evaluation tool assesses image captions based on criteria like hallucination detection, object recognition, and text analysis.Applications for Judge-Image extend to marketing teams, enterprises, and companies dealing with document processing.Outsourcing AI evaluation to tools like Judge-Image can make strategic and economic sense for companies.Patronus offers various pricing tiers for its evaluation tool, with a free option for experimentation.The company sees itself as complementary rather than competitive with foundational model providers like Google and OpenAI.Patronus plans to expand its AI evaluation capabilities beyond images into audio assessment in the future.