March 14, 2025:
Patronus AI Unveils Multimodal Judge for Images - Patronus AI has released the Multimodal LLM-as-a-Judge, a tool utilizing Google Gemini to refine multimodal AI systems for image-to-text tasks. The Judge-Image tool aids developers by evaluating text presence, object description, and spatial accuracy while detecting caption hallucination and verifying brand asset accuracy.
Already used by Etsy, this tool aims to improve predictability and reliability in AI-driven applications. Upcoming updates will broaden its capabilities to include audio and vision evaluations, enhancing its utility for developers.