May 29, 2024: Scale AI Introduces LLM Performance Rankings - Scale AI launches its first SEAL Leaderboards, ranking large language models (LLMs) on performance across specific domains like coding, multilinguality, and math. OpenAI's GPT models and Google's Gemini excel, with Anthropic's Claude 3 Opus leading in math. The rankings, aimed at providing transparency in AI capabilities, derive from evaluations using private datasets and will be updated periodically to include new models and domains.