June 17, 2025:
Google Unveils Gemini 2.5 LLM Updates and Pricing - Google has launched Gemini 2.5 Flash-Lite, an entry-level large language model aimed at faster and more cost-effective processing. This model is part of the Gemini 2.5 series, which also includes the Flash and Pro models now generally available, along with updated pricing structures. Trained with TPUv5p chips, these models are multimodal, supporting up to 1 million tokens per prompt.
Flash-Lite is particularly effective for tasks like translation, with costs of 10 cents per million input tokens and 40 cents per million output tokens. The pricing updates for the Flash model involve changes to input and output token costs and the removal of separate fees for the thinking mode, simplifying the pricing model for users.