Datagrom AI News Logo

DeepSeek-V3, ultra-large open-source AI, outperforms Llama and Qwen on launch

DeepSeek-V3, ultra-large open-source AI, outperforms Llama and Qwen on launch

December 26, 2024: DeepSeek-V3 Sets New Benchmark in Open AI - Chinese startup DeepSeek launched DeepSeek-V3, an ultra-large open-source AI model with 671B parameters, outperforming top models like Llama and Qwen. It utilizes a mixture-of-experts architecture and achieves superior efficiency with innovations such as auxiliary loss-free load-balancing and multi-token prediction. Trained at a fraction of the usual cost, it excels in Chinese and math benchmarks, challenging closed models like GPT-4o.

DeepSeek-V3 marks a significant step toward closing the gap between open and closed-source AI, potentially reshaping the AI landscape by offering enterprises competitive alternatives.

KEEP UP WITH THE INNOVATIVE AI TECH TRANSFORMING BUSINESS

Datagrom keeps business leaders up-to-date on the latest AI innovations, automation advances,
policy shifts, and more, so they can make informed decisions about AI tech.