In case you missed it, DeepSeek has updated its V3 model. The latest version, DeepSeek V3-0324, is now the highest-scoring non-reasoning AI model. It has jumped ahead of Gemini 2.0 Pro and GPT-4.5 and is on par with Grok 3, according to the Artificial Analysis Intelligence Index. Here are the benchmark improvements:
- MMLU-Pro: 75.9 → 81.2 (+5.3)
- GPQA: 59.1 → 68.4 (+9.3)
- AIME: 39.6 → 59.4 (+19.8)
- LiveCodeBench: 39.2 → 49.2 (+10.0)
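As a quick sanity check on the figures above, the gains can be recomputed from the before and after scores. This is only a minimal sketch: the `scores` dictionary and the `gains` helper are illustrative names, and the relative-gain column is derived here rather than taken from any published figure.

```python
# Scores copied from the list above: (previous score, V3-0324 score).
scores = {
    "MMLU-Pro": (75.9, 81.2),
    "GPQA": (59.1, 68.4),
    "AIME": (39.6, 59.4),
    "LiveCodeBench": (39.2, 49.2),
}


def gains(old, new):
    """Return (absolute gain in points, relative gain in percent), each rounded to 1 decimal."""
    absolute = round(new - old, 1)
    relative = round((new - old) / old * 100, 1)
    return absolute, relative


# Hypothetical helper structure: benchmark name -> (points gained, percent gained).
results = {name: gains(old, new) for name, (old, new) in scores.items()}

for name, (points, percent) in results.items():
    print(f"{name}: +{points} points ({percent}% relative to the previous score)")
```

Looking at relative gains alongside absolute points makes it easier to compare benchmarks that start from very different baselines.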
The model also generates better-looking web pages and game front-ends, and it offers higher-quality medium-to-long-form writing.
DeepSeek takes the lead: DeepSeek V3-0324 is now the highest scoring non-reasoning model
This is the first time an open weights model is the leading non-reasoning model, a milestone for open source.
DeepSeek V3-0324 has jumped forward 7 points in Artificial Analysis… pic.twitter.com/t7geJnQiBs
— Artificial Analysis (@ArtificialAnlys) March 25, 2025