AI benchmarks | Digital Mind News

AI

TokenArena introduces energy-aware AI benchmarking across 78 endpoints, revealing 6.2x efficiency variations and 12.5-point accuracy differences…

2026-05-11

AI

xAI launched Grok 4.3 at $1.25 per million input tokens, significantly undercutting competitors while showing domain-specific…

2026-05-10

AI

xAI launched Grok 4.3 with aggressive pricing at $1.25 per million input tokens, showing performance gains…

2026-05-09

AI

TokenArena introduces energy-aware AI benchmarking that measures models at endpoint granularity, revealing substantial performance variations and…

2026-05-08

AI

DeepSeek released its V4 model achieving near state-of-the-art AI benchmark performance while pricing API access at…

2026-05-08

OpenAI

OpenAI's GPT-5.5 reclaims benchmark leadership while specialized tests reveal significant performance gaps between AI models. New…

2026-04-25

Enterprise

AI models achieve record benchmark scores with Claude Opus 4.6 reaching 94.1% accuracy on technical reasoning…

2026-04-24

AI Agents

Enterprise AI systems are achieving record-breaking benchmark scores in 2026, with Google's Deep Research agents, Anthropic's…

2026-04-23

AI Agents

Google and Anthropic have launched breakthrough AI tools setting new benchmark records, with Google's Deep Research…

2026-04-23

Enterprise

Major AI companies are setting new benchmark records with enterprise-focused tools like Anthropic's Claude Design and…

2026-04-21

AI

The AI industry is shifting focus from traditional benchmark scores to practical applications and real-world performance.…

2026-04-20

Enterprise

Anthropic's Claude Design launch and new Train-to-Test scaling laws are reshaping enterprise AI strategy. These developments…

2026-04-20