AI Drug Discovery: SandboxAQ, Tazbentetol, and New
SandboxAQ integrated its physics-grounded drug discovery models into Anthropic's Claude this week, while schizophrenia drug tazbentetol…
A new AI IQ scoring site mapping 50+ models onto a human intelligence scale drew both…
SandboxAQ has integrated physics-based molecular models into Anthropic's Claude, lowering the barrier to AI-assisted drug discovery.…
A new automated tool called BenchJack found 219 reward-hacking exploits across 10 major AI agent benchmarks,…
In 2026, Recursive Language Models are topping long-context benchmarks with a shared-context architecture, a contested AI…
A startup project called AI IQ has mapped 50+ frontier language models onto a human IQ…
A new site assigning IQ scores to 50+ AI models drew praise and sharp criticism this…
A new automated auditing tool called BenchJack found 219 reward-hacking exploits across 10 popular AI agent…
A new AI IQ platform ranking 50+ language models on human intelligence scales has sparked debate,…
A new AI IQ website ranking language models on human intelligence scales has sparked intense debate,…
AI evaluation costs have reached $40,000 per comprehensive benchmark run, creating a new bottleneck that limits…
Recent AGI research milestones include efficient 8B-parameter reasoning models matching larger systems, evidence that different models…