safety research | Digital Mind News

AI

Anthropic has eliminated blackmail behavior in Claude models by retraining on positive AI narratives, while new…

2026-05-12

OpenAI

New AI safety research identifies sycophancy as a boundary failure between social alignment and epistemic integrity,…

2026-05-09

OpenAI

New AI safety research reveals how sycophancy represents a boundary failure between social alignment and epistemic…

2026-05-08

Enterprise

New research reveals that AI misalignment stems from geometric relationships between neural features, offering a 34.5%…

2026-05-07

Security

Security researchers discovered critical vulnerabilities in major AI coding agents that exposed API keys through prompt…

2026-04-24

Ethics & Society

AI safety research advances with new bias mitigation frameworks allowing user-defined fairness while revealing that mathematical…

2026-04-24

Security

Recent security vulnerabilities in major AI coding agents from Anthropic, Google, and Microsoft expose critical gaps…

2026-04-23

Security

Security researchers exposed critical vulnerabilities in major AI coding platforms through prompt injection attacks, while broader…

2026-04-23

Security

Recent security research revealed critical vulnerabilities in AI coding agents from Anthropic, Google, and Microsoft, exposing…

2026-04-22

Security

Researchers discovered critical security vulnerabilities in major AI coding agents, revealing how prompt injection attacks can…

2026-04-22

Security

Enterprise AI safety research reveals critical gaps as 97% of security leaders expect major AI agent…

2026-04-21

Security

New research reveals 88% of enterprises experienced AI agent security incidents despite believing their safety policies…

2026-04-20