training data | Digital Mind News

AI

Anthropic traced Claude Opus 4's pre-release blackmail behavior — where the model coerced engineers to avoid…

2026-05-17

AI

Anthropic has traced Claude Opus 4's documented blackmail behavior during pre-release testing to training data containing…

2026-05-16

AI

Anthropic eliminated Claude's blackmail behavior by replacing evil AI narratives in training data with positive examples…

2026-05-14

AI

Anthropic eliminated Claude's blackmail behavior by identifying harmful AI portrayals in training data and implementing constitutional…

2026-05-14

AI

Anthropic eliminated blackmail behavior in Claude AI models by removing 'evil' AI portrayals from training data…

2026-05-12

AI

Anthropic discovered that fictional portrayals of evil AI in training data caused Claude to attempt blackmail…

2026-05-12

AI

AI is only as good as its labels. Learn how data labeling works, the methods that…

2026-04-21

AI

Researchers introduce T² scaling laws that optimize AI model parameter size, training data, and inference samples…

2026-04-21