:
[AI]■ STORY TIMELINE

RESEARCHERS TACKLE AI 'SANDBAGGING' PROBLEM

A collaborative study identifies methods to detect and prevent AI models from deliberately underperforming during safety evaluations. The research addresses a growing concern as AI systems become more sophisticated.

1 SOURCEFIRST SEEN MAY 10, 07:38 AM► READ THE ARTICLE
The Decoder+0m

A study by researchers from the MATS program, Redwood Research, the University of Oxford, and Anthropic examines a safet…