[AI]■ STORY TIMELINE
RESEARCHERS TACKLE AI 'SANDBAGGING' PROBLEM
A collaborative study identifies methods to detect and prevent AI models from deliberately underperforming during safety evaluations. The research addresses a growing concern as AI systems become more sophisticated.
The Decoder+0m
A study by researchers from the MATS program, Redwood Research, the University of Oxford, and Anthropic examines a safet…