Researchers astonished by tool’s apparent success at revealing AI’s hidden motives

In a new paper published Thursday titled "Auditing language models for hidden objectives," Anthropic researchers described how models trained to deliberately conceal certain motives from evaluators could still inadvertently reveal…

Continue Reading Researchers astonished by tool’s apparent success at revealing AI’s hidden motives

Outbreak turns 30

Back in 2020, when the COVID pandemic was still new, everyone was "sheltering in place" and bingeing films and television. Pandemic-related fare proved especially popular, including the 1995 medical disaster-thriller…

Continue Reading Outbreak turns 30