New joint safety testing from UK-based nonprofit Apollo Research and OpenAI set out to reduce secretive behaviors like scheming in AI models. What researchers found could complicate promising ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results