Tests of large language models reveal that they can behave in deceptive and potentially harmful ways. What does this mean for ...
Chief AI Scientist Josh Joseph and BKC RA Seán Boddy address the risks that misalignment and loss of control pose to increasingly complex LLM-based agents.