Turing Award winner Yoshua Bengio, along with a group of AI researchers, has proposed 'Scientist AI' to address the risks of superintelligent agents.
The proposed system is intended to serve as a guardrail against 'unsafe agentic AIs' while also accelerating scientific progress and research.
The proposed 'Scientist AIs' would be trained to explain events based on their understanding of the world, rather than simply pursuing goals.
According to the researchers, the design aims to avoid the risks associated with reinforcement learning and is meant to become safer and more accurate as computational power increases.