Browsing by Author "Mitrani, Nathaniel"
Now showing items 1-1 of 1
-
Beyond Algorithms: understanding the Challenges of AI Safety
Mitrani, Nathaniel (Universitat Politècnica de Catalunya, 2024)
Coursework
Open AccessWe go over the different ways an AI system might not behave as we intend it to, highlighting the importance and increasing need for research in this direction. We introduce AI safety, and the challenges in Reinforcement ...