Peter Dayan


Peter Dayan is a British neuroscientist and computer scientist who is director at the Max Planck Institute for Biological Cybernetics in Tübingen, Germany, along with Ivan De Araujo. He has pioneered the field of reinforcement learning where he and his colleagues proposed that dopamine signals reward prediction error, and helped develop the Q-learning algorithm. He is co-author of Theoretical Neuroscience, an influential textbook on computational neuroscience. He is also known for applying Bayesian methods from machine learning and artificial intelligence to understand neural function, and is particularly recognized for relating neurotransmitter levels to prediction errors and Bayesian uncertainties. He made contributions to unsupervised learning, including the wake-sleep algorithm for neural networks and the Helmholtz machine.

Education

Dayan studied mathematics at the University of Cambridge and then continued for a PhD in artificial intelligence at the University of Edinburgh School of Informatics on statistical learning supervised by David Willshaw and David Wallace, focusing on associative memory and reinforcement learning.

Career and research

After his PhD, Dayan held postdoctoral research positions with Terry Sejnowski at the Salk Institute and Geoffrey Hinton at the University of Toronto. He then took up an assistant professor position at the Massachusetts Institute of Technology, and moved to the Gatsby Charitable Foundation computational neuroscience unit at University College London in 1998, becoming professor and director in 2002. In September 2018, the Max Planck Society announced his appointment as a director at the Max Planck Institute for Biological Cybernetics in Tübingen.

Awards and honours

Dayan was elected a Fellow of the Royal Society in 2018. In 2023, he was elected a member of the Academia Europaea. He was awarded the Rumelhart Prize in 2012 and The Brain Prize in 2017.