Richard S. Sutton | |
---|---|
![]() Sutton in 2021 | |
Born | 1957 or 1958 (age 67–68) Ohio, U.S. |
Citizenship | Canadian |
Education | Stanford University (BA) University of Massachusetts, Amherst (MS, PhD) |
Known for | Temporal difference learning, Dyna, Options, GQ(λ) |
Awards | AAAI Fellow (2001) President's Award (INNS) (2003) Royal Society of Canada Fellow (2016) Turing Award (2025) |
Scientific career | |
Fields | Artificial intelligence Reinforcement learning |
Institutions | University of Alberta |
Thesis | Temporal credit assignment in reinforcement learning (1984) |
Doctoral advisor | Andrew Barto |
Doctoral students | David Silver Doina Precup |
Website | incompleteideas |
Richard Stuart Sutton[1] FRS FRSC (born 1957 or 1958) is a Canadian computer scientist. He is a professor of computing science at the University of Alberta, fellow & Chief Scientific Advisor at the Alberta Machine Intelligence Institute, and a research scientist at Keen Technologies.[2] Sutton is considered one of the founders of modern computational reinforcement learning,[3] having several significant contributions to the field, including temporal difference learning and policy gradient methods.[4]
Piatetsky-2017
was invoked but never defined (see the help page).
© MMXXIII Rich X Search. We shall prevail. All rights reserved. Rich X Search