Preface xi
We are very grateful for the excellent feedback – narrative and technical –
provided to us by Adam White and our anonymous reviewers, which allowed
us to make substantial improvements on the original draft. We thank Rich
Sutton, Andy Barto, Csaba Szepesvári, Kevin Murphy, Aaron Courville, Doina
Precup, Prakash Panangaden, David Silver, Joelle Pineau, and Dale Schuur-
mans, for discussions on book writing and serving as role models on taking
an effort larger than anything else we had previously done. We appreciate the
technical and conceptual input of many of our colleagues at Google, DeepMind,
Mila, and beyond: Bernardo Avila Pires, Jason Baldridge, Pierre-Luc Bacon,
Yoshua Bengio, Michael Bowling, Sal Candido, Peter Dayan, Thomas Degris,
Audrunas Gruslys, Hado van Hasselt, Shie Mannor, Volodymyr Mnih, Derek
Nowrouzezahrai, Adam Oberman, Bilal Piot, Tom Schaul, Danny Tarlow, and
Olivier Pietquin. We further thank the many people who reviewed parts of
this book and helped fill in some of the gaps in our knowledge: Yinlam Chow,
Erick Delage, Pierluca D’Oro, Doug Eck, Amir-massoud Farahmand, Jesse
Farebrother, Chris Finlay, Tadashi Kozuno, Hugo Larochelle, Elliot Ludvig,
Andrea Michi, Blake Richards, Daniel Slater, and Simone Totaro. We further
thank Vektor Dewanto, Tyler Kastner, Karolis Ramanauskas, Rylan Schaeffer,
Eugene Tarassov, and Jun Tian for their feedback on the online draft and the
COMP-579 students at McGill University for beta-testing our presentation of
the material. We were lucky to perform this research within DeepMind and
Google Brain, which provided support both moral and material and inspiration
to take on ever larger challenges. Finally, we thank Francis Bach, Elizabeth
Swayze, Matt Valades, and the team at MIT Press for championing this work
and making it a possibility.
Marc gives further thanks to Judy Loewen, Frédéric Lavoie, Jacqueline Smith,
Madeleine Fugère, Samantha Work, Damon MacLeod, and Andreas Fidjeland,
for support along the scientific journey, and to Lauren Busheikin, for being
an incredibly supportive partner. Further thanks go to CIFAR and the Mila
academic community for providing the fertile scientific ground from which the
writing of this book began.
Will wishes to additionally thank Zeb Kurth-Nelson and Matt Botvinick for
their patience and scientific rigor as we explored distributional reinforcement
learning in neuroscience; Koray Kavukcuoglu and Demis Hassabis for their
enthusiasm and encouragement surrounding the project; Rémi Munos for sup-
porting our pursuit of random, risky research ideas; and Blair Lyonev for being
a supportive partner, providing both encouragement and advice surrounding the
challenges of writing a book.
Mark would like to thank Maciej Dunajski, Andrew Thomason, Adrian
Weller, Krzysztof Choromanski, Rich Turner, and John Aston for their
Draft version.