Reinforcement Learning

There is a new edition of this book available Reinforcement Learning, Second Edition.

An Introduction

by Sutton, Barto

Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability.

The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.

This is a highly intuitive and accessible introduction to the recent major developments in reinforcement learning, written by two of the field's pioneering contributors.
Dimitri P. Bertsekas and John N. Tsitsiklis Professors, Department of Electrical Engineering andn Computer Science, Massachusetts Institute of Technology

This book not only provides an introduction to learning theory but also serves as a tremendous source of ideas for further development and applications in the real world.
Toshio Fukuda Nagoya University, Japan; President, IEEE Robotics and Automantion Society

Reinforcement learning has always been important in the understanding of the driving force behind biological systems, but in the last two decades it has become increasingly important, owing to the development of mathematical algorithms. Barto and Sutton were the prime movers in leading the development of these algorithms and have described them with wonderful clarity in this new text. I predict it will be the standard text.
Dana Ballard Professor of Computer Science, University of Rochester

The widely acclaimed work of Sutton and Barto on reinforcement learning applies some essentials of animal learning, in clever ways, to artificial learning systems. This is a very readable and comprehensive account of the background, algorithms, applications, and future directions of this pioneering and far-reaching work.
Wolfram Schultz University of Fribourg, Switzerland

Expand/Collapse All
Contents (pg. vii)
Series Foreword (pg. xiii)
Preface (pg. xv)
I The Problem (pg. 1)
1 Introduction (pg. 3)
2 Evaluative Feedback (pg. 25)
3 The Reinforcement Learning Problem (pg. 51)
II Elementary Solution Methods (pg. 87)
4 Dynamic Programming (pg. 89)
5 Monte Carlo Methods (pg. 111)
6 Temporal-Difference Learning (pg. 133)
III A Unified View (pg. 161)
7 Eligibility Traces (pg. 163)
8 Generalization and Function Approximation (pg. 193)
9 Planning and Learning (pg. 227)
10 Dimensions of Reinforcement Learning (pg. 255)
11 Case Studies (pg. 261)
References (pg. 291)
Summary of Notation (pg. 313)
Index (pg. 315)

Richard S. Sutton

Richard S. Sutton is Senior Research Scientist, Department of Computer Science, University of Massachusetts.

Andrew G. Barto

Andrew G. Barto is Professor of Computer Science at the University of Massachusetts.

Instructors Only

Browser	Version
Chrome (Recommended)	Latest
Firefox	Latest
Edge	Latest
Safari	Latest