Normal view MARC view ISBD view

Algorithms for Reinforcement Learning [electronic resource] / by Csaba Szepesvári.

By: Szepesvári, Csaba [author.].

Contributor(s): SpringerLink (Online service).

Material type: materialTypeLabel

BookSeries: Synthesis Lectures on Artificial Intelligence and Machine Learning: Publisher: Cham : Springer International Publishing : Imprint: Springer, 2010Edition: 1st ed. 2010.Description: XIII, 89 p. online resource.Content type: text Media type: computer Carrier type: online resourceISBN: 9783031015519.Subject(s): Artificial intelligence

| Machine learning

| Neural networks (Computer science)

| Artificial Intelligence

| Machine Learning

| Mathematical Models of Cognitive Processes and Neural Networks

Additional physical formats: Printed edition:: No title; Printed edition:: No title; Printed edition:: No titleDDC classification: 006.3 Online resources: Click here to access online

Contents:

Markov Decision Processes -- Value Prediction Problems -- Control -- For Further Exploration.

In: Springer Nature eBookSummary: Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book, we focus on those algorithms of reinforcement learning that build on the powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning problems, describe the core ideas, note a large number of state of the art algorithms, followed by the discussion of their theoretical properties and limitations. Table of Contents: Markov Decision Processes / Value Prediction Problems / Control / For Further Exploration.

average rating: 0.0 (0 votes)

Holdings ( 0 )
Title notes
Comments ( 0 )
Book reviews by critics ( XXX )

No physical items for this record

Markov Decision Processes -- Value Prediction Problems -- Control -- For Further Exploration.

Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book, we focus on those algorithms of reinforcement learning that build on the powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning problems, describe the core ideas, note a large number of state of the art algorithms, followed by the discussion of their theoretical properties and limitations. Table of Contents: Markov Decision Processes / Value Prediction Problems / Control / For Further Exploration.

There are no comments for this item.

More book reviews at iDreamBooks.com

Central Library

Algorithms for Reinforcement Learning [electronic resource] / by Csaba Szepesvári.

By: Szepesvári, Csaba [author.].

Contributor(s): SpringerLink (Online service).