000 | 03369nam a22005295i 4500 | ||
---|---|---|---|
001 | 978-3-031-01551-9 | ||
003 | DE-He213 | ||
005 | 20240730164105.0 | ||
007 | cr nn 008mamaa | ||
008 | 220601s2010 sz | s |||| 0|eng d | ||
020 |
_a9783031015519 _9978-3-031-01551-9 |
||
024 | 7 |
_a10.1007/978-3-031-01551-9 _2doi |
|
050 | 4 | _aQ334-342 | |
050 | 4 | _aTA347.A78 | |
072 | 7 |
_aUYQ _2bicssc |
|
072 | 7 |
_aCOM004000 _2bisacsh |
|
072 | 7 |
_aUYQ _2thema |
|
082 | 0 | 4 |
_a006.3 _223 |
100 | 1 |
_aSzepesvári, Csaba. _eauthor. _4aut _4http://id.loc.gov/vocabulary/relators/aut _982051 |
|
245 | 1 | 0 |
_aAlgorithms for Reinforcement Learning _h[electronic resource] / _cby Csaba Szepesvári. |
250 | _a1st ed. 2010. | ||
264 | 1 |
_aCham : _bSpringer International Publishing : _bImprint: Springer, _c2010. |
|
300 |
_aXIII, 89 p. _bonline resource. |
||
336 |
_atext _btxt _2rdacontent |
||
337 |
_acomputer _bc _2rdamedia |
||
338 |
_aonline resource _bcr _2rdacarrier |
||
347 |
_atext file _bPDF _2rda |
||
490 | 1 |
_aSynthesis Lectures on Artificial Intelligence and Machine Learning, _x1939-4616 |
|
505 | 0 | _aMarkov Decision Processes -- Value Prediction Problems -- Control -- For Further Exploration. | |
520 | _aReinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book, we focus on those algorithms of reinforcement learning that build on the powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning problems, describe the core ideas, note a large number of state of the art algorithms, followed by the discussion of their theoretical properties and limitations. Table of Contents: Markov Decision Processes / Value Prediction Problems / Control / For Further Exploration. | ||
650 | 0 |
_aArtificial intelligence. _93407 |
|
650 | 0 |
_aMachine learning. _91831 |
|
650 | 0 |
_aNeural networks (Computer science) . _982052 |
|
650 | 1 | 4 |
_aArtificial Intelligence. _93407 |
650 | 2 | 4 |
_aMachine Learning. _91831 |
650 | 2 | 4 |
_aMathematical Models of Cognitive Processes and Neural Networks. _932913 |
710 | 2 |
_aSpringerLink (Online service) _982053 |
|
773 | 0 | _tSpringer Nature eBook | |
776 | 0 | 8 |
_iPrinted edition: _z9783031000218 |
776 | 0 | 8 |
_iPrinted edition: _z9783031004230 |
776 | 0 | 8 |
_iPrinted edition: _z9783031026799 |
830 | 0 |
_aSynthesis Lectures on Artificial Intelligence and Machine Learning, _x1939-4616 _982054 |
|
856 | 4 | 0 | _uhttps://doi.org/10.1007/978-3-031-01551-9 |
912 | _aZDB-2-SXSC | ||
942 | _cEBK | ||
999 |
_c85289 _d85289 |