000 | 03336nam a2200517 i 4500 | ||
---|---|---|---|
001 | 6267472 | ||
003 | IEEE | ||
005 | 20220712204715.0 | ||
006 | m o d | ||
007 | cr |n||||||||| | ||
008 | 151224s2008 maua ob 001 eng d | ||
010 | _z 91183100 (print) | ||
020 |
_a9780262288507 _qelectronic |
||
020 |
_z9780262512787 _qprint |
||
035 | _a(CaBNVSL)mat06267472 | ||
035 | _a(IDAMS)0b000064818b44ac | ||
040 |
_aCaBNVSL _beng _erda _cCaBNVSL _dCaBNVSL |
||
050 | 4 |
_aQA76.7 _b.K33 1990eb |
|
082 | 0 | 0 |
_a005.1 _220 |
100 | 1 |
_aKaelbling, Leslie Pack, _eauthor. _922979 |
|
245 | 1 | 0 |
_aLearning in embedded systems / _cby Leslie Pack Kaelbling. |
264 | 1 |
_aStanford, Calif. : _bDept. of Computer Science, Stanford University, _c[c1990] |
|
264 | 2 |
_a[Piscataqay, New Jersey] : _bIEEE Xplore, _c[2008] |
|
300 |
_a1 PDF (xx, 200 pages) : _billustrations. |
||
336 |
_atext _2rdacontent |
||
337 |
_aelectronic _2isbdmedia |
||
338 |
_aonline resource _2rdacarrier |
||
490 | 1 |
_aReport ; _vno. STAN-CS-90-1326 |
|
500 | _aCover title. | ||
500 | _a"June 1990." | ||
504 | _aIncludes bibliographical references (p. 191-200). | ||
506 | 1 | _aRestricted to subscribers or individual electronic text purchasers. | |
520 | _aLearning to perform complex action strategies is an important problem in the fields of artificial intelligence, robotics, and machine learning. Filled with interesting new experimental results, Learning in Embedded Systems explores algorithms that learn efficiently from trial-and error experience with an external world. It is the first detailed exploration of the problem of learning action strategies in the context of designing embedded systems that adapt their behavior to a complex, changing environment; such systems include mobile robots, factory process controllers, and long-term software databases.Kaelbling investigates a rapidly expanding branch of machine learning known as reinforcement learning, including the important problems of controlled exploration of the environment, learning in highly complex environments, and learning from delayed reward. She reviews past work in this area and presents a number of significant new results. These include the intervalestimation algorithm for exploration, the use of biases to make learning more efficient in complex environments, a generate-and-test algorithm that combines symbolic and statistical processing into a flexible learning method, and some of the first reinforcement-learning experiments with a real robot.Leslie Pack Kaelbling is Assistant Professor in the Computer Science Department at Brown University. | ||
530 | _aAlso available in print. | ||
538 | _aMode of access: World Wide Web | ||
588 | _aDescription based on PDF viewed 12/24/2015. | ||
650 | 0 |
_aEmbedded computer systems _xProgramming. _914089 |
|
650 | 0 |
_aComputer algorithms. _94534 |
|
655 | 0 |
_aElectronic books. _93294 |
|
710 | 2 |
_aIEEE Xplore (Online Service), _edistributor. _922980 |
|
710 | 2 |
_aMIT Press, _epublisher. _922981 |
|
776 | 0 | 8 |
_iPrint version _z9780262512787 |
830 | 0 |
_aReport (Stanford University. Computer Science Department) ; _vno. STAN-CS-90-1326. _922982 |
|
856 | 4 | 2 |
_3Abstract with links to resource _uhttps://ieeexplore.ieee.org/xpl/bkabstractplus.jsp?bkn=6267472 |
942 | _cEBK | ||
999 |
_c73126 _d73126 |