MARC View

000			03336nam a2200517 i 4500
001			6267472
003			IEEE
005			20220712204715.0
006			m o d
007			cr \|n\|\|\|\|\|\|\|\|\|
008			151224s2008 maua ob 001 eng d
010			_z 91183100 (print)
020			_a9780262288507 _qelectronic
020			_z9780262512787 _qprint
035			_a(CaBNVSL)mat06267472
035			_a(IDAMS)0b000064818b44ac
040			_aCaBNVSL _beng _erda _cCaBNVSL _dCaBNVSL
050		4	_aQA76.7 _b.K33 1990eb
082	0	0	_a005.1 _220
100	1		_aKaelbling, Leslie Pack, _eauthor. _922979
245	1	0	_aLearning in embedded systems / _cby Leslie Pack Kaelbling.
264		1	_aStanford, Calif. : _bDept. of Computer Science, Stanford University, _c[c1990]
264		2	_a[Piscataqay, New Jersey] : _bIEEE Xplore, _c[2008]
300			_a1 PDF (xx, 200 pages) : _billustrations.
336			_atext _2rdacontent
337			_aelectronic _2isbdmedia
338			_aonline resource _2rdacarrier
490	1		_aReport ; _vno. STAN-CS-90-1326
500			_aCover title.
500			_a"June 1990."
504			_aIncludes bibliographical references (p. 191-200).
506	1		_aRestricted to subscribers or individual electronic text purchasers.
520			_aLearning to perform complex action strategies is an important problem in the fields of artificial intelligence, robotics, and machine learning. Filled with interesting new experimental results, Learning in Embedded Systems explores algorithms that learn efficiently from trial-and error experience with an external world. It is the first detailed exploration of the problem of learning action strategies in the context of designing embedded systems that adapt their behavior to a complex, changing environment; such systems include mobile robots, factory process controllers, and long-term software databases.Kaelbling investigates a rapidly expanding branch of machine learning known as reinforcement learning, including the important problems of controlled exploration of the environment, learning in highly complex environments, and learning from delayed reward. She reviews past work in this area and presents a number of significant new results. These include the intervalestimation algorithm for exploration, the use of biases to make learning more efficient in complex environments, a generate-and-test algorithm that combines symbolic and statistical processing into a flexible learning method, and some of the first reinforcement-learning experiments with a real robot.Leslie Pack Kaelbling is Assistant Professor in the Computer Science Department at Brown University.
530			_aAlso available in print.
538			_aMode of access: World Wide Web
588			_aDescription based on PDF viewed 12/24/2015.
650		0	_aEmbedded computer systems _xProgramming. _914089
650		0	_aComputer algorithms. _94534
655		0	_aElectronic books. _93294
710	2		_aIEEE Xplore (Online Service), _edistributor. _922980
710	2		_aMIT Press, _epublisher. _922981
776	0	8	_iPrint version _z9780262512787
830		0	_aReport (Stanford University. Computer Science Department) ; _vno. STAN-CS-90-1326. _922982
856	4	2	_3Abstract with links to resource _uhttps://ieeexplore.ieee.org/xpl/bkabstractplus.jsp?bkn=6267472
942			_cEBK
999			_c73126 _d73126