| Volume 9,Issue 3,2011 Table of Contents
Editorial Special issue on approximate dynamic programming and reinforcement learning | | | Editorial: Special issue on approximate dynamic programming and reinforcement learning | | Silvia Ferrari,Jagannathan Sarangapani and Frank L. Lewis | | 2011,9(3):309 [Abstract(1987)] [View PDF 32.96 K (411)] [HTML] | | | | Approximate policy iteration: a survey and some new methods | | Dimitri P. BERTSEKAS | | 2011,9(3):310-335 [Abstract(4509)] [View PDF 460.78 K (355)] [HTML] | | | | A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications | | Warren B. POWELL and Jun MA | | 2011,9(3):336-352 [Abstract(3349)] [View PDF 263.48 K (364)] [HTML] | | | | Adaptive dynamic programming for online solution of a zero-sum differential game | | Draguna VRABIE and Frank LEWIS | | 2011,9(3):353-360 [Abstract(4280)] [View PDF 222.31 K (619)] [HTML] | | | | Approximate dynamic programming solutions with a single network adaptive critic for a class of nonlinear systems | | Jie DING and S. N. BALAKRISHNAN | | 2011,9(3):370-380 [Abstract(2496)] [View PDF 493.05 K (510)] [HTML] | | | | Finite horizon optimal control of discrete-time nonlinear systems with unfixed initial state using adaptive dynamic programming | | Qinglai WEI and Derong LIU | | 2011,9(3):381-390 [Abstract(2231)] [View PDF 379.43 K (548)] [HTML] | | | | A model-based approximate λ-policy iteration approach to online evasive path planning and the video game Ms. Pac-Man | | Greg FODERARO,Vikram RAJU and Silvia FERRARI | | 2011,9(3):391-399 [Abstract(4123)] [View PDF 477.32 K (558)] [HTML] | | | | Asymptotic tracking by a reinforcement learning-based adaptive critic controller | | Shubhendu BHASIN,Nitin SHARMA,Parag PATRE and Warren DIXON | | 2011,9(3):400-409 [Abstract(5320)] [View PDF 458.25 K (482)] [HTML] | | | | Stable reinforcement learning with recurrent neural networks | | James Nate KNIGHT and Charles ANDERSON | | 2011,9(3):410-420 [Abstract(5029)] [View PDF 367.62 K (707)] [HTML] | | | | Semi-Markov adaptive critic heuristics with application to airline revenue management | | Ketaki KULKARNI,Abhijit GOSAVI,Susan MURRAY and Katie GRANTHAM | | 2011,9(3):421-430 [Abstract(2599)] [View PDF 207.68 K (474)] [HTML] | | | | Multiresolution state-space discretization for Q-Learning with pseudorandomized discretization | | Amanda LAMPTON,John VALASEK and Mrinal KUMAR | | 2011,9(3):431-439 [Abstract(2340)] [View PDF 475.94 K (361)] [HTML] | | | | Hierarchical state-abstracted and socially augmented Q-Learning for reducing complexity in agent-based learning | | Xueqing SUN,Tao MAO,Laura RAY,Dongqing SHI and Jerald KRALIK | | 2011,9(3):440-450 [Abstract(2216)] [View PDF 551.24 K (494)] [HTML] | | | | Moving least-squares approximations for linearly-solvable stochastic optimal control problems | | Mingyuan ZHONG and Emanuel TODOROV | | 2011,9(3):451-463 [Abstract(3611)] [View PDF 584.51 K (341)] [HTML] | | |
|