
An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning.

ICML, volume 307 of ACM International Conference Proceeding Series, pp. 752-759. ACM, (2008)

Other publications by persons with the same name

A Generalized Reinforcement-Learning Model: Convergence and Applications. ICML, pp. 310-318. Morgan Kaufmann, (1996)
Combining Independent Modules in Lexical Multiple-Choice Problems. (Jan 10, 2005)
A Change Detection Model for Non-Stationary k-Armed Bandit Problems. AAAI Spring Symposium: Between a Rock and a Hard Place: Cognitive Science Principles Meet AI-Hard Problems, p. 39. AAAI, (2006)
The First Probabilistic Track of the International Planning Competition. J. Artif. Intell. Res., (2005)
Bandit-Based Planning and Learning in Continuous-Action Markov Decision Processes. ICAPS, AAAI, (2012)
Bayesian adaptive sampling for variable selection and model averaging. Journal of Computational and Graphical Statistics, 20 (1): 80-101 (January 2011)
Ask Me Anything about MOOCs. AI Magazine, 38 (2): 7-12 (2017)
The AAAI Fall Symposia. AI Magazine, 20 (3): 87-89 (1999)
Planning with abstract Markov decision processes. International Conference on Automated Planning and Scheduling, pp. 480-488. (2017)
PPDDL1.0: An extension to PDDL for expressing planning domains with probabilistic effects. Carnegie Mellon University, Pittsburgh, (2004)