Author of the publication

Off-Policy Temporal Difference Learning with Function Approximation.

ICML, pages 417-424. Morgan Kaufmann, (2001)

Other publications of authors with the same name

Performance Guarantees for Hierarchical Clustering. COLT, volume 2375 of Lecture Notes in Computer Science, pages 351-363. Springer, (2002)

Robust Learning from Discriminative Feature Feedback. AISTATS, volume 108 of Proceedings of Machine Learning Research, pages 973-982. PMLR, (2020)

Constants Matter: The Performance Gains of Active Learning. ICML, volume 162 of Proceedings of Machine Learning Research, pages 16123-16173. PMLR, (2022)

Moment-based Uniform Deviation Bounds for k-means and Friends. NIPS, pages 2940-2948. (2013)

The Complexity of Approximating the Entropy. CCC, page 17. IEEE Computer Society, (2002)

An algorithm for L1 nearest neighbor search via monotonic embedding. NIPS, pages 983-991. (2016)

Interactive Structure Learning with Structural Query-by-Committee. NeurIPS, pages 1129-1139. (2018)

Analysis of a greedy active learning strategy. NIPS, pages 337-344. (2004)

Random projection trees and low dimensional manifolds. STOC, pages 537-546. ACM, (2008)

Online k-means Clustering on Arbitrary Data Streams. ALT, volume 201 of Proceedings of Machine Learning Research, pages 204-236. PMLR, (2023)