Author of the publication

The Sufficiency of Off-Policyness and Soft Clipping: PPO Is Still Insufficient according to an Off-Policy Measure.

, , , , , , , , , and . AAAI, page 7078-7086. AAAI Press, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Constraint-Based Mining of Web Page Associations, , , and . volume 4830 of Lecture Notes in Computer Science, chapter Constraint-Based Mining of Web Page Associations, page 315-326. Springer Berlin/Heidelberg, (2007)Submodular and supermodular multi-labeling, and vertex happiness., , and . CoRR, (2016)On Generality and Knowledge Transferability in Cross-Domain Duplicate Question Detection for Heterogeneous Community Question Answering., , , , , , and . CoRR, (2018)Strong Equivalence of Logic Programs with Abstract Constraint Atoms., , , , and . LPNMR, volume 6645 of Lecture Notes in Computer Science, page 161-173. Springer, (2011)Curried least general generalization: A framework for higher order concept learning., , and . PRICAI Workshops, volume 1359 of Lecture Notes in Computer Science, page 45-60. Springer, (1996)The role of default representations in incremental learning., , and . PRICAI Workshops, volume 1359 of Lecture Notes in Computer Science, page 92-105. Springer, (1996)An Improved Approximation Algorithm for the Bandpass Problem., , , and . FAW-AAIM, volume 7285 of Lecture Notes in Computer Science, page 351-358. Springer, (2012)Describing Plan Recognition as Nonmonotonic Reasoning and Belief Revision., and . Australian Joint Conference on Artificial Intelligence, volume 1342 of Lecture Notes in Computer Science, page 236-245. Springer, (1997)Taking Levi Identity Seriously: A Plea for Iterated Belief Contraction., , , and . KSEM, volume 4092 of Lecture Notes in Computer Science, page 305-317. Springer, (2006)Using Definite Clauses and Integrity Constraints as the Basis for a Theory Formation Approach to Diagnostic Reasoning., , and . ICLP, volume 225 of Lecture Notes in Computer Science, page 211-222. Springer, (1986)