From post

Imperfect also Deserves Reward: Multi-Level and Sequential Reward Modeling for Better Dialog Management.

, , , , , , и . NAACL-HLT, стр. 2993-3001. Association for Computational Linguistics, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

A new almost compactness in L-topological spaces., , и . FSKD, стр. 23-25. IEEE, (2011)Sparse Representation Based on Discriminant Locality Preserving Dictionary Learning for Face Recognition., , , и . IDEAL, том 10585 из Lecture Notes in Computer Science, стр. 306-314. Springer, (2017)An adaptive mobile robots tethering algorithm in constrained environments., и . IROS, стр. 1377-1382. IEEE, (2009)Optimistic Knowledge Gradient Policy for Optimal Budget Allocation in Crowdsourcing., , и . ICML (3), том 28 из JMLR Workshop and Conference Proceedings, стр. 64-72. JMLR.org, (2013)Hyperspectral Image Reconstruction via Block Low-Rank and Three-Dimension Weighted Total Variation Constraint., , , , и . IEEE Access, (2019)Evaluation of frozen tissue-derived prognostic gene expression signatures in FFPE colorectal cancer samples, , , , , , , , , и . Scientific Reports, (сентября 2016)A framework for context sensitive services: A knowledge discovery based approach., и . Decis. Support Syst., 48 (1): 158-168 (2009)Recent development in computational complexity characterization of Nash equilibrium., и . Comput. Sci. Rev., 1 (2): 88-99 (2007)Performing MapReduce on Data Centers with Hierarchical Structures., , , и . Int. J. Comput. Commun. Control, 7 (3): 432-449 (2012)The complexity of optimal multidimensional pricing for a unit-demand buyer., , , , и . Games Econ. Behav., (2018)