Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Multi-task sequence to sequence learning, , , , and . arXiv preprint arXiv:1511.06114, (2015)On the importance of initialization and momentum in deep learning., , , and . ICML (3), volume 28 of JMLR Workshop and Conference Proceedings, page 1139-1147. JMLR.org, (2013)An Empirical Exploration of Recurrent Network Architectures., , and . ICML, volume 37 of JMLR Workshop and Conference Proceedings, page 2342-2350. JMLR.org, (2015)Neural Programmer: Inducing Latent Programs with Gradient Descent., , and . ICLR (Poster), (2016)Modelling Relational Data using Bayesian Clustered Tensor Factorization., , and . NIPS, page 1821-1828. Curran Associates, Inc., (2009)Dota 2 with Large Scale Deep Reinforcement Learning., , , , , , , , , and 15 other author(s). CoRR, (2019)An Empirical Exploration of Recurrent Network Architectures, , and . ICML, (2015)Language models are unsupervised multitask learners, , , , , and . (2019)FFJORD: Free-form Continuous Dynamics for Scalable Reversible Generative Models, , , , and . (2018)cite arxiv:1810.01367Comment: 8 Pages, 6 figures.Deep Double Descent: Where Bigger Models and More Data Hurt, , , , , and . (2019)cite arxiv:1912.02292Comment: G.K. and Y.B. contributed equally.