Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Multi-task sequence to sequence learning, , , , and . arXiv preprint arXiv:1511.06114, (2015)Language models are unsupervised multitask learners, , , , , and . (2019)An Empirical Exploration of Recurrent Network Architectures, , and . ICML, (2015)Neural Programmer: Inducing Latent Programs with Gradient Descent., , and . ICLR (Poster), (2016)An Empirical Exploration of Recurrent Network Architectures., , and . ICML, volume 37 of JMLR Workshop and Conference Proceedings, page 2342-2350. JMLR.org, (2015)On the importance of initialization and momentum in deep learning., , , and . ICML (3), volume 28 of JMLR Workshop and Conference Proceedings, page 1139-1147. JMLR.org, (2013)Dota 2 with Large Scale Deep Reinforcement Learning., , , , , , , , , and 15 other author(s). CoRR, (2019)Modelling Relational Data using Bayesian Clustered Tensor Factorization., , and . NIPS, page 1821-1828. Curran Associates, Inc., (2009)Learning to Execute, and . (2014)cite arxiv:1410.4615.Deep Double Descent: Where Bigger Models and More Data Hurt., , , , , and . ICLR, OpenReview.net, (2020)