Algorithms for Hyper-parameter Optimization

J. Bergstra, R. Bardenet, Y. Bengio, and B. Kégl.
Proceedings of the 24th International Conference on Neural Information Processing Systems, pages 2546--2554. USA, Curran Associates Inc., (2011)

Abstract

Several recent advances to the state of the art in image classification benchmarks have come from better configurations of existing techniques rather than novel approaches to feature learning. Traditionally, hyper-parameter optimization has been the job of humans because they can be very efficient in regimes where only a few trials are possible. Presently, computer clusters and GPU processors make it possible to run more trials and we show that algorithmic approaches can find better results. We present hyper-parameter optimization results on tasks of training neural networks and deep belief networks (DBNs). We optimize hyper-parameters using random search and two new greedy sequential methods based on the expected improvement criterion. Random search has been shown to be sufficiently efficient for learning neural networks for several datasets, but we show it is unreliable for training DBNs. The sequential algorithms are applied to the most difficult DBN learning problems from [1] and find significantly better results than the best previously reported. This work contributes novel techniques for making response surface models P(y|x) in which many elements of hyper-parameter assignment (x) are known to be irrelevant given particular values of other elements.
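As a rough illustration of the random-search baseline and of a search space in which some hyper-parameters only matter given the values of others, the following minimal Python sketch may help. It is not the authors' code: the search space, the names `sample_config`, `random_search`, and `toy_objective`, and the toy loss are all assumptions made for this example, and the sequential expected-improvement methods from the paper are not shown.

```python
import math
import random


def sample_config(rng):
    """Draw one configuration from a hypothetical conditional search space."""
    n_layers = rng.randint(1, 3)  # inclusive on both ends
    config = {
        # Log-uniform learning rate between 1e-4 and 1e-1 (assumed range).
        "learning_rate": math.exp(rng.uniform(math.log(1e-4), math.log(1e-1))),
        "n_layers": n_layers,
    }
    # Per-layer sizes are sampled only for layers that exist; for absent
    # layers the corresponding hyper-parameter is irrelevant and omitted.
    for i in range(n_layers):
        config[f"n_hidden_{i}"] = rng.choice([128, 256, 512, 1024])
    return config


def random_search(objective, n_trials, seed=0):
    """Evaluate n_trials independent random configurations, keep the best."""
    rng = random.Random(seed)
    best_config, best_loss = None, math.inf
    for _ in range(n_trials):
        config = sample_config(rng)
        loss = objective(config)
        if loss < best_loss:
            best_config, best_loss = config, loss
    return best_config, best_loss


def toy_objective(config):
    """Stand-in for a validation error; a real one would train a model."""
    lr_term = (math.log10(config["learning_rate"]) + 2.0) ** 2
    return lr_term + 0.1 * config["n_layers"]


best_config, best_loss = random_search(toy_objective, n_trials=50)
```

In practice `toy_objective` would be replaced by a function that trains a network with the given configuration and returns its validation error, which is where nearly all of the cost of each trial lies.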

BibTeX key: bergstra2011algorithms
entry type: inproceedings
address: USA
booktitle: Proceedings of the 24th International Conference on Neural Information Processing Systems
year: 2011
pages: 2546--2554
publisher: Curran Associates Inc.
series: NIPS'11
acmid: 2986743
isbn: 978-1-61839-599-3
numpages: 9
location: Granada, Spain
url: http://dl.acm.org/citation.cfm?id=2986459.2986743

Cite this publication

@inproceedings{bergstra2011algorithms,
  abstract = {Several recent advances to the state of the art in image classification benchmarks have come from better configurations of existing techniques rather than novel approaches to feature learning. Traditionally, hyper-parameter optimization has been the job of humans because they can be very efficient in regimes where only a few trials are possible. Presently, computer clusters and GPU processors make it possible to run more trials and we show that algorithmic approaches can find better results. We present hyper-parameter optimization results on tasks of training neural networks and deep belief networks (DBNs). We optimize hyper-parameters using random search and two new greedy sequential methods based on the expected improvement criterion. Random search has been shown to be sufficiently efficient for learning neural networks for several datasets, but we show it is unreliable for training DBNs. The sequential algorithms are applied to the most difficult DBN learning problems from [1] and find significantly better results than the best previously reported. This work contributes novel techniques for making response surface models P(y|x) in which many elements of hyper-parameter assignment (x) are known to be irrelevant given particular values of other elements.},
  acmid = {2986743},
  added-at = {2018-05-17T11:06:21.000+0200},
  address = {USA},
  author = {Bergstra, James and Bardenet, R{\'e}mi and Bengio, Yoshua and K{\'e}gl, Bal{\'a}zs},
  biburl = {https://www.bibsonomy.org/bibtex/2a366a15b91aa02cd00b762735b230591/nosebrain},
  booktitle = {Proceedings of the 24th International Conference on Neural Information Processing Systems},
  description = {Algorithms for hyper-parameter optimization},
  interhash = {99bf27626a5adecc8c327c43f55dcce0},
  intrahash = {a366a15b91aa02cd00b762735b230591},
  isbn = {978-1-61839-599-3},
  keywords = {hyperparameter networks neural optimization},
  location = {Granada, Spain},
  numpages = {9},
  pages = {2546--2554},
  publisher = {Curran Associates Inc.},
  series = {NIPS'11},
  timestamp = {2018-05-17T11:06:21.000+0200},
  title = {Algorithms for Hyper-parameter Optimization},
  url = {http://dl.acm.org/citation.cfm?id=2986459.2986743},
  year = {2011}
}