@dblp

Exponential convergence rates for Batch Normalization: The power of length-direction decoupling in non-convex optimization.

, , , , , и . AISTATS, том 89 из Proceedings of Machine Learning Research, стр. 806-815. PMLR, (2019)

Линки и ресурсы

тэги