Author of the publication

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time.

, , , , , , , , , , and . ICML, volume 162 of Proceedings of Machine Learning Research, page 23965-23998. PMLR, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Comparison of the Achievable Rates in OFDM and Single Carrier Modulation with I.I.D. Inputs., , and . CoRR, (2013)First-Order Methods for Nonconvex Quadratic Minimization., and . SIAM Rev., 62 (2): 395-436 (2020)A Whole New Ball Game: A Primal Accelerated Method for Matrix Games and Minimizing the Maximum of Smooth Functions., , , and . CoRR, (2023)Acceleration with a Ball Optimization Oracle., , , , , , and . NeurIPS, (2020)Analysis of Krylov Subspace Solutions of Regularized Non-Convex Quadratic Problems., and . NeurIPS, page 10728-10738. (2018)Variance Reduction for Matrix Games., , , and . NeurIPS, page 11377-11388. (2019)Eventually-stationary policies for Markov decision models with non-constant discounting., and . VALUETOOLS, page 63. ICST/ACM, (2008)A Rank-1 Sketch for Matrix Multiplicative Weights., , , and . COLT, volume 99 of Proceedings of Machine Learning Research, page 589-623. PMLR, (2019)Optimal and Adaptive Monteiro-Svaiter Acceleration., , , , and . NeurIPS, (2022)Gradient Descent Monotonically Decreases the Sharpness of Gradient Flow Solutions in Scalar Networks and Beyond., , , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 17684-17744. PMLR, (2023)