Author of the publication

Stabilizing Transformer Training by Preventing Attention Entropy Collapse.

ICML, volume 202 of Proceedings of Machine Learning Research, pages 40770-40803. PMLR, (2023)

Other publications of authors with the same name

Stabilizing Transformer Training by Preventing Attention Entropy Collapse. ICML, volume 202 of Proceedings of Machine Learning Research, pages 40770-40803. PMLR, (2023)

REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation. WACV, pages 2051-2060. IEEE, (2024)

Theory, Analysis, and Best Practices for Sigmoid Self-Attention. CoRR, (2024)

The Role of Entropy and Reconstruction in Multi-View Self-Supervised Learning. ICML, volume 202 of Proceedings of Machine Learning Research, pages 29143-29160. PMLR, (2023)

Decoding Decoders: Finding Optimal Representation Spaces for Unsupervised Similarity Tasks. ICLR (Workshop), OpenReview.net, (2018)

Learning medical triage from clinicians using Deep Q-Learning. CoRR, (2020)

Neural Temporal Point Processes For Modelling Electronic Health Records. ML4H@NeurIPS, volume 136 of Proceedings of Machine Learning Research, pages 85-113. PMLR, (2020)

Relational Graph Attention Networks. CoRR, (2019)

DUET: 2D Structured and Approximately Equivariant Representations. ICML, volume 202 of Proceedings of Machine Learning Research, pages 32749-32769. PMLR, (2023)

Position Prediction as an Effective Pretraining Strategy. ICML, volume 162 of Proceedings of Machine Learning Research, pages 26010-26027. PMLR, (2022)