Author of the publication

Simplicity Bias in Transformers and their Ability to Learn Sparse Boolean Functions.

, , , and . ACL (1), page 5767-5791. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Differentiable Causal Backdoor Discovery., , , and . AISTATS, volume 108 of Proceedings of Machine Learning Research, page 3970-3979. PMLR, (2020)Learning Using Local Membership Queries., , and . COLT, volume 30 of JMLR Workshop and Conference Proceedings, page 398-431. JMLR.org, (2013)Learning Hurdles for Sleeping Experts., and . Electron. Colloquium Comput. Complex., (2011)Hierarchical Clustering Beyond the Worst-Case., , and . NIPS, page 6201-6209. (2017)On the Hardness of Robust Classification., , , and . NeurIPS, page 7444-7453. (2019)Clustering Redemption-Beyond the Impossibility of Kleinberg's Axioms., , and . NeurIPS, page 8526-8535. (2018)Online k-means Clustering., , , and . AISTATS, volume 130 of Proceedings of Machine Learning Research, page 1126-1134. PMLR, (2021)Reliable Agnostic Learning., , and . COLT, (2009)Understanding In-Context Learning in Transformers and LLMs by Learning to Learn Discrete Functions., , , and . ICLR, OpenReview.net, (2024)Hierarchical Clustering: Objective Functions and Algorithms., , , and . J. ACM, 66 (4): 26:1-26:42 (2019)