Author of the publication

Incremental Comprehension of Garden-Path Sentences by Large Language Models: Semantic Interpretation, Syntactic Re-Analysis, and Attention.

, , , , , , and . CoRR, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Stable, Fast and Accurate: Kernelized Attention with Relative Positional Encoding., , , , , , , , and . NeurIPS, page 22795-22807. (2021)BitDelta: Your Fine-Tune May Only Be Worth One Bit, , , , , , and . (2024)A Gram-Gauss-Newton Method Learning Overparameterized Deep Neural Networks for Regression Problems., , , , , , , and . CoRR, (2019)SnapKV: LLM Knows What You are Looking for Before Generation., , , , , , , , and . CoRR, (2024)Do Transformers Really Perform Bad for Graph Representation?, , , , , , , and . CoRR, (2021)Convergence of Adversarial Training in Overparametrized Neural Networks., , , , , and . NeurIPS, page 13009-13020. (2019)A Theory of Label Propagation for Subpopulation Shift., , , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 1170-1182. PMLR, (2021)FlexAttention for Efficient High-Resolution Vision-Language Models., , , , , , , and . CoRR, (2024)REST: Retrieval-Based Speculative Decoding., , , , and . NAACL-HLT, page 1582-1595. Association for Computational Linguistics, (2024)Training-Free Activation Sparsity in Large Language Models., , , , , and . CoRR, (2024)