Author of the publication

Stable, Fast and Accurate: Kernelized Attention with Relative Positional Encoding.

NeurIPS, pages 22795-22807. (2021)


Other publications of authors with the same name

Mix MSTAR: A Synthetic Benchmark Dataset for Multi-Class Rotation Vehicle Detection in Large-Scale SAR Images. Remote Sens., 15 (18): 4558 (September 2023)
Revisiting Language Encoding in Learning Multilingual Representations. CoRR, (2021)
One Transformer Can Understand Both 2D & 3D Molecular Data. ICLR, OpenReview.net, (2023)
Do Transformers Really Perform Badly for Graph Representation? NeurIPS, pages 28877-28888. (2021)
Learning a Fourier Transform for Linear Relative Positional Encodings in Transformers. AISTATS, volume 238 of Proceedings of Machine Learning Research, pages 2278-2286. PMLR, (2024)
First Place Solution of KDD Cup 2021 & OGB Large-Scale Challenge Graph Prediction Track. CoRR, (2021)
Benchmarking Graphormer on Large-Scale Molecular Modeling Datasets. CoRR, (2022)
GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training. ICML, volume 139 of Proceedings of Machine Learning Research, pages 1204-1215. PMLR, (2021)
Do Transformers Really Perform Bad for Graph Representation? CoRR, (2021)
Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation. CoRR, (2024)