Author of the publication

Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer.

, , , and . CoRR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer., , , and . CoRR, (2023)Sensing Characteristics of Tilted Long Period Fiber Gratings Inscribed by Infrared Femtosecond Laser., , , , and . Sensors, 18 (9): 3003 (2018)Conditional Generation of Medical Images via Disentangled Adversarial Inference., , , and . DGM4MICCAI/DALI@MICCAI, volume 13003 of Lecture Notes in Computer Science, page 45-66. Springer, (2021)A Cross-patient SEEG Epileptic Signal Detection Method Based on Adaptive Feature Fusion of Brain Network Features and Single-Channel Features., , , , , and . ICBBE, page 165-172. ACM, (2022)Multiview Long-Short Spatial Contrastive Learning For 3D Medical Image Analysis., , , , , and . ICASSP, page 1226-1230. IEEE, (2022)Adaptive antenna arrays for cellular CDMA communication systems., and . ICASSP, page 1725-1728. IEEE Computer Society, (1995)A novel local descriptor based on image patch gray-value coding., , , and . ROBIO, page 1276-1280. IEEE, (2009)A robust and efficient method for estimating enzyme complex abundance and metabolic flux from expression data., , , , , , , and . Comput. Biol. Chem., (2015)Leveraging Edge Computing and Privacy-Enhanced Prediction Sharing for Urban Traffic Forecasting., , and . ICPADS, page 1067-1074. IEEE, (2023)A New Method for Motion-Blurred Image Blind Restoration Based on Huber Markov Random Field., , , and . ICIG, page 51-56. IEEE Computer Society, (2009)