From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems, , , , , и . (2019)cite arxiv:1903.03129Comment: Published at MLSys 2020.Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer., , , и . CoRR, (2023)SOLAR: Sparse Orthogonal Learned and Random Embeddings., , и . ICLR, OpenReview.net, (2021)It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF., , , , , и . CoRR, (2024)Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions., , , , , , , , , и 4 other автор(ы). CoRR, (2023)Learn To be Efficient: Build Structured Sparsity in Large Language Models., , , , и . CoRR, (2024)Efficient Streaming Language Models with Attention Sinks, , , , и . (2024)LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding., , , , , , , , , и 3 other автор(ы). ACL (1), стр. 12622-12642. Association for Computational Linguistics, (2024)Sub-linear Privacy-preserving Search with Untrusted Server and Semi-honest Parties., , , , и . CoRR, (2016)Densified Winner Take All (WTA) Hashing for Sparse Datasets., и . UAI, стр. 906-916. AUAI Press, (2018)