Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens.

Y. Ding, L. Zhang, C. Zhang, Y. Xu, N. Shang, J. Xu, F. Yang, and M. Yang. CoRR, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Li Zhang

Other publications of authors with the same name

Boosting Mobile CNN Inference through Semantic Memory.Y. Li, C. Zhang, S. Han, L. Zhang, B. Yin, Y. Liu, and M. Xu. ACM Multimedia, page 2362-2371. ACM, (2021)Towards efficient vision transformer inference: a first study of transformers on mobile devices.X. Wang, L. Zhang, Y. Wang, and M. Yang. HotMobile, page 1-7. ACM, (2022)SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference.X. Wang, L. Zhang, J. Xu, Q. Zhang, Y. Wang, Y. Yang, N. Zheng, T. Cao, and M. Yang. ICCV, page 5796-5805. IEEE, (2023)Fast Hardware-Aware Neural Architecture Search.L. Zhang, Y. Yang, Y. Jiang, W. Zhu, and Y. Liu. CVPR Workshops, page 2959-2967. Computer Vision Foundation / IEEE, (2020)LitePred: Transferable and Scalable Latency Prediction for Hardware-Aware Neural Architecture Search.C. Feng, L. Zhang, Y. Liu, J. Xu, C. Zhang, Z. Wang, T. Cao, M. Yang, and H. Tan. NSDI, USENIX Association, (2024)Compresso: Structured Pruning with Collaborative Prompting Learns Compact Large Language Models.S. Guo, J. Xu, L. Zhang, and M. Yang. CoRR, (2023)LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup.X. Tang, Y. Wang, T. Cao, L. Zhang, Q. Chen, D. Cai, Y. Liu, and M. Yang. MobiCom, page 70:1-70:15. ACM, (2023)nn-Meter: towards accurate latency prediction of deep-learning model inference on diverse edge devices.L. Zhang, S. Han, J. Wei, N. Zheng, T. Cao, Y. Yang, and Y. Liu. MobiSys, page 81-93. ACM, (2021)Accurate and Structured Pruning for Efficient Automatic Speech Recognition.H. Jiang, L. Zhang, Y. Li, Y. Wu, S. Cao, T. Cao, Y. Yang, J. Li, M. Yang, and L. Qiu. INTERSPEECH, page 4104-4108. ISCA, (2023)On Modular Learning of Distributed Systems for Predicting End-to-End Latency.C. Liang, Z. Fang, Y. Xie, F. Yang, Z. Li, L. Zhang, M. Yang, and L. Zhou. NSDI, page 1081-1095. USENIX Association, (2023)

BibSonomy

Disambiguation of "Zhang, Li Lyna"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens.

Please choose a person to relate this publication to

Li Zhang

Li Zhang

Li Zhang

Li Zhang

Li Zhang

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Zhang, Li Lyna"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens.

Please choose a person to relate this publication to

Li Zhang

Li Zhang

Li Zhang

Li Zhang

Li Zhang

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens.