Author of the publication

AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression.

, , , , and . ACL (1), page 8449-8465. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Multi-view Similarity Learning of Manifold Data., , , and . ICIG (1), volume 11901 of Lecture Notes in Computer Science, page 631-643. Springer, (2019)Low-Rank Laplacian Similarity Learning., , , and . BICS, volume 11691 of Lecture Notes in Computer Science, page 34-44. Springer, (2019)Interactive Time-Dependent Tone Mapping Using Programmable Graphics Hardware., , , and . Rendering Techniques, volume 44 of ACM International Conference Proceeding Series, page 26-37. Eurographics Association, (2003)Feasibility of Mobility for Millimeter-Wave Systems Based on Channel Measurements., , , , , , and . IEEE Communications Magazine, 56 (7): 56-63 (2018)Non-Hierarchical Clock Synchronization for Wireless Sensor Networks, , and . CoRR, (2012)Algorithms for semi-automatic Web service composition.. University of Georgia, Athens, GA, USA, (2011)base-search.net (ftunivgeorgia:oai:ugakr.libs.uga.edu:10724/27335).UGC: Unified GAN Compression for Efficient Image-to-Image Translation., , , , , , , , and . ICCV, page 17235-17245. IEEE, (2023)PreDet: Large-scale weakly supervised pre-training for detection., , and . ICCV, page 2845-2855. IEEE, (2021)Depression Analysis and Recognition Based on Functional Near-Infrared Spectroscopy., , , , , and . IEEE J. Biomed. Health Informatics, 25 (12): 4289-4299 (2021)H∞ control for a class of two-dimensional linear systems with fading measurements., , , and . Trans. Inst. Meas. Control, 43 (2): 267-277 (2021)