Author of the publication

Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time.

, , , , , , , and . CoRR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Lsh-sampling Breaks the Computation Chicken-and-egg Loop in Adaptive Stochastic Gradient Estimation., , and . CoRR, (2019)Sub-linear RACE Sketches for Approximate Kernel Density Estimation on Streaming Data., and . CoRR, (2019)Adaptive Learned Bloom Filter (Ada-BF): Efficient Utilization of the Classifier., and . CoRR, (2019)SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems, , , , , and . (2019)cite arxiv:1903.03129Comment: Published at MLSys 2020.Privacy Adversarial Network: Representation Learning for Mobile Data Privacy., , , and . Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 3 (4): 144:1-144:18 (2019)Active Sampling Count Sketch (ASCS) for Online Sparse Estimation of a Trillion Scale Covariance Matrix., , , and . CoRR, (2020)CARAMEL: A Succinct Read-Only Lookup Table via Compressed Static Functions., , , , and . CoRR, (2023)Arrays of (locality-sensitive) Count Estimators (ACE): High-Speed Anomaly Detection via Cache Lookups., and . CoRR, (2017)Scalable and Sustainable Deep Learning via Randomized Hashing., and . CoRR, (2016)Beyond Convolutions: A Novel Deep Learning Approach for Raw Seismic Data Ingestion., , , , , and . CoRR, (2021)