Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

SM6: A 16nm System-on-Chip for Accurate and Noise-Robust Attention-Based NLP Applications : The 33rd Hot Chips Symposium - August 22-24, 2021., , , , , , , , , and . HCS, page 1-13. IEEE, (2021)A 16-nm SoC for Noise-Robust Speech and NLP Edge AI Inference With Bayesian Sound Source Separation and Attention-Based DNNs., , , , , , , , , and . IEEE J. Solid State Circuits, 58 (2): 569-581 (February 2023)Learned Best-Effort LLM Serving., , , , and . CoRR, (2024)Full Stack Optimization of Transformer Inference: a Survey., , , , , , , , , and 2 other author(s). CoRR, (2023)EdgeBERT: Optimizing On-Chip Inference for Multi-Task NLP., , , , , , , , and . CoRR, (2020)AI and Memory Wall., , , , , and . CoRR, (2024)Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation., , , , , , , and . CoRR, (2023)KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization., , , , , , and . CoRR, (2024)SPEED: Speculative Pipelined Execution for Efficient Decoding., , , , , , and . CoRR, (2023)A 12nm 18.1TFLOPs/W Sparse Transformer Processor with Entropy-Based Early Exit, Mixed-Precision Predication and Fine-Grained Power Management., , , , , , , , , and 4 other author(s). ISSCC, page 342-343. IEEE, (2023)