Author of the publication

Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model.

, , , , and . EMNLP, page 15038-15061. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Sample-Efficient OPF Learning Method Based on Annealing Knowledge Distillation., , , , , and . IEEE Access, (2022)Load Forecasting Model and Day-ahead Operation Strategy for City-located EV Quick Charge Stations., , , , , and . CoRR, (2019)Synthesizing Diverse and Physically Stable Grasps with Arbitrary Hand Structures by Differentiable Force Closure Estimation., , , , and . CoRR, (2021)Every Corporation Owns Its Image: Corporate Credit Ratings via Convolutional Neural Networks., , , and . CoRR, (2020)Latency-aware Unified Dynamic Networks for Efficient Image Recognition., , , , , , and . CoRR, (2023)Hoyer regularizer is all you need for ultra low-latency spiking neural networks., , and . CoRR, (2022)PerfOMR: Oblivious Message Retrieval with Reduced Communication and Computation., , and . IACR Cryptol. ePrint Arch., (2024)Q-learning and traditional methods on solving the pocket Rubik's cube., , , and . Comput. Ind. Eng., (2022)A Novel Adaptive Multi-View Non-Negative Graph Semi-Supervised ELM., , , , and . IEEE Access, (2020)Deep and Shallow Features Fusion Based Deep CNN for Spectrum Sensing in Cognitive Radio., , , , , and . ICCT, page 236-240. IEEE, (2022)