From post

NLP From Scratch Without Large-Scale Pretraining: A Simple and Efficient Framework.

, , , и . ICML, том 162 из Proceedings of Machine Learning Research, стр. 25438-25451. PMLR, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Research on Medium- and Long-Term Operation Simulation Method Based on Improved Universal Generating Function., , и . IEEE Access, (2019)CPM: A Large-scale Generative Chinese Pre-trained Language Model., , , , , , , , , и 15 other автор(ы). CoRR, (2020)On the Mean Local Delay of Clustered Fog Radio Access Networks., , , , и . PIMRC, стр. 354-359. IEEE, (2021)FlipDA: Effective and Robust Data Augmentation for Few-Shot Learning., , , , и . ACL (1), стр. 8646-8665. Association for Computational Linguistics, (2022)Test4Deep: an Effective White-Box Testing for Deep Neural Networks., , , , и . CSE/EUC, стр. 16-23. IEEE, (2019)A Latent-Constrained Variational Neural Dialogue Model for Information-Rich Responses., , , и . CIKM, стр. 1351-1360. ACM, (2019)Compositional Task Representations for Large Language Models., , , , , и . ICLR, OpenReview.net, (2023)On the Performance of Data Compression in Clustered Fog Radio Access Networks., , , , , и . CoRR, (2022)Generating Contextually Coherent Responses by Learning Structured Vectorized Semantics., , , , , и . DASFAA (2), том 12682 из Lecture Notes in Computer Science, стр. 70-87. Springer, (2021)Research on Grain Futures Price Prediction Based on a Bi-DSConvLSTM-Attention Model., , , и . Syst., 12 (6): 204 (2024)