Same Pre-training Loss, Better Downstream: Implicit Bias Matters for Language Models.

ICML, volume 202 of Proceedings of Machine Learning Research, pp. 22188-22214. PMLR, (2023)


Other publications by persons with the same name

Evaluate the Correlation Between Electrocardiogram Age and Cardiovascular Disease Using a 12-lead ECG Dataset. FSDM, volume 378 of Frontiers in Artificial Intelligence and Applications, pp. 905-911. IOS Press, (2023)

Interpretable Deep Learning Model for Identifying the Immediate Risk of Myocardial Infarction Complications. FSDM, volume 378 of Frontiers in Artificial Intelligence and Applications, pp. 918-924. IOS Press, (2023)

Integrated estimator and L1 adaptive controller for well drilling systems. ACC, pp. 1958-1963. IEEE, (2009)

Understanding the Impact of Quantum Noise on Quantum Programs. SANER, pp. 426-437. IEEE, (2023)

A Robust LiDAR-based SLAM for Autonomous Vehicles aided by GPS/INS Integrated Navigation System. CACRE, pp. 351-358. IEEE, (2021)

A Cross-Domain Authentication Scheme Based Master-Slave Chain in Edge Computing. SGIoT, volume 497 of Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, pp. 48-57. Springer, (2022)

Performance Prediction of ℒ₁ Adaptive Control in Linear Stochastic Systems. CDC, pp. 1-6. IEEE, (2013)

Comparative Analysis of Topical Evolution Patterns and Temporal Trends of Hypertension Research. MedInfo, volume 264 of Studies in Health Technology and Informatics, pp. 308-312. IOS Press, (2019)

Interprocedural Analysis Based on Guarded Array Regions. Compiler Optimizations for Scalable Parallel Systems Languages, volume 1808 of Lecture Notes in Computer Science, pp. 221-246. Springer, (2001)

Understanding the Generalization Benefit of Normalization Layers: Sharpness Reduction. NeurIPS, (2022)