Author of the publication

Understanding the Difficulty of Training Transformers.

, , , , and . EMNLP (1), page 5747-5763. Association for Computational Linguistics, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Using Expert Knowledge in Database-Oriented Problem Solving., and . ICIS, page 2. Association for Information Systems, (1985)Metarule-Guided Mining of Multi-Dimensional Association Rules Using Data Cubes., , and . KDD, page 207-210. AAAI Press, (1997)How Can Data Mining Help Bio-Data Analysis?. BIOKDD, page 1-2. (2002)Geo-Spatial Clustering with User-Specified Constraints., , , and . MDM/KDD, page 1-7. University of Alberta, (2000)Probabilistic Models for Text Mining., , and . Mining Text Data, Springer, (2012)Chain-Split Evaluation in Deductive Databases.. IEEE Trans. Knowl. Data Eng., 7 (2): 261-273 (1995)LinkClus: Efficient clustering via heterogeneous semantic links, and . In VLDB, page 427--438. (2006)Discovering spatial associations in images., and . Data Mining and Knowledge Discovery: Theory, Tools, and Technology, volume 4057 of SPIE Proceedings, page 138-147. SPIE, (2000)Mining Control Flow Abnormality for Logic Error Isolation., , and . SDM, page 106-117. SIAM, (2006)Data Mining: Concepts and Techniques, 3rd edition, , and . Morgan Kaufmann, (2011)