Author of the publication

Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models

PDF (November 2023). arXiv:2311.00871.
DOI: 10.48550/arXiv.2311.00871

Other publications of authors with the same name

Iterative Hard Thresholding for Keyword Extraction from Large Text Corpora. ICMLA, pages 588-593. IEEE (2014)
Estimation and Validation of a Class of Conditional Average Treatment Effects Using Observational Data. CoRR (2019)
Underspecification Presents Challenges for Credibility in Modern Machine Learning. CoRR (2020)
Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models. CoRR (2023)
Diagnosing Model Performance Under Distribution Shift. CoRR (2023)
Boosting the interpretability of clinical risk scores with intervention predictions. CoRR (2022)
A Calibration Metric for Risk Scores with Survival Data. MLHC, volume 106 of Proceedings of Machine Learning Research, pages 424-450. PMLR (2019)
Underspecification Presents Challenges for Credibility in Modern Machine Learning. J. Mach. Learn. Res. (2022)
The MultiBERTs: BERT Reproductions for Robustness Analysis. CoRR (2021)
The MultiBERTs: BERT Reproductions for Robustness Analysis. ICLR, OpenReview.net (2022)