Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models.

G. Bhatt, Y. Chen, A. Das, J. Zhang, S. Truong, S. Mussmann, Y. Zhu, J. Bilmes, S. Du, K. Jamieson, J. Ash, and R. Nowak. ACL (Findings), page 6549-6560. Association for Computational Linguistics, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Simon S Lee

Marcus Simon

Erstellung und Validierung eines Fragebogens für die Patientenbeurteilung der perioperativen Phase, PPP-FragebogenM. Simon. Uni Marburg, (2009)

Christine Simon

Vollblutaggregation beim Hund: Vergleich der Methode nach Born mit einer errechneten Aggregation nach Thrombozytenzählung mittels konventioneller HämatologiesystemeC. Simon. Uni Gießen, (2009)

Juan Du

Fei Du

Other publications of authors with the same name

What Can Neural Networks Reason About?K. Xu, J. Li, M. Zhang, S. Du, K. ichi Kawarabayashi, and S. Jegelka. CoRR, (2019)Gradient Descent Finds Global Minima of Deep Neural Networks.S. Du, J. Lee, H. Li, L. Wang, and X. Zhai. CoRR, (2018)Near-Optimal Randomized Exploration for Tabular Markov Decision Processes.Z. Xiong, R. Shen, Q. Cui, M. Fazel, and S. Du. NeurIPS, (2022)Planning with General Objective Functions: Going Beyond Total Rewards.R. Wang, P. Zhong, S. Du, R. Salakhutdinov, and L. Yang. NeurIPS, (2020)Agnostic $Q$-learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity.S. Du, J. Lee, G. Mahajan, and R. Wang. NeurIPS, (2020)On Reward-Free Reinforcement Learning with Linear Function Approximation.R. Wang, S. Du, L. Yang, and R. Salakhutdinov. NeurIPS, (2020)Is Long Horizon RL More Difficult Than Short Horizon RL?R. Wang, S. Du, L. Yang, and S. Kakade. NeurIPS, (2020)Over-Parameterization Exponentially Slows Down Gradient Descent for Learning a Single Neuron.W. Xu, and S. Du. COLT, volume 195 of Proceedings of Machine Learning Research, page 1155-1198. PMLR, (2023)Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap.H. Xu, T. Ma, and S. Du. COLT, volume 134 of Proceedings of Machine Learning Research, page 4438-4472. PMLR, (2021)On Reinforcement Learning with Adversarial Corruption and Its Application to Block MDP.T. Wu, Y. Yang, S. Du, and L. Wang. ICML, volume 139 of Proceedings of Machine Learning Research, page 11296-11306. PMLR, (2021)

BibSonomy

Disambiguation of "Du, Simon S."

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models.

Please choose a person to relate this publication to

Simon S Lee

Marcus Simon

Christine Simon

Juan Du

Fei Du

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Du, Simon S."

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models.

Please choose a person to relate this publication to

Simon S Lee

Marcus Simon

Christine Simon

Juan Du

Fei Du

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models.