Author of the publication

HALP: Heuristic Aided Learned Preference Eviction Policy for YouTube Content Delivery Network.

NSDI, page 1149-1163. USENIX Association, (2023)


No persons found for author name Gummadi, Ramki

Other publications of authors with the same name

Surrogate Objectives for Batch Policy Optimization in One-step Decision Making. NeurIPS, page 8825-8835. (2019)
HALP: Heuristic Aided Learned Preference Eviction Policy for YouTube Content Delivery Network. NSDI, page 1149-1163. USENIX Association, (2023)
Characterizing the Gap Between Actor-Critic and Policy Gradient. ICML, volume 139 of Proceedings of Machine Learning Research, page 11101-11111. PMLR, (2021)
Satisficing Exploration for Deep Reinforcement Learning. CoRR, (2024)
A Parametric Class of Approximate Gradient Updates for Policy Optimization. ICML, volume 162 of Proceedings of Machine Learning Research, page 7998-8015. PMLR, (2022)
Variational Rejection Sampling. AISTATS, volume 84 of Proceedings of Machine Learning Research, page 823-832. PMLR, (2018)
Feasible Q-Learning for Average Reward Reinforcement Learning. AISTATS, volume 238 of Proceedings of Machine Learning Research, page 1630-1638. PMLR, (2024)
Understanding and Leveraging Overparameterization in Recursive Value Estimation. ICLR, OpenReview.net, (2022)
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation. ICML, OpenReview.net, (2024)