Author of the publication

NENYA: Cascade Reinforcement Learning for Cost-Aware Failure Mitigation at Microsoft 365.

, , , , , , , , , , , , and . KDD, page 4032-4040. ACM, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Gray Failure, , , , , , and . Proceedings of the 16th Workshop on Hot Topics in Operating Systems - HotOS \textquotesingle17, ACM Press, (2017)NTAM: Neighborhood-Temporal Attention Model for Disk Failure Prediction in Cloud Platforms., , , , , , , , , and 1 other author(s). WWW, page 1181-1191. ACM / IW3C2, (2021)How Long Will it Take to Mitigate this Incident for Online Service Systems?, , , , , , , , , and 3 other author(s). ISSRE, page 36-46. IEEE, (2021)Towards Lightweight, Model-Agnostic and Diversity-Aware Active Anomaly Detection., , , , , , , , and . ICLR, OpenReview.net, (2023)ReBucket: A method for clustering duplicate crash reports based on call stack similarity., , , , and . ICSE, page 1084-1093. IEEE Computer Society, (2012)YADING: Fast Clustering of Large-Scale Time Series Data., , , , , and . Proc. VLDB Endow., 8 (5): 473-484 (2015)Predictive and Adaptive Failure Mitigation to Avert Production Cloud VM Interruptions., , , , , , , , , and 3 other author(s). OSDI, page 1155-1170. USENIX Association, (2020)RESIN: A Holistic Service for Dealing with Memory Leaks in Production Cloud Infrastructure., , , , , , , , and . OSDI, page 109-125. USENIX Association, (2022)Onion: identifying incident-indicating logs for cloud systems., , , , , , , , , and 3 other author(s). ESEC/SIGSOFT FSE, page 1253-1263. ACM, (2021)SPINE: a scalable log parser with feedback guidance., , , , , , , , , and 2 other author(s). ESEC/SIGSOFT FSE, page 1198-1208. ACM, (2022)