Home

数据中心场景下的高性能、高效率机器学习系统


Speaker

Zhihao Bai,Johns Hopkins University

Time

2021-08-23 09:30:00 ~ 2021-08-23 11:00:00

Location

腾讯线上会议(会议ID:934 642 806, 会议密码:717828)

Host

金海明

Abstract

Machine learning powers an emerging family of intelligent applications in many domains, from retail and transportation, to finance and healthcare. To support computation-intensive machine learning tasks, enterprises usually build large-scale GPU clusters, which presents challenges on the performance and efficiency of the machine learning systems. On the other hand, machine learning itself is also used to improve the system performance. In this talk, I will share two of my projects. The first one is PipeSwitch, which is published in OSDI’20. It allows multiple deep learning applications to time-share the same GPU with millisecond-scale switching overhead. With PipeSwitch, GPU utilization can be significantly improved without sacrificing SLOs.  The second project is RegexNet, which is published in S&P’21. It adopts a deep learning model, which is updated constantly in a feedback loop during runtime, to detect ReDoS attacks, which is a Denial-of-Service attack exploiting vulnerable regular expressions.


Bio

Zhihao Bai is a 5th year Ph.D. student at Johns Hopkins University, advised by Professor Xin Jin. He works on computer systems, with a focus on machine learning systems. Before entering Johns Hopkins University, he received B.S. from Shanghai Jiao Tong University in 2017, and he was a member of ACM Honored Class, Zhiyuan College. He published many papers on top conferences, including USENIX OSDI (2020) and IEEE Security and Privacy (2021). He also received USENIX FAST Best Paper Award in 2019.


© John Hopcroft Center for Computer Science, Shanghai Jiao Tong University
分享到

地址:上海市东川路800号上海交通大学软件大楼专家楼
邮箱:jhc@sjtu.edu.cn 电话:021-54740299
邮编:200240