OpGuard: Bitwise Alignment for Precise and General Debugging of Production LLM Training
Ziming Zhou, Yinjie Zhao, Hang Zhu, Wenxiao Wang, Zhihao Bai, Yun Zhang, Shuguang Wang, Haibin Lin, Peng Huang
OSDI 2026

I’m a Computer Science PhD student at University of Michigan - Ann Arbor, advised by Prof. Ryan Huang
My primary research focus involves solving the reliability and efficiency problems in (ML/Numerical/Distributed) systems.
Graduated with a strong foundation in AI-driven fluid flow diagnostics. Notable achievements include:
Ziming Zhou, Yinjie Zhao, Hang Zhu, Wenxiao Wang, Zhihao Bai, Yun Zhang, Shuguang Wang, Haibin Lin, Peng Huang
OSDI 2026
Yuxuan Jiang, Ziming Zhou, Boyu Xu, Beijie Liu, Runhui Xu, Peng Huang
OSDI 2025
@inproceedings{TrainCheckOSDI2025,
author = {Jiang, Yuxuan and Zhou, Ziming and Xu, Boyu and Liu, Beijie and Xu, Runhui and Huang, Peng},
title = {Training with Confidence: Catching Silent Errors in Deep Learning Training with Automated Proactive Checks},
booktitle = {Proceedings of the 19th USENIX Symposium on Operating Systems Design and Implementation},
series = {OSDI '25},
month = {July},
year = {2025},
address = {Boston, MA, USA},
publisher = {USENIX Association},
}