Hi, I'm Tianhang Zhu. I'm Head of LLM Training at Fundamental Research Labs, where I lead training for Ava and our broader effort to build digital human beings — autonomous, collaborative, and socially intelligent agents. Previously I was Head of Reinforcement Learning at 01.ai, leading online RLHF training, reward modeling, and reasoning research for Yi models, and before that I was a Senior Research Scientist on the Qwen LLM team at Alibaba DAMO, where I helped deploy Qwen-max and co-authored the Qwen, Qwen2, and Qwen3 technical reports. I hold two Master's degrees from the Georgia Institute of Technology and a Bachelor's in Computer Science and Actuarial Science from the University of Waterloo. My research interests include reinforcement learning from human feedback, large language model training, emergent reasoning, and multi-agent simulation.
Menlo Park, CA
tianhang.zhu[at]alibaba-inc.com / bobzhu1991[at]outlook.com
+1 (404) 281 9788
Qwen3 Technical Report
An Yang, Anfeng Li, Baosong Yang, Binyuan Hui, Bo Zheng, Bowen Yu, Dayiheng Liu, Fei Huang, Huan Lin, Jian Yang, Junyang Lin, Peng Wang, Tianhang Zhu, et al.
arXiv preprint arXiv:2505.09388
Yi-Lightning Technical Report
Alan Wake, Bei Chen, Chao Li, Chengen Huang, Chujie Zheng, Fan Zhou, Feng Hu, Ge Zhang, Guoyin Wang, Heng Ji, Tianhang Zhu, et al.
arXiv preprint arXiv:2412.01253
Qwen2 Technical Report
An Yang, Baosong Yang, Binyuan Hui, Bo Zheng, Bowen Yu, Dayiheng Liu, Fei Huang, Guanting Dong, Haoran Wei, Huan Lin, Jian Yang, Junyang Lin, Peng Wang, Tianhang Zhu, et al.
arXiv preprint arXiv:2407.10671
Qwen Technical Report
Jinze Bai, Shuai Bai, Yunfei Chu, Zeyu Cui, Kai Dang, Yang Fan, Fei Huang, Binyuan Hui, Junyang Lin, Runji Lin, Dayiheng Liu, Rui Men, Jianxin Ma, Xingzhang Ren, Peng Wang, Shijie Wang, An Yang, Tianhang Zhu, et al.
arXiv preprint arXiv:2309.16609