Lianmin Zheng
TVM: An automated end-to-end optimizing compiler for deep learning
T Chen, T Moreau, Z Jiang, L Zheng, E Yan, H Shen, M Cowan, L Wang, ...
13th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2018
Learning to optimize tensor programs
T Chen, L Zheng, E Yan, Z Jiang, T Moreau, L Ceze, C Guestrin, ...
Advances in Neural Information Processing Systems 31, 2018
MAgent: A many-agent reinforcement learning platform for artificial collective intelligence
L Zheng, J Yang, H Cai, M Zhou, W Zhang, J Wang, Y Yu
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
A hardware–software blueprint for flexible deep learning specialization
T Moreau, T Chen, L Vega, J Roesch, E Yan, L Zheng, J Fromm, Z Jiang, ...
IEEE Micro 39 (5), 8-16, 2019
Ansor: Generating High-Performance Tensor Programs for Deep Learning
L Zheng, C Jia, M Sun, Z Wu, CH Yu, A Haj-Ali, Y Wang, J Yang, D Zhuo, ...
14th USENIX symposium on operating systems design and implementation (OSDI …, 2020
A unified optimization approach for CNN model inference on integrated GPUs
L Wang, Z Chen, Y Liu, Y Wang, L Zheng, M Li, Y Wang
Proceedings of the 48th International Conference on Parallel Processing, 1-10, 2019
ActNN: Reducing training memory footprint via 2-bit activation compressed training
J Chen, L Zheng, Z Yao, D Wang, I Stoica, M Mahoney, J Gonzalez
International Conference on Machine Learning, 1803-1813, 2021
Optimizing deep learning workloads on ARM GPU with TVM
L Zheng, T Chen
Proceedings of the 1st on Reproducible Quality-Efficient Systems Tournament …, 2018
Tenset: A large-scale program performance dataset for learned tensor compilers
L Zheng, R Liu, J Shao, T Chen, JE Gonzalez, I Stoica, AH Ali
Thirty-fifth Conference on Neural Information Processing Systems Datasets …, 2021
Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning
L Zheng, Z Li, H Zhang, Y Zhuang, Z Chen, Y Huang, Y Wang, Y Xu, ...
arXiv preprint arXiv:2201.12023, 2022
GACT: Activation Compressed Training for Generic Network Architectures
X Liu, L Zheng, D Wang, Y Cen, W Chen, X Han, J Chen, Z Liu, J Tang, ...
International Conference on Machine Learning, 14139-14152, 2022
TensorIR: An Abstraction for Automatic Tensorized Program Optimization
S Feng, B Hou, H Jin, W Lin, J Shao, R Lai, Z Ye, L Zheng, CH Yu, Y Yu, ...
arXiv preprint arXiv:2207.04296, 2022
NumS: Scalable Array Programming for the Cloud
M Elibol, V Benara, S Yagati, L Zheng, A Cheung, MI Jordan, I Stoica
arXiv preprint arXiv:2206.14276, 2022