Wei Niu
Cited by
Cited by
Patdnn: Achieving real-time DNN execution on mobile devices with pattern-based weight pruning
W Niu, X Ma, S Lin, S Wang, X Qian, X Lin, Y Wang, B Ren
ASPLOS'20, 907-922, 2020
PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-Time Execution on Mobile Devices.
X Ma, FM Guo, W Niu, X Lin, J Tang, K Ma, B Ren, Y Wang
AAAI'20, 5117-5124, 2020
YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Y Cai, H Li, G Yuan, W Niu, Y Li, X Tang, B Ren, Y Wang
AAAI'21, 2020
RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition
P Dong, S Wang, W Niu, C Zhang, S Lin, Z Li, Y Gong, B Ren, X Lin, ...
DAC'20, 2020
DNNFusion: accelerating deep neural networks execution with advanced operator fusion
W Niu, J Guan, Y Wang, G Agrawal, B Ren
PLDI'2021, 883-898, 2021
26ms inference time for resnet-50: Towards real-time execution of all dnns on smartphone
W Niu, X Ma, Y Wang, B Ren
ICML2019 workshop, 2019
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices
X Ma, W Niu, T Zhang, S Liu, FM Guo, S Lin, H Li, X Chen, J Tang, K Ma, ...
ECCV'20: Proceedings of the European Conference on Computer Vision, 2020
SPViT: Enabling Faster Vision Transformers via Soft Token Pruning
Z Kong, P Dong, X Ma, X Meng, W Niu, M Sun, B Ren, M Qin, H Tang, ...
arXiv preprint arXiv:2112.13890, 2021
Mest: Accurate and fast memory-economic sparse training framework on the edge
G Yuan, X Ma, W Niu, Z Li, Z Kong, N Liu, Y Gong, Z Zhan, C He, Q Jin, ...
Advances in Neural Information Processing Systems 34, 20838-20850, 2021
A Privacy-Preserving DNN Pruning and Mobile Acceleration Framework
Z Zhan, Y Gong, Z Li, P Zhao, X Ma, W Niu, X Xu, B Ren, Y Wang, X Lin
GLSVLSI '20: Proceedings of the 2020 on Great Lakes Symposium on VLSI, 2020
Clicktrain: Efficient and accurate end-to-end deep learning training via fine-grained architecture-preserving pruning
C Zhang, G Yuan, W Niu, J Tian, S Jin, D Zhuang, Z Jiang, Y Wang, B Ren, ...
Proceedings of the ACM International Conference on Supercomputing, 266-278, 2021
Zhenglun Kong, Ning Liu, Yifan Gong, Zheng Zhan, Chaoyang He, Qing Jin, et al. Mest: Accurate and fast memory-economic sparse training framework on the edge
G Yuan, X Ma, W Niu, Z Li
Advances in Neural Information Processing Systems (NeurIPS) 34, 8, 2021
Grim: A general, real-time deep learning inference framework for mobile devices based on fine-grained structured weight sparsity
W Niu, Z Li, X Ma, P Dong, G Zhou, X Qian, X Lin, Y Wang, B Ren
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021
CoCoPIE: enabling real-time AI on off-the-shelf mobile devices via compression-compilation co-design
H Guan, S Liu, X Ma, W Niu, B Ren, X Shen, Y Wang, P Zhao
Communications of the ACM 64 (6), 62-68, 2021
Real-time mobile acceleration of dnns: From computer vision to medical applications
H Li, G Yuan, W Niu, Y Cai, M Sun, Z Li, B Ren, X Lin, Y Wang
2021 26th Asia and South Pacific Design Automation Conference (ASP-DAC), 581-586, 2021
Achieving on-mobile real-time super-resolution with neural architecture and pruning search
Z Zhan, Y Gong, P Zhao, G Yuan, W Niu, Y Wu, T Zhang, M Jayaweera, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Blk-rew: A unified block-based dnn pruning framework using reweighted regularization method
X Ma, Z Li, Y Gong, T Zhang, W Niu, Z Zhan, P Zhao, J Tang, X Lin, B Ren, ...
arXiv preprint arXiv:2001.08357, 2020
Towards real-time DNN inference on mobile platforms with model pruning and compiler optimization
W Niu, P Zhao, Z Zhan, X Lin, Y Wang, B Ren
arXiv preprint arXiv:2004.11250, 2020
NPAS: A Compiler-aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration
Z Li, G Yuan, W Niu, P Zhao, Y Li, Y Cai, X Shen, Z Zhan, Z Kong, Q Jin, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Towards fast and accurate multi-person pose estimation on mobile devices
X Shen, G Yuan, W Niu, X Ma, J Guan, Z Li, B Ren, Y Wang
arXiv preprint arXiv:2106.15304, 2021
The system can't perform the operation now. Try again later.
Articles 1–20