PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-Time Execution on Mobile Devices. X Ma, FM Guo, W Niu, X Lin, J Tang, K Ma, B Ren, Y Wang AAAI'20, 5117-5124, 2020 | 27 | 2020 |
PatDNN: Achieving real-time DNN execution on mobile devices with pattern-based weight pruning W Niu, X Ma, S Lin, S Wang, X Qian, X Lin, Y Wang, B Ren ASPLOS'20, 907-922, 2020 | 20 | 2020 |
26ms inference time for ResNet-50: Towards real-time execution of all DNNs on smartphone W Niu, X Ma, Y Wang, B Ren ICML2020 workshop, 2019 | 9 | 2019 |
RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition P Dong, S Wang, W Niu, C Zhang, S Lin, Z Li, Y Gong, B Ren, X Lin, ... 2020 57th ACM/IEEE Design Automation Conference (DAC), 2020 | 5 | 2020 |
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices X Ma, W Niu, T Zhang, S Liu, FM Guo, S Lin, H Li, X Chen, J Tang, K Ma, ... ECCV'20: Proceedings of the European Conference on Computer Vision, 2020 | 2 | 2020 |
A Privacy-Preserving DNN Pruning and Mobile Acceleration Framework Z Zhan, Y Gong, Z Li, P Zhao, X Ma, W Niu, X Xu, B Ren, Y Wang, X Lin GLSVLSI '20: Proceedings of the 2020 on Great Lakes Symposium on VLSI, 2020 | 1 | 2020 |
BLK-REW: A Unified Block-based DNN Pruning Framework using Reweighted Regularization Method X Ma, Z Li, Y Gong, T Zhang, W Niu, Z Zhan, P Zhao, J Tang, X Lin, B Ren, ... arXiv preprint arXiv:2001.08357, 2020 | 1 | 2020 |
Achieving Real-Time LiDAR 3D Object Detection on a Mobile Device P Zhao, W Niu, G Yuan, Y Cai, HH Sung, W Wen, S Liu, X Shen, B Ren, ... arXiv preprint arXiv:2012.13801, 2020 | | 2020 |
6.7 ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration Z Li, G Yuan, W Niu, Y Li, P Zhao, Y Cai, X Shen, Z Zhan, Z Kong, Q Jin, ... arXiv preprint arXiv:2012.00596, 2020 | | 2020 |
An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning C Zhang, G Yuan, W Niu, J Tian, S Jin, D Zhuang, Z Jiang, Y Wang, B Ren, ... arXiv preprint arXiv:2011.10170, 2020 | | 2020 |
Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization W Niu, Z Kong, G Yuan, W Jiang, J Guan, C Ding, P Zhao, S Liu, B Ren, ... arXiv preprint arXiv:2009.06823, 2020 | | 2020 |
YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design Y Cai, H Li, G Yuan, W Niu, Y Li, X Tang, B Ren, Y Wang AAAI'21, 2020 | | 2020 |
Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices W Niu, M Sun, Z Li, JA Chen, J Guan, X Shen, Y Wang, X Lin, B Ren AAAI'21, 2020 | | 2020 |
Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization W Niu, P Zhao, Z Zhan, X Lin, Y Wang, B Ren arXiv preprint arXiv:2004.11250, 2020 | | 2020 |