Junjie Bai
Junjie Bai
Verified email at
Cited by
Cited by
Pytorch: An imperative style, high-performance deep learning library
A Paszke, S Gross, F Massa, A Lerer, J Bradbury, G Chanan, T Killeen, ...
Advances in neural information processing systems 32, 2019
Onnx: Open neural network exchange
J Bai, F Lu, K Zhang
GitHub repository, 2017
Advances in Neural Information Processing Systems 32, Curran Associates
A Paszke, S Gross, F Massa, A Lerer, J Bradbury, G Chanan, T Killeen, ...
Inc., New York, 8024, 2019
DISC: A dynamic shape compiler for machine learning workloads
K Zhu, WY Zhao, Z Zheng, TY Guo, PZ Zhao, JJ Bai, J Yang, XY Liu, ...
Proceedings of the 1st Workshop on Machine Learning and Systems, 89-95, 2021
Parameter-efficient sparsity for large language models fine-tuning
Y Li, F Luo, C Tan, M Wang, S Huang, S Li, J Bai
arXiv preprint arXiv:2205.11005, 2022
Distrifusion: Distributed parallel inference for high-resolution diffusion models
M Li, T Cai, J Cao, Q Zhang, H Cai, J Bai, Y Jia, K Li, S Han
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Bladedisc: Optimizing dynamic shape machine learning workloads via compiler approach
Z Zheng, Z Pan, D Wang, K Zhu, W Zhao, T Guo, X Qiu, M Sun, J Bai, ...
Proceedings of the ACM on Management of Data 1 (3), 1-29, 2023
RECom: A Compiler Approach to Accelerating Recommendation Model Inference with Massive Embedding Columns
Z Pan, Z Zheng, F Zhang, R Wu, H Liang, D Wang, X Qiu, J Bai, W Lin, ...
Proceedings of the 28th ACM International Conference on Architectural …, 2023
MonoInfer: Enabling a New Monolithic Optimization Space for Neural Network Inference Tasks on Modern GPU-Centric Architectures
D Zhuang, Z ZHENG, H Xia, X Qiu, J Bai, W Lin, SL Song
The system can't perform the operation now. Try again later.
Articles 1–9