Follow
Zhenheng Yang
Zhenheng Yang
ByteDance TikTok
Verified email at bytedance.com - Homepage
Title
Cited by
Cited by
Year
Tall: Temporal activity localization via language query
J Gao, C Sun, Z Yang, R Nevatia
Proceedings of the IEEE international conference on computer vision, 5267-5275, 2017
8342017
Turn tap: Temporal unit regression network for temporal action proposals
J Gao, Z Yang, K Chen, C Sun, R Nevatia
Proceedings of the IEEE international conference on computer vision, 3628-3636, 2017
5542017
Occlusion aware unsupervised learning of optical flow
Y Wang, Y Yang, Z Yang, L Zhao, P Wang, W Xu
Proceedings of the IEEE conference on computer vision and pattern …, 2018
3632018
Every pixel counts++: Joint learning of geometry and motion with 3d holistic understanding
C Luo, Z Yang, P Wang, Y Wang, W Xu, R Nevatia, A Yuille
IEEE transactions on pattern analysis and machine intelligence 42 (10), 2624 …, 2019
3282019
Cascaded boundary regression for temporal action detection
J Gao, Z Yang, R Nevatia
arXiv preprint arXiv:1705.01180, 2017
2552017
Unsupervised learning of geometry from videos with edge-aware depth-normal consistency
Z Yang, P Wang, W Xu, L Zhao, R Nevatia
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
2502018
Red: Reinforced encoder-decoder networks for action anticipation
J Gao, Z Yang, R Nevatia
arXiv preprint arXiv:1707.04818, 2017
2462017
SPAN: Spatial pyramid attention network for image manipulation localization
X Hu, Z Zhang, Z Jiang, S Chaudhuri, Z Yang, R Nevatia
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
2192020
Lego: Learning edge with geometry all at once by watching videos
Z Yang, P Wang, Y Wang, W Xu, R Nevatia
Proceedings of the IEEE conference on computer vision and pattern …, 2018
2132018
Unos: Unified unsupervised optical-flow and stereo-depth estimation by watching videos
Y Wang, P Wang, Z Yang, C Luo, Y Yang, W Xu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
1772019
A multi-scale cascade fully convolutional network face detector
Z Yang, R Nevatia
2016 23rd International Conference on Pattern Recognition (ICPR), 633-638, 2016
1292016
Every pixel counts: Unsupervised geometry learning with holistic 3d motion understanding
Z Yang, P Wang, Y Wang, W Xu, R Nevatia
Proceedings of the European conference on computer vision (ECCV) workshops, 0-0, 2018
1182018
Spatio-temporal action detection with cascade proposal and location anticipation
Z Yang, J Gao, R Nevatia
arXiv preprint arXiv:1708.00042, 2017
712017
Activity driven weakly supervised object detection
Z Yang, D Mahajan, D Ghadiyaram, R Nevatia, V Ramanathan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
382019
Weakly supervised instance segmentation for videos with temporal mask consistency
Q Liu, V Ramanathan, D Mahajan, A Yuille, Z Yang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
262021
Show-o: One single transformer to unify multimodal understanding and generation
J Xie, W Mao, Z Bai, DJ Zhang, W Wang, KQ Lin, Y Gu, Z Chen, Z Yang, ...
arXiv preprint arXiv:2408.12528, 2024
232024
Systems and methods for unsupervised learning of geometry from images using depth-normal consistency
P Wang, W Xu, Y Zhenheng
US Patent 10,803,546, 2020
232020
Face and body association for video-based face recognition
KG Kim, Z Yang, I Masi, R Nevatia, G Medioni
2018 IEEE Winter Conference on Applications of Computer Vision (WACV), 39-48, 2018
192018
Joint unsupervised learning of optical flow and depth by watching stereo videos
Y Wang, Z Yang, P Wang, Y Yang, C Luo, W Xu
arXiv preprint arXiv:1810.03654, 2018
182018
Openvid-1m: A large-scale high-quality dataset for text-to-video generation
K Nan, R Xie, P Zhou, T Fan, Z Yang, Z Chen, X Li, J Yang, Y Tai
arXiv preprint arXiv:2407.02371, 2024
92024
The system can't perform the operation now. Try again later.
Articles 1–20