LRW-1000: A naturally-distributed large-scale benchmark for lip reading in the wild S Yang, Y Zhang, D Feng, M Yang, C Wang, J Xiao, K Long, S Shan, ... 2019 14th IEEE International Conference on Automatic Face & Gesture …, 2019 | 207 | 2019 |
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition Y Zhang, S Yang, J Xiao, S Shan, X Chen IEEE International Conference on Automatic Face and Gesture Recognition, 2020 | 92 | 2020 |
Deformation Flow Based Two-Stream Network for Lip Reading J Xiao, S Yang, Y Zhang, S Shan, X Chen IEEE International Conference on Automatic Face and Gesture Recognition, 2020 | 91 | 2020 |
Learn an effective lip reading model without pains D Feng, S Yang, S Shan, X Chen arXiv preprint arXiv:2011.07557, 2020 | 88 | 2020 |
Mutual Information Maximization for Effective Lip Reading X Zhao, S Yang, S Shan, X Chen IEEE International Conference on Automatic Face and Gesture Recognition, 2020 | 85 | 2020 |
Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading M Luo, S Yang, S Shan, X Chen IEEE International Conference on Automatic Face and Gesture Recognition, 2020 | 55 | 2020 |
Multi-task sparse learning with beta process prior for action recognition C Yuan, W Hu, G Tian, S Yang, H Wang Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2013 | 55 | 2013 |
UniCon: Unified Context Network for Robust Active Speaker Detection Y Zhang, S Liang, S Yang, X Liu, Z Wu, S Shan, X Chen ACM Multimedia, 2021 | 48 | 2021 |
Learning human actions by combining global dynamics and local appearance G Luo, S Yang, G Tian, C Yuan, W Hu, SJ Maybank IEEE Transactions on Pattern Analysis and Machine Intelligence 36 (12), 2466 …, 2014 | 37 | 2014 |
Multi-Task Learning for Audio-Visual Active Speaker Detection YH Zhang, J Xiao, S Yang, S Shan The ActivityNet Large-Scale Activity Recognition Challenge 2019, 2019 | 35 | 2019 |
Multi-feature max-margin hierarchical bayesian model for action recognition S Yang, C Yuan, B Wu, W Hu, F Wang Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2015 | 35 | 2015 |
A hierarchical model based on latent dirichlet allocation for action recognition S Yang, C Yuan, W Hu, X Ding International Conference on Pattern Recognition, 2014, 2613-2618, 2014 | 30 | 2014 |
An Efficient Software for Building Lip Reading Models Without Pains D Feng, S Yang, S Shan IEEE International Conference on Multimedia & Expo Workshops (ICMEW), 2021 | 22 | 2021 |
Synchronous Bidirectional Learning for Multilingual Lip Reading M Luo, S Yang, X Chen, Z Liu, S Shan The British Machine Vision Conference (BMVC) 2020, 2020 | 18 | 2020 |
Decoding silent speech from high-density surface electromyographic data using transformer R Song, X Zhang, X Chen, X Chen, X Chen, S Yang, E Yin Biomedical Signal Processing and Control 80, 104298, 2023 | 12 | 2023 |
Online Detection and Tracking Method of Foreign Substances in Ampoules in High-speed Pharmaceutical Lines S Yang, Y Wang Chinese Journal of Scientific Instrument 32 (003), 488-494, 2011 | 10 | 2011 |
UniCon+: ICTCAS-UCAS Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2022 Y Zhang, S Liang, S Yang, S Shan arXiv preprint arXiv:2206.10861, 2022 | 6 | 2022 |
Audio-driven deformation flow for effective lip reading D Feng, S Yang, S Shan, X Chen 2022 26th International Conference on Pattern Recognition (ICPR), 274-280, 2022 | 4 | 2022 |
ICTCAS-UCAS-TAL Submission to the AVA-ActiveSpeaker Task at ActivityNet Challenge 2021 Y Zhang, S Liang, S Yang, X Liu, Z Wu, S Shan The ActivityNet Large-Scale Activity Recognition Challenge 2021, 2021 | 4 | 2021 |
Combining sparse appearance features and dense motion features via random forest for action detection S Yang, C Yuan, H Wang, W Hu IEEE International Conference on Acoustics, Speech and Signal Processing …, 2013 | 3 | 2013 |