Phi-3 technical report: A highly capable language model locally on your phone M Abdin, J Aneja, H Awadalla, A Awadallah, AA Awan, N Bach, A Bahree, ... arXiv preprint arXiv:2404.14219, 2024 | 982 | 2024 |
Bidirectional learning for domain adaptation of semantic segmentation Y Li, L Yuan, N Vasconcelos Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 789 | 2019 |
Explainable object-induced action decision for autonomous vehicles Y Xu, X Yang, L Gong, HC Lin, TY Wu, Y Li, N Vasconcelos Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 134 | 2020 |
Dynamic transfer for multi-source domain adaptation Y Li, L Yuan, Y Chen, P Wang, N Vasconcelos Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 95 | 2021 |
Micronet: Improving image recognition with extremely low flops Y Li, Y Chen, X Dai, D Chen, M Liu, L Yuan, Z Liu, L Zhang, ... Proceedings of the IEEE/CVF International conference on computer vision, 468-477, 2021 | 95 | 2021 |
Revisiting dynamic convolution via matrix decomposition Y Li, Y Chen, X Dai, M Liu, D Chen, Y Yu, L Yuan, Z Liu, M Chen, ... arXiv preprint arXiv:2103.08756, 2021 | 82 | 2021 |
Dense network expansion for class incremental learning Z Hu, Y Li, J Lyu, D Gao, N Vasconcelos Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 63 | 2023 |
Deep scene image classification with the MFAFVNet Y Li, M Dixit, N Vasconcelos Proceedings of the IEEE international conference on computer vision, 5746-5754, 2017 | 62 | 2017 |
Efficient multi-domain learning by covariance normalization Y Li, N Vasconcelos Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 40 | 2019 |
MicroNet: Towards image recognition with extremely low FLOPs Y Li, Y Chen, X Dai, D Chen, M Liu, L Yuan, Z Liu, L Zhang, ... arXiv preprint arXiv:2011.12289, 2020 | 29 | 2020 |
Semantic fisher scores for task transfer: Using objects to classify scenes M Dixit, Y Li, N Vasconcelos IEEE transactions on pattern analysis and machine intelligence 42 (12), 3102 …, 2019 | 18 | 2019 |
Rethinking visual prompting for multimodal large language models with external knowledge Y Lin, Y Li, D Chen, W Xu, R Clark, P Torr, L Yuan arXiv preprint arXiv:2407.04681, 2024 | 6 | 2024 |
Fully authentic visual question answering dataset from online communities C Chen, M Liu, N Codella, Y Li, L Yuan, D Gurari European Conference on Computer Vision, 252-269, 2024 | 4 | 2024 |
Should all proposals be treated equally in object detection? Y Li, Y Chen, X Dai, D Chen, M Liu, P Yu, Y Jin, L Yuan, Z Liu, ... European Conference on Computer Vision, 556-572, 2022 | 4 | 2022 |
SCHEME: Scalable Channel Mixer for Vision Transformers D Sridhar, Y Li, N Vasconcelos arXiv preprint arXiv:2312.00412, 2023 | 1 | 2023 |
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs A Abouelenin, A Ashfaq, A Atkinson, H Awadalla, N Bach, J Bao, ... arXiv preprint arXiv:2503.01743, 2025 | | 2025 |
Olympus: A Universal Task Router for Computer Vision Tasks Y Lin, Y Li, D Chen, W Xu, R Clark, PHS Torr arXiv preprint arXiv:2412.09612, 2024 | | 2024 |
SynChart: Synthesizing Charts from Language Models M Liu, Q Li, D Chen, D Chen, J Bao, Y Li arXiv preprint arXiv:2409.16517, 2024 | | 2024 |