T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval X Wang, L Zhu, Y Yang IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 | 203 | 2021 |
CenterCLIP: Token Clustering for Efficient Text-Video Retrieval S Zhao, L Zhu, X Wang, Y Yang SIGIR 2022, 2022 | 119 | 2022 |
Symbiotic attention for egocentric action recognition with object-centric alignment X Wang, L Zhu, Y Wu, Y Yang IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020 | 102 | 2020 |
Learning to anticipate egocentric actions by imagination Y Wu, L Zhu, X Wang, Y Yang, F Wu IEEE Transactions on Image Processing 30, 1143-1152, 2020 | 87 | 2020 |
Large-Scale Video Panoptic Segmentation in the Wild: A Benchmark J Miao, X Wang, Y Wu, W Li, X Zhang, Y Wei, Y Yang IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022 | 85 | 2022 |
Symbiotic attention with privileged information for egocentric action recognition X Wang, Y Wu, L Zhu, Y Yang Proceedings of the AAAI Conference on Artificial Intelligence 34 (07), 12249 …, 2020 | 80 | 2020 |
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models W Wu, X Wang, H Luo, J Wang, Y Yang, W Ouyang IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023 | 78 | 2023 |
Interactive Prototype Learning for Egocentric Action Recognition X Wang, L Zhu, H Wang, Y Yang ICCV 2021, 2021 | 73 | 2021 |
Parameter-efficient person re-identification in the 3d space Z Zheng, X Wang, N Zheng, Y Yang IEEE Transactions on Neural Networks and Learning Systems, 2022 | 72 | 2022 |
Align and Tell: Boosting Text-video Retrieval with Local Alignment and Fine-grained Supervision X Wang, L Zhu, Z Zheng, M Xu, Y Yang IEEE Transactions on Multimedia (TMM), 2022 | 70 | 2022 |
Bird's-Eye-View Scene Graph for Vision-Language Navigation R Liu, X Wang, W Wang, Y Yang ICCV 2023, 2023 | 38 | 2023 |
Point cloud pre-training by mixing and disentangling C Sun, Z Zheng, X Wang, M Xu, Y Yang arXiv e-prints, arXiv: 2109.00452, 2021 | 35* | 2021 |
Lana: A Language-Capable Navigator for Instruction Following and Generation X Wang, W Wang, J Shao, Y Yang IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023 | 34 | 2023 |
Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation X Shen, Z Yang, X Wang, J Ma, C Zhou, Y Yang IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023 | 33 | 2023 |
Connecting language and vision for natural language-based vehicle retrieval S Bai, Z Zheng, X Wang, J Lin, Z Zhang, C Zhou, H Yang, Y Yang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 32 | 2021 |
Gloss-Free End-to-End Sign Language Translation K Lin, X Wang, L Zhu, K Sun, B Zhang, Y Yang ACL 2023 (Oral), 2023 | 28 | 2023 |
Scalable video object segmentation with identification mechanism Z Yang, J Miao, Y Wei, W Wang, X Wang, Y Yang IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 28* | 2023 |
Clustering based point cloud representation learning for 3d analysis T Feng, W Wang, X Wang, Y Yang, Q Zheng ICCV 2023, 2023 | 27 | 2023 |
VideoAgent: Long-form Video Understanding with Large Language Model as Agent X Wang, Y Zhang, O Zohar, S Yeung-Levy ECCV 2024, 2024 | 23 | 2024 |
Baidu-UTS Submission to the EPIC-Kitchens Action Recognition Challenge 2019 X Wang, Y Wu, L Zhu, Y Yang IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshop, 2019 | 22 | 2019 |