Language-driven temporal activity localization: A semantic matching reinforcement learning model W Wang, Y Huang, L Wang Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 201 | 2019 |
Temporal memory attention for video semantic segmentation H Wang, W Wang, J Liu 2021 IEEE International Conference on Image Processing (ICIP), 2254-2258, 2021 | 95 | 2021 |
VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset J Liu, S Chen, X He, L Guo, X Zhu, W Wang, J Tang IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 | 88* | 2024 |
Hair: Hierarchical visual-semantic relational reasoning for video question answering F Liu, J Liu, W Wang, H Lu Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 59 | 2021 |
CASIA-E: a large comprehensive dataset for gait recognition C Song, Y Huang, W Wang, L Wang IEEE transactions on pattern analysis and machine intelligence 45 (3), 2801-2815, 2022 | 53 | 2022 |
Multi-caption text-to-face synthesis: Dataset and algorithm J Sun, Q Li, W Wang, J Zhao, Z Sun Proceedings of the 29th ACM International Conference on Multimedia, 2290-2298, 2021 | 53 | 2021 |
OPT: Omni-perception pre-trainer for cross-modal understanding and generation J Liu, X Zhu, F Liu, L Guo, Z Zhao, M Sun, W Wang, H Lu, S Zhou, J Zhang, ... arXiv preprint arXiv:2107.00249, 2021 | 47 | 2021 |
Long video question answering: A matching-guided attention model W Wang, Y Huang, L Wang Pattern Recognition 102, 107248, 2020 | 23 | 2020 |
AutoCaption: Image captioning with neural architecture search X Zhu, W Wang, L Guo, J Liu arXiv preprint arXiv:2012.09742, 2020 | 18 | 2020 |
Learning disentangled representation for one-shot progressive face swapping Q Li, W Wang, C Xu, Z Sun, MH Yang IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 | 17 | 2024 |
An efficient sampling-based attention network for semantic segmentation X He, J Liu, W Wang, H Lu IEEE Transactions on Image Processing 31, 2850-2863, 2022 | 16 | 2022 |
Face sketch synthesis via semantic-driven generative adversarial network X Qi, M Sun, W Wang, X Dong, Q Li, C Shan 2021 IEEE International Joint Conference on Biometrics (IJCB), 1-8, 2021 | 15 | 2021 |
MOSO: Decomposing motion, scene and object for video prediction M Sun, W Wang, X Zhu, J Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 13 | 2023 |
Super-resolution semantic segmentation with relation calibrating network J Jiang, J Liu, J Fu, W Wang, H Lu Pattern Recognition 124, 108501, 2022 | 12 | 2022 |
Sounding video generator: A unified framework for text-guided sounding video generation J Liu, W Wang, S Chen, X Zhu, J Liu IEEE Transactions on Multimedia 26, 141-153, 2023 | 9 | 2023 |
Learnable Feature Augmentation Framework for Temporal Action Localization Y Tang, W Wang, C Zhang, J Liu, Y Zhao IEEE Transactions on Image Processing, 2024 | 7 | 2024 |
Semi-supervised temporal action proposal generation via exploiting 2-D proposal map W Wang, T Lin, D He, F Li, S Wen, L Wang, J Liu IEEE Transactions on Multimedia 24, 3624-3635, 2021 | 7 | 2021 |
Ed-t2v: An efficient training framework for diffusion-based text-to-video generation J Liu, W Wang, W Liu, Q He, J Liu 2023 International Joint Conference on Neural Networks (IJCNN), 1-8, 2023 | 6 | 2023 |
Anchor-free temporal action localization via Progressive Boundary-aware Boosting Y Tang, W Wang, Y Yang, C Zhang, J Liu Information Processing & Management 60 (1), 103141, 2023 | 6 | 2023 |
CAS-AIR-3D face: A low-quality, multi-modal and multi-pose 3D face database Q Li, X Dong, W Wang, C Shan 2021 IEEE International Joint Conference on Biometrics (IJCB), 1-8, 2021 | 6 | 2021 |