Cris: Clip-driven referring image segmentation Z Wang, Y Lu, Q Li, X Tao, Y Guo, M Gong, T Liu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 326 | 2022 |
GINet: Graph interaction network for scene parsing T Wu, Y Lu, Y Zhu, C Zhang, M Wu, Z Ma, G Guo Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 44 | 2020 |
Show me a video: A large-scale narrated video dataset for coherent story illustration Y Lu, F Ni, H Wang, X Guo, L Zhu, Z Yang, R Song, L Cheng, Y Yang IEEE Transactions on Multimedia, 2023 | 9 | 2023 |
Zero-shot video grounding with pseudo query lookup and verification Y Lu, R Quan, L Zhu, Y Yang IEEE Transactions on Image Processing 33, 1643-1654, 2024 | 8 | 2024 |
Flowzero: Zero-shot text-to-video synthesis with llm-driven dynamic scene syntax Y Lu, L Zhu, H Fan, Y Yang arXiv preprint arXiv:2311.15813, 2023 | 8 | 2023 |
C-DLinkNet: considering multi-level semantic features for human parsing Y Lu, M Feng, M Wu, C Zhang arXiv preprint arXiv:2001.11690, 2020 | 3 | 2020 |
Freelong: Training-free long video generation with spectralblend temporal attention Y Lu, Y Liang, L Zhu, Y Yang arXiv preprint arXiv:2407.19918, 2024 | 2 | 2024 |
ECLIP: Efficient Contrastive Language-Image Pretraining via Ensemble Confidence Learning and Masked Language Modeling J Wang, H Wang, W Wu, J Deng, Y Lu, X Guo, D Zhang First Workshop on Pre-training: Perspectives, Pitfalls, and Paths Forward at …, 2022 | 1 | 2022 |