GiraffeDet: A Heavy-Neck Paradigm for Object Detection Y Jiang, Z Tan, J Wang, X Sun, M Lin, H Li International Conference on Learning Representations (ICLR 2022), 2022 | 124* | 2022 |
Entroformer: A Transformer-based Entropy Model for Learned Image Compression Y Qian, M Lin, X Sun, Z Tan, R Jin International Conference on Learning Representations (ICLR 2022), 2022 | 110 | 2022 |
Learning Accurate Entropy Model with Global Reference for Image Compression Y Qian, Z Tan, X Sun, M Lin, D Li, Z Sun, H Li, R Jin ICLR 2021, 2020 | 65 | 2020 |
Learning to rank proposals for object detection Z Tan, X Nie, Q Qian, N Li, H Li Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019 | 60 | 2019 |
Image co-saliency detection by propagating superpixel affinities Z Tan, L Wan, W Feng, CM Pun 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 45 | 2013 |
Mae-det: Revisiting maximum entropy principle in zero-shot nas for efficient object detection Z Sun, M Lin, X Sun, Z Tan, H Li, R Jin International Conference on Machine Learning (ICML 2022), 20810-20826, 2021 | 33 | 2021 |
Interpolation variable rate image compression Z Sun, Z Tan, X Sun, F Zhang, Y Qian, D Li, H Li Proceedings of the 29th ACM International Conference on Multimedia, 5574-5582, 2021 | 16 | 2021 |
Spatiotemporal entropy model is all you need for learned video compression Z Sun, Z Tan, X Sun, F Zhang, D Li, Y Qian, H Li arXiv preprint arXiv:2104.06083, 2021 | 15 | 2021 |
Ovo: Open-vocabulary occupancy Z Tan, Z Dong, C Zhang, W Zhang, H Ji, H Li arXiv preprint arXiv:2305.16133, 2023 | 11 | 2023 |
Revisiting efficient object detection backbones from zero-shot neural architecture search Z Sun, M Lin, X Sun, Z Tan, R Jin CoRR, abs/2111.13336, 2021 | 4 | 2021 |
Jmpnet: Joint motion prediction for learning-based video compression D Li, Z Sun, Z Tan, X Sun, F Zhang, Y Qian, H Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 2 | 2022 |
ZenDet: Revisiting Efficient Object Detection Backbones from Zero-Shot Neural Architecture Search Z Sun, M Lin, Z Tan, X Sun, R Jin | 2 | 2021 |
Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation Junyan Wang, Zhenhong Sun, Zhiyu Tan, Xuanbai Chen, Weihua Chen, Hao Li ... IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024), 2024 | 1* | 2024 |
VidGen-1M: A Large-Scale Dataset for Text-to-video Generation Z Tan, X Yang, L Qin, H Li arXiv preprint arXiv:2408.02629, 2024 | | 2024 |
EvalAlign: Evaluating Text-to-Image Models through Precision Alignment of Multimodal Large Models with Supervised Fine-Tuning to Human Annotations Z Tan, X Yang, L Qin, M Yang, C Zhang, H Li arXiv preprint arXiv:2406.16562, 2024 | | 2024 |
An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation Z Tan, M Yang, L Qin, H Yang, Y Qian, Q Zhou, C Zhang, H Li European Conference on Computer Vision (ECCV), 2024 | | 2024 |
Supplementary Materials: Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation J Wang, Z Sun, Z Tan, X Chen, W Chen, H Li, C Zhang, Y Song | | |