Lijuan Wang
Microsoft GenAI
Verified email at microsoft.com
Title
Cited by
Year
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
JG Xiujun Li, Xi Yin, Chunyuan Li, Pengchuan Zhang, Xiaowei Hu, Lei Zhang ...
European Conference on Computer Vision (ECCV), 2020
Cited by: 2150; Year: 2020
Large Scale Incremental Learning
YF Yue Wu, Yinpeng Chen, Lijuan Wang, Yuancheng Ye, Zicheng Liu, Yandong Guo
The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Cited by: 1471; Year: 2019
VinVL: Making Visual Representations Matter in Vision-Language Models
P Zhang, X Li, X Hu, J Yang, L Zhang, L Wang, Y Choi, J Gao
CVPR2021, 2021
Cited by: 1238; Year: 2021
Grounded language-image pre-training
LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 1061; Year: 2022
Florence: A new foundation model for computer vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
Cited by: 896; Year: 2021
End-to-End Human Pose and Mesh Reconstruction with Transformers
K Lin, L Wang, Z Liu
CVPR2021, 2020
Cited by: 742; Year: 2020
Rethinking Classification and Localization for Object Detection
YF Yue Wu, Yinpeng Chen, Lu Yuan, Zicheng Liu, Lijuan Wang, Hongzhi Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
Cited by: 724; Year: 2020
End-to-end semi-supervised object detection with soft teacher
M Xu, Z Zhang, H Hu, J Wang, L Wang, F Wei, X Bai, Z Liu
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
Cited by: 554; Year: 2021
Phi-3 technical report: A highly capable language model locally on your phone
M Abdin, J Aneja, H Awadalla, A Awadallah, AA Awan, N Bach, A Bahree, ...
arXiv preprint arXiv:2404.14219, 2024
Cited by: 552; Year: 2024
GIT: A generative image-to-text transformer for vision and language
J Wang, Z Yang, X Hu, L Li, K Lin, Z Gan, Z Liu, C Liu, L Wang
arXiv preprint arXiv:2205.14100, 2022
Cited by: 538; Year: 2022
The dawn of LMMs: Preliminary explorations with GPT-4V(ision)
Z Yang, L Li, K Lin, J Wang, CC Lin, Z Liu, L Wang
arXiv preprint arXiv:2309.17421, 2023
Cited by: 516; Year: 2023
Segment everything everywhere all at once
X Zou, J Yang, H Zhang, F Li, L Li, J Wang, L Wang, J Gao, YJ Lee
arXiv preprint arXiv:2304.06718, 2023
Cited by: 463; Year: 2023
Real-time Animation for an Expressive Avatar
N Xu, L Wang, FKP Soong, X Liang, Q Luo, YQ Xu, X Zou
US Patent App. 12/950,801, 2012
Cited by: 437; Year: 2012
MM-Vet: Evaluating large multimodal models for integrated capabilities
W Yu, Z Yang, L Li, J Wang, K Lin, Z Liu, X Wang, L Wang
arXiv preprint arXiv:2308.02490, 2023
Cited by: 416; Year: 2023
An empirical study of GPT-3 for few-shot knowledge-based VQA
Z Yang, Z Gan, J Wang, X Hu, Y Lu, Z Liu, L Wang
Proceedings of the AAAI conference on artificial intelligence 36 (3), 3081-3089, 2022
Cited by: 410; Year: 2022
An empirical study of training end-to-end vision-and-language transformers
ZY Dou, Y Xu, Z Gan, J Wang, S Wang, L Wang, C Zhu, P Zhang, L Yuan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 388; Year: 2022
Mesh Graphormer
K Lin, L Wang, Z Liu
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
Cited by: 355; Year: 2021
Refining of segmental boundaries in speech waveforms using contextual-dependent models
Y Zhao, M Chu, JL Zhou, L Wang
US Patent 7,496,512, 2009
Cited by: 341; Year: 2009
MM-REACT: Prompting ChatGPT for multimodal reasoning and action
Z Yang, L Li, J Wang, K Lin, E Azarnasab, F Ahmed, Z Liu, C Liu, M Zeng, ...
arXiv preprint arXiv:2303.11381, 2023
Cited by: 319; Year: 2023
Scaling up vision-language pre-training for image captioning
X Hu, Z Gan, J Wang, Z Yang, Z Liu, Y Lu, L Wang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Cited by: 293; Year: 2022