Shuai Bai

Cited by

	All	Since 2019
Citations	4610	4601
h-index	16	16
i10-index	16	16

2000

1000

500

1500

201920202021202220232024131 275 461 638 1160 1932

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Chang ZhouAlibaba Group (ericzhou.zc@alibaba-inc.com); Peking University (zhouchang@pku.edu.cn)Verified email at alibaba-inc.com
Junyang LinQwen Team, Alibaba Group & Peking UniversityVerified email at alibaba-inc.com
Jingren ZhouAlibaba Group, MicrosoftVerified email at alibaba-inc.com
Wei Wu（武伟）Sensetime Group LimitedVerified email at sensetime.com
Jianxin MaAlibaba Group; Tsinghua UniversityVerified email at alibaba-inc.com
Shijie WangAlibaba GroupVerified email at alibaba-inc.com
Sinan TanAlibaba Group; Tsinghua UniversityVerified email at tinytangent.com
Hanzhe HuPhD, Carnegie Mellon UniversityVerified email at cs.cmu.edu
Yu QiaoProfessor of Shanghai AI Laboratory; Shenzhen Institutes of Advanced Technology, CASVerified email at siat.ac.cn
Ming SunKuaishou TechVerified email at kuaishou.com
Zhedong ZhengUniversity of Macau | NUS | UTS | FudanVerified email at um.edu.mo
Hongxia YangByteDance, Alibaba Group, Yahoo!, IBM Watson
Peng WangAlibaba GroupVerified email at alibaba-inc.com

Shuai Bai

Alibaba group

Verified email at alibaba-inc.com

Multi-Modal Learning Visual Generation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
The sixth visual object tracking vot2018 challenge results M Kristan, A Leonardis, J Matas, M Felsberg, R Pflugfelder, ... Proceedings of the European conference on computer vision (ECCV) workshops, 0-0, 2018	943	2018
Ofa: Unifying architectures, tasks, and modalities through a simple sequence-to-sequence learning framework P Wang, A Yang, R Men, J Lin, S Bai, Z Li, J Ma, C Zhou, J Zhou, H Yang International conference on machine learning, 23318-23340, 2022	925	2022
Qwen technical report J Bai, S Bai, Y Chu, Z Cui, K Dang, X Deng, Y Fan, W Ge, Y Han, F Huang, ... arXiv preprint arXiv:2309.16609, 2023	722	2023
Qwen-vl: A versatile vision-language model for understanding, localization, text reading, and beyond J Bai, S Bai, S Yang, S Wang, S Tan, P Wang, J Lin, C Zhou, J Zhou	625*	2023
The seventh visual object tracking VOT2019 challenge results M Kristan, J Matas, A Leonardis, M Felsberg, R Pflugfelder, ... Proceedings of the IEEE/CVF international conference on computer vision …, 2019	545	2019
Dense Relation Distillation with Context-aware Aggregation for Few-Shot Object Detection H Hu, S Bai, A Li, J Cui, L Wang CVPR 2021, 2021	184	2021
Adaptive Dilated Network With Self-Correction Supervision for Counting S Bai, Z He, Y Qiao, H Hu, W Wu, J Yan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020	180	2020
Class-wise Dynamic Graph Convolution for Semantic Segmentation H Hu, D Ji, W Gan, S Bai, W Wu, J Yan Proceedings of European Conference on Computer Vision (ECCV 2020), 2020	90	2020
One-peace: Exploring one general representation model toward unlimited modalities P Wang, S Wang, J Lin, S Bai, X Zhou, J Zhou, X Wang, C Zhou arXiv preprint arXiv:2305.11172, 2023	71	2023
Multi-hierarchical independent correlation filters for visual tracking S Bai, Z He, Y Dong, H Bai 2020 IEEE international conference on multimedia and expo (ICME), 1-6, 2020	62	2020
Single stage virtual try-on via deformable attention flows S Bai, H Zhou, Z Li, C Zhou, H Yang European Conference on Computer Vision, 409-425, 2022	58	2022
Traffic anomaly detection via perspective map based on spatial-temporal information matrix. S Bai, Z He, Y Lei, W Wu, C Zhu, M Sun, J Yan CVPR Workshops, 117-124, 2019	52	2019
Multi-Camera Vehicle Tracking with Powerful Visual Features and Spatial-Temporal Cue. Z He, Y Lei, S Bai, W Wu CVPR Workshops 1, 2019	51	2019
Connecting language and vision for natural language-based vehicle retrieval S Bai, Z Zheng, X Wang, J Lin, Z Zhang, C Zhou, H Yang, Y Yang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	29	2021
Touchstone: Evaluating vision-language models by language models S Bai, S Yang, J Bai, P Wang, X Zhang, J Lin, X Wang, C Zhou, J Zhou arXiv preprint arXiv:2308.16890, 2023	24	2023
Pretrained diffusion models for unified human motion synthesis J Ma, S Bai, C Zhou arXiv preprint arXiv:2212.02837, 2022	21	2022
An image is worth 1/2 tokens after layer 2: Plug-and-play inference acceleration for large vision-language models L Chen, H Zhao, T Liu, S Bai, J Lin, C Zhou, B Chang arXiv preprint arXiv:2403.06764, 2024	9	2024
Ofasys: A multi-modal multi-task learning system for building generalist models J Bai, R Men, H Yang, X Ren, K Dang, Y Zhang, X Zhou, P Wang, S Tan, ... arXiv preprint arXiv:2212.04408, 2022	8	2022
M6-fashion: High-fidelity multi-modal image generation and editing Z Li, H Zhou, S Bai, P Li, C Zhou, H Yang arXiv preprint arXiv:2205.11705, 2022	6	2022
Qwen2 technical report A Yang, B Yang, B Hui, B Zheng, B Yu, C Zhou, C Li, C Li, D Liu, F Huang, ... arXiv preprint arXiv:2407.10671, 2024	2	2024

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors