Follow
Shuai Bai
Shuai Bai
Qwen Team, Alibaba Group
Verified email at alibaba-inc.com
Title
Cited by
Cited by
Year
Qwen technical report
J Bai, S Bai, Y Chu, Z Cui, K Dang, X Deng, Y Fan, W Ge, Y Han, F Huang, ...
arXiv preprint arXiv:2309.16609, 2023
9972023
Ofa: Unifying architectures, tasks, and modalities through a simple sequence-to-sequence learning framework
P Wang, A Yang, R Men, J Lin, S Bai, Z Li, J Ma, C Zhou, J Zhou, H Yang
International conference on machine learning, 23318-23340, 2022
9932022
The sixth visual object tracking vot2018 challenge results
M Kristan, A Leonardis, J Matas, M Felsberg, R Pflugfelder, ...
Proceedings of the European conference on computer vision (ECCV) workshops, 0-0, 2018
9682018
Qwen-vl: A versatile vision-language model for understanding, localization, text reading, and beyond
J Bai, S Bai, S Yang, S Wang, S Tan, P Wang, J Lin, C Zhou, J Zhou
841*2023
The seventh visual object tracking VOT2019 challenge results
M Kristan, J Matas, A Leonardis, M Felsberg, R Pflugfelder, ...
Proceedings of the IEEE/CVF international conference on computer vision …, 2019
5632019
Dense Relation Distillation with Context-aware Aggregation for Few-Shot Object Detection
H Hu, S Bai, A Li, J Cui, L Wang
CVPR 2021, 2021
1992021
Adaptive Dilated Network With Self-Correction Supervision for Counting
S Bai, Z He, Y Qiao, H Hu, W Wu, J Yan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
1872020
Qwen2 technical report
A Yang, B Yang, B Hui, B Zheng, B Yu, C Zhou, C Li, C Li, D Liu, F Huang, ...
arXiv preprint arXiv:2407.10671, 2024
1352024
Class-wise Dynamic Graph Convolution for Semantic Segmentation
H Hu, D Ji, W Gan, S Bai, W Wu, J Yan
Proceedings of European Conference on Computer Vision (ECCV 2020), 2020
932020
One-peace: Exploring one general representation model toward unlimited modalities
P Wang, S Wang, J Lin, S Bai, X Zhou, J Zhou, X Wang, C Zhou
arXiv preprint arXiv:2305.11172, 2023
832023
Single stage virtual try-on via deformable attention flows
S Bai, H Zhou, Z Li, C Zhou, H Yang
European Conference on Computer Vision, 409-425, 2022
652022
Multi-hierarchical independent correlation filters for visual tracking
S Bai, Z He, Y Dong, H Bai
2020 IEEE international conference on multimedia and expo (ICME), 1-6, 2020
642020
Traffic anomaly detection via perspective map based on spatial-temporal information matrix.
S Bai, Z He, Y Lei, W Wu, C Zhu, M Sun, J Yan
CVPR Workshops, 117-124, 2019
532019
Multi-Camera Vehicle Tracking with Powerful Visual Features and Spatial-Temporal Cue.
Z He, Y Lei, S Bai, W Wu
CVPR Workshops 1, 2019
522019
Connecting language and vision for natural language-based vehicle retrieval
S Bai, Z Zheng, X Wang, J Lin, Z Zhang, C Zhou, H Yang, Y Yang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
302021
Touchstone: Evaluating vision-language models by language models
S Bai, S Yang, J Bai, P Wang, X Zhang, J Lin, X Wang, C Zhou, J Zhou
arXiv preprint arXiv:2308.16890, 2023
292023
Pretrained diffusion models for unified human motion synthesis
J Ma, S Bai, C Zhou
arXiv preprint arXiv:2212.02837, 2022
222022
An image is worth 1/2 tokens after layer 2: Plug-and-play inference acceleration for large vision-language models
L Chen, H Zhao, T Liu, S Bai, J Lin, C Zhou, B Chang
arXiv preprint arXiv:2403.06764, 2024
192024
Ofasys: A multi-modal multi-task learning system for building generalist models
J Bai, R Men, H Yang, X Ren, K Dang, Y Zhang, X Zhou, P Wang, S Tan, ...
arXiv preprint arXiv:2212.04408, 2022
112022
M6-fashion: High-fidelity multi-modal image generation and editing
Z Li, H Zhou, S Bai, P Li, C Zhou, H Yang
arXiv preprint arXiv:2205.11705, 2022
62022
The system can't perform the operation now. Try again later.
Articles 1–20