Qwen technical report J Bai, S Bai, Y Chu, Z Cui, K Dang, X Deng, Y Fan, W Ge, Y Han, F Huang, ... arXiv preprint arXiv:2309.16609, 2023 | 1428 | 2023 |
Unifying architectures, tasks, and modalities through a simple sequence-to-sequence learning framework P Wang, A Yang, R Men, J Lin, S Bai, Z Li, J Ma, C Zhou, J Zhou, H Yang ICML 2022, 2022 | 1068 | 2022 |
Qwen2 technical report A Yang, B Yang, B Hui, B Zheng, B Yu, C Zhou, C Li, C Li, D Liu, F Huang, ... arXiv preprint arXiv:2407.10671, 2024 | 450 | 2024 |
Enhancing pre-trained language representations with rich knowledge for machine reading comprehension A Yang, Q Wang, J Liu, K Liu, Y Lyu, H Wu, Q She, S Li ACL 2019 (Long Paper), 2019 | 191 | 2019 |
M6: A chinese multimodal pretrainer A Yang, J Lin, R Men, C Zhou, M Ding, Y Zhang, P Wang, A Wang, ... | 143* | 2021 |
Chinese clip: Contrastive vision-language pretraining in chinese A Yang, J Pan, J Lin, R Men, Y Zhang, J Zhou, C Zhou arXiv preprint arXiv:2211.01335, 2022 | 114 | 2022 |
Interbert: Vision-and-language interaction for multi-modal pretraining J Lin, A Yang, Y Zhang, J Liu, J Zhou, H Yang arXiv preprint arXiv:2003.13198, 2020 | 94 | 2020 |
Expertprompting: Instructing large language models to be distinguished experts B Xu, A Yang, J Lin, Q Wang, C Zhou, Y Zhang, Z Mao arXiv preprint arXiv:2305.14688, 2023 | 90 | 2023 |
M6-t: Exploring sparse expert models and beyond A Yang, J Lin, R Men, C Zhou, L Jiang, X Jia, A Wang, J Zhang, J Wang, ... arXiv preprint arXiv:2105.15082, 2021 | 62 | 2021 |
SciDTB: Discourse dependency treebank for scientific abstracts A Yang, S Li ACL 2018 (Short Paper), 2018 | 53 | 2018 |
Machine reading comprehension: a literature review X Zhang, A Yang, S Li, Y Wang arXiv preprint arXiv:1907.01686, 2019 | 47 | 2019 |
A Robust Adversarial Training Approach to Machine Reading Comprehension K Liu, X Liu, A Yang, J Liu, J Su, S Li, Q She AAAI 2020, 2020 | 46 | 2020 |
Adaptations of ROUGE and BLEU to Better Evaluate Machine Reading Comprehension Task A Yang, K Liu, J Liu, Y Lyu, S Li MRQA Workshop@ACL 2018, 2018 | 43 | 2018 |
Prompt Tuning for Generative Multimodal Pretrained Models H Yang, J Lin, A Yang, P Wang, C Zhou, H Yang ACL 2023 (Findings), 2022 | 41 | 2022 |
M6-10t: A sharing-delinking paradigm for efficient multi-trillion parameter pretraining J Lin, A Yang, J Bai, C Zhou, L Jiang, X Jia, A Wang, J Zhang, Y Li, W Lin, ... arXiv preprint arXiv:2110.03888, 2021 | 37 | 2021 |
M6: Multi-Modality-to-Multi-Modality Multitask Mega-transformer for Unified Pretraining A Yang, J Lin, R Men, C Zhou, Y Zhang, P Wang, J Zhou, J Tang, H Yang KDD 2021, 2021 | 36 | 2021 |
Learning Relation Alignment for Calibrated Cross-modal Retrieval S Ren, J Lin, G Zhao, R Men, A Yang, J Zhou, X Sun, H Yang ACL 2021 (Long Paper), 2021 | 32 | 2021 |
Qwen2.5-Coder Technical Report B Hui, J Yang, Z Cui, J Yang, D Liu, L Zhang, T Liu, J Zhang, B Yu, ... arXiv preprint arXiv:2409.12186, 2024 | 27 | 2024 |
Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation P Wang, J Lin, A Yang, C Zhou, Y Zhang, J Zhou, H Yang ACL 2021 (Findings), 2021 | 19 | 2021 |
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement A Yang, B Zhang, B Hui, B Gao, B Yu, C Li, D Liu, J Tu, J Zhou, J Lin, K Lu, ... arXiv preprint arXiv:2409.12122, 2024 | 14 | 2024 |