Qwen technical report J Bai, S Bai, Y Chu, Z Cui, K Dang, X Deng, Y Fan, W Ge, Y Han, F Huang, ... arXiv preprint arXiv:2309.16609, 2023 | 1429 | 2023 |
Qwen2 technical report A Yang, B Yang, B Hui, B Zheng, B Yu, C Zhou, C Li, C Li, D Liu, F Huang, ... arXiv preprint arXiv:2407.10671, 2024 | 450 | 2024 |
Towards Knowledge-Based Recommender Dialog System Q Chen, J Lin, Y Zhang, M Ding, Y Cen, H Yang, J Tang arXiv preprint arXiv:1908.05391, 2019 | 268 | 2019 |
M6: A chinese multimodal pretrainer J Lin, R Men, A Yang, C Zhou, M Ding, Y Zhang, P Wang, A Wang, ... arXiv preprint arXiv:2103.00823, 2021 | 143 | 2021 |
Chinese CLIP: Contrastive Vision-Language Pretraining in Chinese A Yang, J Pan, J Lin, R Men, Y Zhang, J Zhou, C Zhou arXiv preprint arXiv:2211.01335, 2022 | 114 | 2022 |
Towards Knowledge-Based Personalized Product Description Generation in E-commerce Q Chen, J Lin, Y Zhang, H Yang, J Zhou, J Tang Proceedings of the 25th ACM SIGKDD International Conference on Knowledge …, 2019 | 98 | 2019 |
Interbert: Vision-and-language interaction for multi-modal pretraining J Lin, A Yang, Y Zhang, J Liu, J Zhou, H Yang arXiv preprint arXiv:2003.13198, 2020 | 94 | 2020 |
Meta-KD: A Meta Knowledge Distillation Framework for Language Model Compression across Domains H Pan, C Wang, M Qiu, Y Zhang, Y Li, J Huang arXiv preprint arXiv:2012.01266, 2020 | 46 | 2020 |
M6: Multi-modality-to-multi-modality multitask mega-transformer for unified pretraining J Lin, R Men, A Yang, C Zhou, Y Zhang, P Wang, J Zhou, J Tang, H Yang Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data …, 2021 | 36 | 2021 |
Qwen2. 5-coder technical report B Hui, J Yang, Z Cui, J Yang, D Liu, L Zhang, T Liu, J Zhang, B Yu, K Lu, ... arXiv preprint arXiv:2409.12186, 2024 | 27 | 2024 |
Qwen technical report. 2023 J Bai, S Bai, Y Chu, Z Cui, K Dang, X Deng, Y Fan, W Ge, Y Han, F Huang, ... arXiv preprint cs.CL/2309.16609, 2023 | 20 | 2023 |
Mining activation force defined dependency patterns for relation extraction C Zhang, Y Zhang, W Xu, Z Ma, Y Leng, J Guo Knowledge-Based Systems 86, 278-287, 2015 | 20 | 2015 |
Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation P Wang, J Lin, A Yang, C Zhou, Y Zhang, J Zhou, H Yang arXiv preprint arXiv:2105.14778, 2021 | 19 | 2021 |
OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models J Bai, R Men, H Yang, X Ren, K Dang, Y Zhang, X Zhou, P Wang, S Tan, ... arXiv preprint arXiv:2212.04408, 2022 | 14 | 2022 |
Graph-based multi-hop reasoning for long text generation L Zhao, J Xu, J Lin, Y Zhang, H Yang, X Sun arXiv preprint arXiv:2009.13282, 2020 | 13 | 2020 |
PRIS at Knowledge Base Population 2013. Y Li, Y Zhang, D Li, X Tong, J Wang, N Zuo, Y Wang, W Xu, G Chen, ... TAC, 2013 | 12 | 2013 |
Transferring General Multimodal Pretrained Models to Text Recognition J Lin, X Ren, Y Zhang, G Liu, P Wang, A Yang, C Zhou arXiv preprint arXiv:2212.09297, 2022 | 5 | 2022 |
Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones? Z Yang, Y Zhang, T Liu, J Yang, J Lin, C Zhou, Z Sui arXiv preprint arXiv:2406.12809, 2024 | 4 | 2024 |
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models B Gao, F Song, Z Yang, Z Cai, Y Miao, Q Dong, L Li, C Ma, L Chen, R Xu, ... arXiv preprint arXiv:2410.07985, 2024 | 2 | 2024 |
Language Models can Self-Lengthen to Generate Long Texts S Quan, T Tang, B Yu, A Yang, D Liu, B Gao, J Tu, Y Zhang, J Zhou, J Lin arXiv preprint arXiv:2410.23933, 2024 | | 2024 |