Zhe Chen (陈喆)

Cited by

	All	Since 2019
Citations	1120	1120
h-index	11	11
i10-index	11	11

Citations per year: 2021: 5; 2022: 37; 2023: 666; 2024: 410

Public access

6 articles available, 0 articles not available (based on funding mandates)

Co-authors

Wenhai Wang (王文海) - CUHK | Shanghai AI Laboratory | NJU - Verified email at cuhk.edu.hk
Yu Qiao - Professor of Shanghai AI Laboratory; Shenzhen Institutes of Advanced Technology, CAS - Verified email at siat.ac.cn
Jifeng Dai - Associate Professor of EE, Tsinghua University; Adjunct Researcher of Shanghai AI Laboratory - Verified email at tsinghua.edu.cn
Xizhou Zhu - Tsinghua University - Verified email at tsinghua.edu.cn
Ping Luo (羅平) - Associate Professor, The University of Hong Kong - Verified email at hku.hk
Lewei Lu - Research Director (We're Hiring, luotto@sensetime.com) @ SenseTime Research - Verified email at sensetime.com
Enze Xie - NVIDIA, HKU - Verified email at connect.hku.hk
Zhenhang Huang - Shanghai AI Lab - Verified email at pjlab.org.cn
Yuchen Duan (段雨辰) - Ph.D Student, The Chinese University of Hong Kong - Verified email at link.cuhk.edu.hk
Jiannan Wu - The University of Hong Kong - Verified email at connect.hku.hk
Zhiqi Li - PhD candidate, Nanjing University - Verified email at smail.nju.edu.cn
Guo Chen - Nanjing University - Verified email at smail.nju.edu.cn
Weiyun Wang - Shanghai AI Laboratory; Fudan University - Verified email at pjlab.org.cn
Yuanfeng Ji - The University of Hong Kong - Verified email at connect.hku.hk
Tao Wang - Nanjing University - Verified email at smail.nju.edu.cn
Kai Chen - Hong Kong University of Science and Technology - Verified email at connect.ust.hk
Tong Lu - Nanjing University

Zhe Chen (陈喆)

PhD candidate, Nanjing University

Verified email at smail.nju.edu.cn

Computer Vision, Foundation Model


Title	Cited by	Year
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions - W Wang, J Dai, Z Chen, Z Huang, Z Li, X Zhu, X Hu, T Lu, L Lu, H Li, ... - IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 14408 …, 2023	356	2023
Vision Transformer Adapter for Dense Predictions - Z Chen, Y Duan, W Wang, J He, T Lu, J Dai, Y Qiao - International Conference on Learning Representations (ICLR), 2022	352	2022
VisionLLM: Large Language Model is Also an Open-Ended Decoder for Vision-Centric Tasks - W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ... - Advances in Neural Information Processing Systems (NeurIPS), 2023	160	2023
InternGPT: Solving vision-centric tasks by interacting with chatbots beyond language - Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Y Yang, Q Li, ... - arXiv preprint arXiv:2305.05662, 2023	53	2023
DDP: Diffusion Model for Dense Visual Prediction - Y Ji, Z Chen, E Xie, L Hong, X Liu, Z Liu, T Lu, Z Li, P Luo - IEEE/CVF International Conference on Computer Vision (ICCV), 2023	40	2023
InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges - G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ... - Technical Report of Ego4D Challenge 2022 @ ECCV, 2022	29	2022
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World - W Wang, M Shi, Q Li, W Wang, Z Huang, L Xing, Z Chen, H Li, X Zhu, ... - The Twelfth International Conference on Learning Representations (ICLR), 2023	28	2023
Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization - Z Chen, W Wang, E Xie, T Lu, P Luo - Proceedings of the AAAI Conference on Artificial Intelligence 36 (1), 393-400, 2022	19	2022
GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation - K Chen, E Xie, Z Chen, L Hong, Z Li, DY Yeung - The Twelfth International Conference on Learning Representations (ICLR), 2023	17*	2023
FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation - Z Chen, J Wang, W Wang, G Chen, E Xie, P Luo, T Lu - arXiv preprint arXiv:2111.02394, 2021	13	2021
InternVL: Scaling up vision foundation models and aligning for generic visual-linguistic tasks - Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, Z Muyan, Q Zhang, X Zhu, ... - IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Oral, 2024	12	2024
AVSegFormer: Audio-Visual Segmentation with Transformer - S Gao, Z Chen, G Chen, W Wang, T Lu - AAAI Conference on Artificial Intelligence (AAAI), 2023	9	2023
Video Mamba Suite: State space model as a versatile alternative for video understanding - G Chen, Y Huang, J Xu, B Pei, Z Chen, Z Li, J Wang, K Li, T Lu, L Wang - arXiv preprint arXiv:2403.09626, 2024	6	2024
Graph Propagation Transformer for Graph Representation Learning - Z Chen, H Tan, T Wang, T Shen, T Lu, Q Peng, C Cheng, Y Qi - The 32nd International Joint Conference on Artificial Intelligence (IJCAI), 2023	6	2023
SiameseCCR: A Novel Method for One‐Shot and Few‐Shot Chinese CAPTCHA Recognition using Deep Siamese Network - Z Chen, W Ma, N Xu, C Ji, Y Zhang - IET Image Processing 14 (12), 2855-2859, 2020	5	2020
MM-Interleaved: Interleaved image-text generative modeling via multi-modal feature synchronizer - C Tian, X Zhu, Y Xiong, W Wang, Z Chen, W Wang, Y Chen, L Lu, T Lu, ... - arXiv preprint arXiv:2401.10208, 2024	4	2024
Block Shuffle: A Method for High-Resolution Fast Style Transfer with Limited Memory - W Ma, Z Chen, C Ji - IEEE Access 8, 158056-158066, 2020	4	2020
Vision-RWKV: Efficient and scalable visual perception with RWKV-like architectures - Y Duan, W Wang, Z Chen, X Zhu, L Lu, T Lu, Y Qiao, H Li, J Dai, W Wang - arXiv preprint arXiv:2403.02308, 2024	3	2024
Champion Solution for the WSDM2023 Toloka VQA Challenge - S Gao, Z Chen, G Chen, W Wang, T Lu - Technical Report of WSDM Cup 2023 @ WSDM, 2023	2	2023
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD - X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ... - arXiv preprint arXiv:2404.06512, 2024	1	2024
