Xize Cheng（成曦泽）

Cited by

	All	Since 2019
Citations	65	65
h-index	5	5
i10-index	1	1

Citations per year: 2023: 50, 2024: 15

Public access (based on funding mandates)

2 articles available
0 articles not available

Co-authors

Zhou Zhao - Zhejiang University - Verified email at zju.edu.cn
Linjun Li - Zhejiang University - Verified email at zju.edu.cn
Wang Lin - Zhejiang University - Verified email at zju.edu.cn
Rongjie Huang - Zhejiang University - Verified email at zju.edu.cn
Ye Wang - Zhejiang University - Verified email at zju.edu.cn
Zehan Wang - Zhejiang University - Verified email at zju.edu.cn
Yi Ren (任意) - Research Scientist, Tiktok - Verified email at bytedance.com
Huadai Liu - Zhejiang University - Verified email at zju.edu.cn
Luping Liu (刘路平) - Zhejiang University - Verified email at zju.edu.cn

Xize Cheng（成曦泽）

Zhejiang University

Verified email at zju.edu.cn

Audio-Visual Speech Processing


Title	Authors	Venue	Cited by	Year
MixSpeech: Cross-modality self-learning with audio-visual stream mixup for visual speech translation and recognition	X Cheng, T Jin, R Huang, L Li, W Lin, Z Wang, Y Wang, H Liu, A Yin, ...	Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	12	2023
Connecting multi-modal contrastive representations	Z Wang, Y Zhao, H Huang, J Liu, A Yin, L Tang, L Li, Y Wang, Z Zhang, ...	Advances in Neural Information Processing Systems 36, 22099-22114, 2023	7	2023
OpenSR: Open-modality speech recognition via maintaining multi-modality alignment	X Cheng, T Jin, L Li, W Lin, X Duan, Z Zhao	arXiv preprint arXiv:2306.06410, 2023	6	2023
AV-TranSpeech: Audio-visual robust speech-to-speech translation	R Huang, H Liu, X Cheng, Y Ren, L Li, Z Ye, J He, L Zhang, J Liu, X Yin, ...	arXiv preprint arXiv:2305.15403, 2023	6	2023
Diffusion denoising process for perceptron bias in out-of-distribution detection	L Liu, Y Ren, X Cheng, R Huang, C Li, Z Zhao	arXiv preprint arXiv:2211.11255, 2022	6	2022
TAVT: Towards Transferable Audio-Visual Text Generation	W Lin, T Jin, W Pan, L Li, X Cheng, Y Wang, Z Zhao	Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023	5	2023
Distilling coarse-to-fine semantic matching knowledge for weakly supervised 3D visual grounding	Z Wang, H Huang, Y Zhao, L Li, X Cheng, Y Zhu, A Yin, Z Zhao	Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	5	2023
3DRP-Net: 3D relative position-aware network for 3D visual grounding	Z Wang, H Huang, Y Zhao, L Li, X Cheng, Y Zhu, A Yin, Z Zhao	arXiv preprint arXiv:2307.13363, 2023	4	2023
Contrastive token-wise meta-learning for unseen performer visual temporal-aligned translation	L Li, T Jin, X Cheng, Y Wang, W Lin, R Huang, Z Zhao	Findings of the Association for Computational Linguistics: ACL 2023, 10993-11007, 2023	4	2023
Weakly-supervised spoken video grounding via semantic interaction learning	Y Wang, W Lin, S Zhang, T Jin, L Li, X Cheng, Z Zhao	Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023	3	2023
Semantic-conditioned dual adaptation for cross-domain query-based visual segmentation	Y Wang, T Jin, W Lin, X Cheng, L Li, Z Zhao	Findings of the Association for Computational Linguistics: ACL 2023, 9797-9815, 2023	2	2023
Wav2SQL: Direct generalizable speech-to-SQL parsing	H Liu, R Huang, J He, G Sun, R Shen, X Cheng, Z Zhao	arXiv preprint arXiv:2305.12552, 2023	2	2023
Exploring group video captioning with efficient relational approximation	W Lin, T Jin, Y Wang, W Pan, L Li, X Cheng, Z Zhao	Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	2	2023
Rethinking missing modality learning from a decoding perspective	T Jin, X Cheng, L Li, W Lin, Y Wang, Z Zhao	Proceedings of the 31st ACM International Conference on Multimedia, 4431-4439, 2023	1	2023
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation	X Cheng, R Huang, L Li, T Jin, Z Wang, A Yin, M Li, X Duan, Z Zhao	arXiv preprint arXiv:2312.15197, 2023		2023
Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers	H Huang, Z Wang, R Huang, L Liu, X Cheng, Y Zhao, T Jin, Z Zhao	arXiv preprint arXiv:2312.08168, 2023		2023
Out-of-distribution Detection with Diffusion-based Neighborhood	L Liu, Y Ren, X Cheng, Z Zhao			2022
NaturalSigner: Diffusion Models are Natural Sign Language Generator	A Yin, J Xun, X Cheng, T Jin, S Zhang, Z Zhao, S Tang, F Wu
Listen to Motion: Robustly Learning Correlated Audio-Visual Representations	Z Wang, X Cheng, L Tang, L Liu, Y Zhao, T Jin, C Cai, W HongFa, W Liu, ...
