Follow
Chen Zhang
Chen Zhang
Research Scientist, ByteDance
Verified email at zju.edu.cn - Homepage
Title
Cited by
Cited by
Year
Discriminative and Correlative Partial Multi-Label Learning.
H Wang, W Liu, Y Zhao, C Zhang, T Hu, G Chen
IJCAI, 3691-3697, 2019
822019
SimulSpeech: End-to-end simultaneous speech to text translation
Y Ren, J Liu, X Tan, C Zhang, T Qin, Z Zhao, TY Liu
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
622020
Uwspeech: Speech to speech translation for unwritten languages
C Zhang, X Tan, Y Ren, T Qin, K Zhang, TY Liu
AAAI 2021, 2020
432020
Denoispeech: Denoising text to speech with frame-level noise modeling
C Zhang, Y Ren, X Tan, J Liu, K Zhang, T Qin, S Zhao, TY Liu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
422021
Task-level curriculum learning for non-autoregressive neural machine translation
J Liu, Y Ren, X Tan, C Zhang, T Qin, Z Zhao, TY Liu
IJCAI 2020, 2020
342020
S3T: Self-Supervised Pre-training with Swin Transformer for Music Classification
H Zhao, C Zhang, B Zhu, Z Ma, K Zhang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
322022
TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method
Z Ju, P Lu, X Tan, R Wang, C Zhang, S Wu, K Zhang, X Li, T Qin, TY Liu
EMNLP 2022, 2022
292022
Mega-tts: Zero-shot text-to-speech at scale with intrinsic inductive bias
Z Jiang, Y Ren, Z Ye, J Liu, C Zhang, Q Yang, S Ji, R Huang, C Wang, ...
arXiv preprint arXiv:2306.03509, 2023
202023
PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription
C Zhang, J Yu, LC Chang, X Tan, J Chen, T Qin, K Zhang
ISMIR 2022, 2021
162021
FastLR: Non-autoregressive lipreading model with integrate-and-fire
J Liu, Y Ren, Z Zhao, C Zhang, B Huai, J Yuan
Proceedings of the 28th ACM International Conference on Multimedia, 4328-4336, 2020
132020
Make-an-audio 2: Temporal-enhanced text-to-audio generation
J Huang, Y Ren, R Huang, D Yang, Z Ye, C Zhang, J Liu, X Yin, Z Ma, ...
arXiv preprint arXiv:2305.18474, 2023
112023
Automatic Song Translation for Tonal Languages
F Guo, C Zhang, Z Zhang, Q He, K Zhang, J Xie, J Boyd-Graber
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
102022
Mega-tts 2: Zero-shot text-to-speech with arbitrary length speech prompts
Z Jiang, J Liu, Y Ren, J He, C Zhang, Z Ye, P Wei, C Wang, X Yin, Z Ma, ...
arXiv preprint arXiv:2307.07218, 2023
92023
Relyme: Improving lyric-to-melody generation by incorporating lyric-melody relationships
C Zhang, L Chang, S Wu, X Tan, T Qin, TY Liu, K Zhang
Proceedings of the 30th ACM International Conference on Multimedia, 1047-1056, 2022
82022
SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation
C Zhang, Y Ren, K Zhang, S Yan
IEEE Transactions on MultiMedia, 2023
62023
Towards Effective Multi-Modal Interchanges in Zero-Resource Sounding Object Localization
Y Zhao, C Zhang, H Huang, H Li, Z Zhao
Advances in Neural Information Processing Systems, 2022
52022
Songdriver: Real-time music accompaniment generation without logical latency nor exposure bias
Z Wang, K Zhang, Y Wang, C Zhang, Q Liang, P Yu, Y Feng, W Liu, ...
Proceedings of the 30th ACM International Conference on Multimedia, 1057-1067, 2022
42022
Real3d-portrait: One-shot realistic 3d talking portrait synthesis
Z Ye, T Zhong, Y Ren, J Yang, W Li, J Huang, Z Jiang, J He, R Huang, ...
arXiv preprint arXiv:2401.08503, 2024
22024
Bag of tricks for unsupervised text-to-speech
Y Ren, C Zhang, YAN Shuicheng
The Eleventh International Conference on Learning Representations, 2022
22022
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis
Z Ye, Z Jiang, Y Ren, J Liu, C Zhang, X Yin, Z Ma, Z Zhao
ICML 2023 Workshop, 2023
12023
The system can't perform the operation now. Try again later.
Articles 1–20