Mega-tts: Zero-shot text-to-speech at scale with intrinsic inductive bias Z Jiang, Y Ren, Z Ye, J Liu, C Zhang, Q Yang, S Ji, R Huang, C Wang, ... arXiv preprint arXiv:2306.03509, 2023 | 22 | 2023 |
Mega-tts 2: Zero-shot text-to-speech with arbitrary length speech prompts Z Jiang, J Liu, Y Ren, J He, Z Ye, S Ji, Q Yang, C Zhang, P Wei, C Wang, ... ICLR 2024, 2023 | 9 | 2023 |
TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models S Ji, J Zuo, M Fang, Z Jiang, F Chen, X Duan, B Huai, Z Zhao ICASSP 2024, 2023 | 5 | 2023 |
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech S Ji, Z Jiang, H Wang, J Zuo, Z Zhao ACL 2024 Main, 2024 | 2 | 2024 |
Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis Z Jiang, J Liu, Y Ren, J He, Z Ye, S Ji, Q Yang, C Zhang, P Wei, C Wang, ... ICLR 2024, 2023 | 1 | 2023 |
Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment H Huang, Y Xia, S Ji, S Wang, H Wang, J Zhu, Z Dong, Z Zhao arXiv preprint arXiv:2403.05168, 2024 | | 2024 |
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models S Ji, M Fang, Z Jiang, R Huang, J Zuo, S Wang, Z Zhao arXiv preprint arXiv:2402.12208, 2024 | | 2024 |
Generating Neural Networks for Diverse Networking Classification Tasks via Hardware-Aware Neural Architecture Search G Xie, Q Li, Z Shi, H Fang, S Ji, Y Jiang, Z Yuan, L Ma, M Xu IEEE Transactions on Computers, 2023 | | 2023 |