Shaojin Ding

Cited by

	All	Since 2019
Citations	1128	1122
h-index	13	13
i10-index	14	14

Citations per year

Year	Citations
2019	14
2020	115
2021	249
2022	255
2023	315
2024	172

Public access

Available	9 articles
Not available	0 articles

Based on funding mandates

Co-authors

Ricardo Gutierrez-Osuna	Texas A&M University, Computer Science and Engineering	Verified email at tamu.edu
Tianlong Chen	Assistant Professor, CS@UNC Chapel Hill; PostDoc, CSAIL@MIT+BMI@Harvard; Ph.D., ECE@UT Austin	Verified email at cs.unc.edu
Guanlong Zhao	Google	Verified email at google.com
Yanzhang He	Google Inc.	Verified email at google.com
Zhangyang (Atlas) Wang	XTX Markets & University of Texas at Austin	Verified email at utexas.edu
Christopher Liberatore	Air Force Research Lab	Verified email at afrl.af.mil
Quan Wang	Senior Staff Software Engineer @ Google; Instructor @ Udemy; Textbook Author; IEEE Senior Member	Verified email at google.com
Shuo-yiin Chang	Senior Staff Research Scientist, Google DeepMind	Verified email at google.com
Tara Sainath	Principal Research Scientist, Google	Verified email at google.com
Rybakov Oleg	Google	Verified email at amazon.com
Weiran Wang	Google	Verified email at ttic.edu
Li Wan	Amazon AWS	Verified email at amazon.com
Ignacio Lopez Moreno	Google Inc	Verified email at google.com
John Levis	Iowa State University	Verified email at iastate.edu
Xinyu Gong	The University of Texas at Austin	Verified email at utexas.edu

Shaojin Ding

Google

Verified email at google.com

Speech Recognition, Large Language Model, Speech Synthesis, Model Compression, AutoML


Title	Cited by	Year
Abd-net: Attentive but diverse person re-identification. T Chen, S Ding, J Xie, Y Yuan, W Chen, Y Yang, Z Ren, Z Wang. Proceedings of the IEEE/CVF international conference on computer vision …, 2019	597	2019
Personal VAD: Speaker-conditioned voice activity detection. S Ding, Q Wang, S Chang, L Wan, IL Moreno. arXiv preprint arXiv:1908.04284, 2019	87	2019
Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion. S Ding, R Gutierrez-Osuna. Interspeech, 724-728, 2019	59	2019
Golden speaker builder–An interactive tool for pronunciation training. S Ding, C Liberatore, S Sonsaat, I Lučić, A Silpachai, G Zhao, ... Speech Communication 115, 51-66, 2019	57	2019
Autospeech: Neural architecture search for speaker recognition. S Ding, T Chen, X Gong, W Zha, Z Wang. arXiv preprint arXiv:2005.03215, 2020	56	2020
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams. G Zhao, S Ding, R Gutierrez-Osuna. Interspeech, 2843-2847, 2019	56	2019
Audio lottery: Speech recognition made ultra-lightweight, noise-robust, and transferable. S Ding, T Chen, Z Wang. International Conference on Learning Representations, 2022	28	2022
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning. S Ding, G Zhao, R Gutierrez-Osuna. Computer Speech & Language 72, 101302, 2022	24	2022
4-bit conformer with native quantization aware training for speech recognition. S Ding, P Meadowlark, Y He, L Lew, S Agrawal, O Rybakov. arXiv preprint arXiv:2203.15952, 2022	23	2022
Converting foreign accent speech without a reference. G Zhao, S Ding, R Gutierrez-Osuna. IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2367-2381, 2021	22	2021
Personal VAD 2.0: Optimizing personal voice activity detection for on-device speech recognition. S Ding, R Rikhye, Q Liang, Y He, Q Wang, A Narayanan, T O'Malley, ... arXiv preprint arXiv:2204.03793, 2022	20	2022
Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition. S Ding, G Zhao, R Gutierrez-Osuna. INTERSPEECH, 776-780, 2020	18	2020
A unified cascaded encoder asr model for dynamic model sizes. S Ding, W Wang, D Zhao, TN Sainath, Y He, R David, R Botros, X Wang, ... arXiv preprint arXiv:2204.06164, 2022	14	2022
Learning structured sparse representations for voice conversion. S Ding, G Zhao, C Liberatore, R Gutierrez-Osuna. IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 343-354, 2019	13	2019
2-bit conformer quantization for automatic speech recognition. O Rybakov, P Meadowlark, S Ding, D Qiu, J Li, D Rim, Y He. arXiv preprint arXiv:2305.16619, 2023	7	2023
Textual echo cancellation. S Ding, Y Jia, K Hu, Q Wang. 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	7	2021
Learning Structured Dictionaries for Exemplar-based Voice Conversion. S Ding, C Liberatore, R Gutierrez-Osuna. INTERSPEECH, 481-485, 2018	6	2018
Golden Speaker Builder: an interactive online tool for L2 learners to build pronunciation models. S Ding, C Liberatore, G Zhao, S Sonsaat, E Chukharev-Hudilainen, ... Pronunciation in Second Language Learning & Teaching (PSLLT) 9th Annual …, 2017	6	2017
Towards lifelong learning of multilingual text-to-speech synthesis. M Yang, S Ding, T Chen, T Wang, Z Wang. ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	5	2022
Sharing low rank conformer weights for tiny always-on ambient speech recognition models. SM Hernandez, D Zhao, S Ding, A Bruguier, R Prabhavalkar, TN Sainath, ... ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	4	2023
