Dan Su

Hivatkozott rá

	Összes	2019 óta
Hivatkozások	3034	2979
h-index	31	31
i10-index	66	65

880

440

220

660

20182019202020212022202320249 125 245 508 657 879 550

Nyilvános hozzáférés

Összes megtekintése

26 cikk

4 cikk

elérhető

nem érhető el

Finanszírozási megbízások alapján

Társszerzők

Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowE-mail megerősítve itt: global.tencent.com
Meng YUTencent AI LabE-mail megerősítve itt: tencent.com
Jun WangPeking UniversityE-mail megerősítve itt: tencent.com
Lianwu CHENKuaishou TechnologyE-mail megerősítve itt: kuaishou.com
Shiyin KangXVerse Inc.E-mail megerősítve itt: xverse.cn
Zhiyong WU (吴志勇)Associate Professor, Tsinghua UniversityE-mail megerősítve itt: sz.tsinghua.edu.cn
Lei XieNorthwestern Polytechnical UniversityE-mail megerősítve itt: nwpu.edu.cn
Xunying LiuChinese University of Hong KongE-mail megerősítve itt: se.cuhk.edu.hk
Guangsen WangTencent AI LabE-mail megerősítve itt: tencent.com
Shan YangTencent AI LabE-mail megerősítve itt: nwpu-aslp.org
Yong XuPrincipal Researcher, Tencent America, Bellevue, USAE-mail megerősítve itt: tencent.com
Shi-Xiong (Austin) ZhangSr. Director | AI Foundations@Capital One | ex-Microsoft, ex-Tencent, Cambridge PhDE-mail megerősítve itt: capitalone.com
Rongzhi GuTencent AI LabE-mail megerősítve itt: pku.edu.cn
Yuexian ZouPeking University Shenzhen Graduate SchoolE-mail megerősítve itt: pku.edu.cn
Jia CuiTencentE-mail megerősítve itt: tencent.com
Xihong WuPeking UniversityE-mail megerősítve itt: cis.pku.edu.cn
Chao Weng
Songxiang LiuPhD. from CUHK
Max W. Y. LamIndependent Researcher

Követés

Dan Su

Tencent AI Lab

E-mail megerősítve itt: tencent.com

speech recognition speech synthesis speaker recognition


Cím Rendezés hivatkozások szerint Rendezés év szerint Rendezés cím szerint	Hivatkozott rá Hivatkozott rá	Év
Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ... arXiv preprint arXiv:2106.06909, 2021	189	2021
Replay and synthetic speech detection with res2net architecture X Li, N Li, C Weng, X Liu, D Su, D Yu, H Meng ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021	150	2021
Fastdiff: A fast conditional diffusion model for high-quality speech synthesis R Huang, MWY Lam, J Wang, D Su, D Yu, Y Ren, Z Zhao arXiv preprint arXiv:2204.09934, 2022	134	2022
DurIAN: Duration Informed Attention Network for Speech Synthesis. C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ... Interspeech, 2027-2031, 2020	104	2020
Deep extractor network for target speaker recovery from single channel speech mixtures J Wang, J Chen, D Su, L Chen, M Yu, Y Qian, D Yu arXiv preprint arXiv:1807.08974, 2018	103	2018
Durian: Duration informed attention network for multimodal synthesis C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ... arXiv preprint arXiv:1909.01700, 2019	99	2019
Component fusion: Learning replaceable language model component for end-to-end speech recognition system C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	99	2019
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information. R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu Interspeech, 4290-4294, 2019	95	2019
Investigating end-to-end speech recognition for mandarin-english code-switching C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	85	2019
BDDM: Bilateral denoising diffusion models for fast and high-quality speech synthesis MWY Lam, J Wang, D Su, D Yu arXiv preprint arXiv:2203.13508, 2022	84	2022
End-to-end multi-channel speech separation R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu arXiv preprint arXiv:1905.06286, 2019	80	2019
Deep Discriminative Embeddings for Duration Robust Speaker Verification. N Li, D Tuo, D Su, Z Li, D Yu, A Tencent Interspeech, 2262-2266, 2018	79	2018
Enhancing end-to-end multi-channel speech separation via spatial feature learning R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	62	2020
Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition. C Weng, J Cui, G Wang, J Wang, C Yu, D Su, D Yu Interspeech, 761-765, 2018	61	2018
Speech-XLNet: Unsupervised acoustic model pretraining for self-attention networks X Song, G Wang, Z Wu, Y Huang, D Su, D Yu, H Meng arXiv preprint arXiv:1910.10387, 2019	56	2019
Mm-llms: Recent advances in multimodal large language models D Zhang, Y Yu, C Li, J Dong, D Su, C Chu, D Yu arXiv preprint arXiv:2401.13601, 2024	54	2024
Diffgan-tts: High-fidelity and efficient text-to-speech with denoising diffusion gans S Liu, D Su, D Yu arXiv preprint arXiv:2201.11972, 2022	50	2022
Simple attention module based speaker verification with iterative noisy label detection X Qin, N Li, C Weng, D Su, M Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	49	2022
Diffsvc: A diffusion probabilistic model for singing voice conversion S Liu, Y Cao, D Su, H Meng 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	46	2021
Sandglasset: A light multi-granularity self-attentive network for time-domain speech separation MWY Lam, J Wang, D Su, D Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	45	2021

A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.

Cikkek 1–20

Hivatkozások évente

Ismétlődő hivatkozások

Összevont hivatkozások

Társszerzők hozzáadásaTársszerzők

Követés

Hivatkozott rá

Társszerzők