Követés
Dan Su
Dan Su
Tencent AI Lab
E-mail megerősítve itt: tencent.com
Cím
Hivatkozott rá
Hivatkozott rá
Év
Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio
G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...
arXiv preprint arXiv:2106.06909, 2021
1712021
Replay and synthetic speech detection with res2net architecture
X Li, N Li, C Weng, X Liu, D Su, D Yu, H Meng
ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021
1422021
Fastdiff: A fast conditional diffusion model for high-quality speech synthesis
R Huang, MWY Lam, J Wang, D Su, D Yu, Y Ren, Z Zhao
arXiv preprint arXiv:2204.09934, 2022
1322022
Deep extractor network for target speaker recovery from single channel speech mixtures
J Wang, J Chen, D Su, L Chen, M Yu, Y Qian, D Yu
arXiv preprint arXiv:1807.08974, 2018
1032018
DurIAN: Duration Informed Attention Network for Speech Synthesis.
C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ...
Interspeech, 2027-2031, 2020
1022020
Durian: Duration informed attention network for multimodal synthesis
C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ...
arXiv preprint arXiv:1909.01700, 2019
1002019
Component fusion: Learning replaceable language model component for end-to-end speech recognition system
C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
992019
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information.
R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu
Interspeech, 4290-4294, 2019
922019
BDDM: Bilateral denoising diffusion models for fast and high-quality speech synthesis
MWY Lam, J Wang, D Su, D Yu
arXiv preprint arXiv:2203.13508, 2022
792022
End-to-end multi-channel speech separation
R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
arXiv preprint arXiv:1905.06286, 2019
792019
Investigating end-to-end speech recognition for mandarin-english code-switching
C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
792019
Deep Discriminative Embeddings for Duration Robust Speaker Verification.
N Li, D Tuo, D Su, Z Li, D Yu, A Tencent
Interspeech, 2262-2266, 2018
772018
Enhancing end-to-end multi-channel speech separation via spatial feature learning
R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
622020
Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition.
C Weng, J Cui, G Wang, J Wang, C Yu, D Su, D Yu
Interspeech, 761-765, 2018
612018
Speech-XLNet: Unsupervised acoustic model pretraining for self-attention networks
X Song, G Wang, Z Wu, Y Huang, D Su, D Yu, H Meng
arXiv preprint arXiv:1910.10387, 2019
572019
Diffgan-tts: High-fidelity and efficient text-to-speech with denoising diffusion gans
S Liu, D Su, D Yu
arXiv preprint arXiv:2201.11972, 2022
492022
Sandglasset: A light multi-granularity self-attentive network for time-domain speech separation
MWY Lam, J Wang, D Su, D Yu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
462021
Simple attention module based speaker verification with iterative noisy label detection
X Qin, N Li, C Weng, D Su, M Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
452022
Investigating robustness of adversarial samples detection for automatic speaker verification
X Li, N Li, J Zhong, X Wu, X Liu, D Su, D Yu, H Meng
arXiv preprint arXiv:2006.06186, 2020
442020
Diffsvc: A diffusion probabilistic model for singing voice conversion
S Liu, Y Cao, D Su, H Meng
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
432021
A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.
Cikkek 1–20