Követés
Jianwei Yu
Jianwei Yu
Tencent AI lab
E-mail megerősítve itt: tencent.com
Cím
Hivatkozott rá
Hivatkozott rá
Év
Diffsound: Discrete diffusion model for text-to-sound generation
D Yang, J Yu, H Wang, W Wang, C Weng, Y Zou, D Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
1752023
Speech emotion recognition using capsule networks
X Wu, S Liu, Y Cao, X Li, J Yu, D Dai, X Ma, S Hu, Z Wu, X Liu, H Meng
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
1202019
Adversarial attacks on GMM i-vector based speaker verification systems
X Li, J Zhong, X Wu, J Yu, X Liu, H Meng
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
982020
Audio-visual recognition of overlapped speech for the lrs2 dataset
J Yu, SX Zhang, J Wu, S Ghorbani, B Wu, S Kang, S Liu, X Liu, H Meng, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
942020
Music source separation with band-split RNN
Y Luo, J Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
602023
Investigation of data augmentation techniques for disordered speech recognition
M Geng, X Xie, S Liu, J Yu, S Hu, X Liu, H Meng
arXiv preprint arXiv:2201.05562, 2022
572022
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus.
J Yu, X Xie, S Liu, S Hu, MWY Lam, X Wu, KH Wong, X Liu, H Meng
Interspeech, 2938-2942, 2018
532018
Recent progress in the cuhk dysarthric speech recognition system
S Liu, M Geng, S Hu, X Xie, M Cui, J Yu, X Liu, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2267-2281, 2021
482021
Dirichlet graph variational autoencoder
J Li, J Yu, J Li, H Zhang, K Zhao, Y Rong, H Cheng, J Huang
Advances in Neural Information Processing Systems 33, 5274-5283, 2020
452020
End-to-end code-switched tts with mix of monolingual recordings
Y Cao, X Wu, S Liu, J Yu, X Li, Z Wu, X Liu, H Meng
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
432019
Gaussian process lstm recurrent neural network language models for speech recognition
MWY Lam, X Chen, S Hu, J Yu, X Liu, H Meng
ICASSP 2019-2019 IEEE international conference on acoustics, speech and …, 2019
402019
Improved end-to-end dysarthric speech recognition via meta-learning based model re-initialization
D Wang, J Yu, X Wu, L Sun, X Liu, H Meng
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
372021
End-to-end voice conversion via cross-modal knowledge distillation for dysarthric speech reconstruction
D Wang, J Yu, X Wu, S Liu, L Sun, X Liu, H Meng
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
372020
Development of the cuhk elderly speech recognition system for neurocognitive disorder detection using the dementiabank corpus
Z Ye, S Hu, J Li, X Xie, M Geng, J Yu, J Xu, B Xue, S Liu, X Liu, H Meng
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
312021
Neural architecture search for LF-MMI trained time delay neural networks
S Hu, X Xie, M Cui, J Deng, S Liu, J Yu, M Geng, X Liu, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1093-1107, 2022
282022
A comparative study of acoustic and linguistic features classification for alzheimer's disease detection
J Li, J Yu, Z Ye, S Wong, M Mak, B Mak, X Liu, H Meng
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
282021
Bayesian transformer language models for speech recognition
B Xue, J Yu, J Xu, S Liu, S Hu, Z Ye, M Geng, X Liu, H Meng
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
252021
Deconvolutional networks on graph data
J Li, J Li, Y Liu, J Yu, Y Li, H Cheng
Advances in Neural Information Processing Systems 34, 21019-21030, 2021
232021
Audio-visual multi-channel integration and recognition of overlapped speech
J Yu, SX Zhang, B Wu, S Liu, S Hu, M Geng, X Liu, H Meng, D Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2067-2082, 2021
232021
Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition.
S Liu, X Xie, J Yu, S Hu, M Geng, R Su, SX Zhang, X Liu, H Meng
Interspeech, 711-715, 2020
202020
A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.
Cikkek 1–20