Yu Gu

Cited by

	All	Since 2019
Citations	333	266
h-index	8	7
i10-index	8	7

20152016201720182019202020212022202320249 11 12 34 40 51 49 49 54 23

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Zhen-Hua Ling（凌震华）Professor, University of Science and Technology of ChinaVerified email at ustc.edu.cn
Ao LiUniversity of Science and Technology of ChinaVerified email at ustc.edu.cn
Yuxuan WangByteDanceVerified email at cse.ohio-state.edu
Yang AiPostdoctoral Researcher, University of Science and Technology of ChinaVerified email at ustc.edu.cn
Jitong ChenByteDanceVerified email at cse.ohio-state.edu
Jie Zhang (张结)Assistant Professor, University of Science and Technology of China (USTC)Verified email at ustc.edu.cn
Qiushi ZhuUniversity of Science and Technology of ChinaVerified email at mail.ustc.edu.cn
Yuchen HuNanyang Technological UniversityVerified email at e.ntu.edu.sg
Dan SuTencent AI LabVerified email at tencent.com
benlaitang(汤本来)

Yu Gu

Tencent AI Lab

Verified email at mail.ustc.edu.cn

Speech Signal Processing Speech Synthesis and Enhancement audio and music generation


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
ByteSing: A chinese singing voice synthesis system using duration allocated encoder-decoder acoustic models and WaveRNN vocoders Y Gu, X Yin, Y Rao, Y Wan, B Tang, Y Zhang, J Chen, Y Wang, Z Ma 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021	75	2021
Waveform modeling and generation using hierarchical recurrent neural networks for speech bandwidth extension ZH Ling, Y Ai, Y Gu, LR Dai IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (5), 883-894, 2018	75	2018
Speech bandwidth extension using bottleneck features and deep recurrent neural networks. Y Gu, ZH Ling, LR Dai Interspeech, 297-301, 2016	54	2016
A Kinect based gesture recognition algorithm using GMM and HMM Y Song, Y Gu, P Wang, Y Liu, A Li 2013 6th International Conference on Biomedical Engineering and Informatics …, 2013	34	2013
Waveform Modeling Using Stacked Dilated Convolutional Neural Networks for Speech Bandwidth Extension. Y Gu, ZH Ling INTERSPEECH, 1123-1127, 2017	28	2017
Human action recognition based on depth images from microsoft kinect T Liu, Y Song, Y Gu, A Li 2013 Fourth Global Congress on Intelligent Systems, 200-204, 2013	28	2013
Multi-task WaveNet: A multi-task generative model for statistical parametric speech synthesis without fundamental frequency conditions Y Gu, Y Kang arXiv preprint arXiv:1806.08619, 2018	23	2018
Restoring high frequency spectral envelopes using neural networks for speech bandwidth extension Y Gu, ZH Ling 2015 International Joint Conference on Neural Networks (IJCNN), 1-8, 2015	11	2015
Speech vocoder based on deep convolutional neural networks HC Wu, Y Gu, ZH Ling Proc. of the 14th National Conference on Man-Machine Speech Communicationn …, 2017	2*	2017
DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis Y Gu, Y Bian, G Lei, C Weng, D Su arXiv preprint arXiv:2309.12792, 2023	1	2023
Rep2wav: Noise Robust text-to-speech Using self-supervised representations Q Zhu, Y Gu, C Weng, Y Hu, L Dai, J Zhang arXiv preprint arXiv:2308.14553, 2023	1	2023
Eeg2vec: Self-Supervised Electroencephalographic Representation Learning Q Zhu, X Zhao, J Zhang, Y Gu, C Weng, Y Hu arXiv preprint arXiv:2305.13957, 2023	1	2023
Opine: Leveraging a Optimization-Inspired Deep Unfolding Method for Multi-Channel Speech Enhancement A Li, R Chen, Y Gu, C Weng, D Su ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
DurIAN-E 2: Duration Informed Attention Network with Adaptive Variational Autoencoder and Adversarial Learning for Expressive Text-to-Speech Synthesis Y Gu, Q Zhu, G Lei, C Weng, D Su ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
Sifisinger: A High-Fidelity End-to-End Singing Voice Synthesizer Based on Source-Filter Model J Cui, Y Gu, C Weng, J Zhang, L Chen, L Dai ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024		2024
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation Q Zhu, J Zhang, Y Gu, Y Hu, L Dai arXiv preprint arXiv:2401.03468, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–16

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors