Követés
Chen Zhang
Chen Zhang
Research Scientist, ByteDance
E-mail megerősítve itt: zju.edu.cn - Kezdőlap
Cím
Hivatkozott rá
Hivatkozott rá
Év
Discriminative and Correlative Partial Multi-Label Learning.
H Wang, W Liu, Y Zhao, C Zhang, T Hu, G Chen
IJCAI, 3691-3697, 2019
952019
SimulSpeech: End-to-end simultaneous speech to text translation
Y Ren, J Liu, X Tan, C Zhang, T Qin, Z Zhao, TY Liu
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
822020
Mega-tts: Zero-shot text-to-speech at scale with intrinsic inductive bias
Z Jiang, Y Ren, Z Ye, J Liu, C Zhang, Q Yang, S Ji, R Huang, C Wang, ...
arXiv preprint arXiv:2306.03509, 2023
772023
Uwspeech: Speech to speech translation for unwritten languages
C Zhang, X Tan, Y Ren, T Qin, K Zhang, TY Liu
AAAI 2021, 2020
602020
Make-an-audio 2: Temporal-enhanced text-to-audio generation
J Huang, Y Ren, R Huang, D Yang, Z Ye, C Zhang, J Liu, X Yin, Z Ma, ...
arXiv preprint arXiv:2305.18474, 2023
572023
Denoispeech: Denoising text to speech with frame-level noise modeling
C Zhang, Y Ren, X Tan, J Liu, K Zhang, T Qin, S Zhao, TY Liu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
512021
S3T: Self-Supervised Pre-training with Swin Transformer for Music Classification
H Zhao, C Zhang, B Zhu, Z Ma, K Zhang
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
422022
Task-level curriculum learning for non-autoregressive neural machine translation
J Liu, Y Ren, X Tan, C Zhang, T Qin, Z Zhao, TY Liu
IJCAI 2020, 2020
402020
TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method
Z Ju, P Lu, X Tan, R Wang, C Zhang, S Wu, K Zhang, X Li, T Qin, TY Liu
EMNLP 2022, 2022
382022
Mega-tts 2: Zero-shot text-to-speech with arbitrary length speech prompts
Z Jiang, J Liu, Y Ren, J He, C Zhang, Z Ye, P Wei, C Wang, X Yin, Z Ma, ...
arXiv preprint arXiv:2307.07218, 2023
322023
Real3d-portrait: One-shot realistic 3d talking portrait synthesis
Z Ye, T Zhong, Y Ren, J Yang, W Li, J Huang, Z Jiang, J He, R Huang, ...
ICLR 2024 (Spotlight), 2024
282024
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis
Z Jiang, J Liu, Y Ren, J He, Z Ye, S Ji, Q Yang, C Zhang, P Wei, C Wang, ...
The Twelfth International Conference on Learning Representations, 2024
252024
PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription
C Zhang, J Yu, LC Chang, X Tan, J Chen, T Qin, K Zhang
ISMIR 2022, 2021
192021
Relyme: improving lyric-to-melody generation by incorporating lyric-melody relationships
C Zhang, L Chang, S Wu, X Tan, T Qin, TY Liu, K Zhang
Proceedings of the 30th ACM International Conference on Multimedia, 1047-1056, 2022
172022
Songdriver: Real-time music accompaniment generation without logical latency nor exposure bias
Z Wang, K Zhang, Y Wang, C Zhang, Q Liang, P Yu, Y Feng, W Liu, ...
Proceedings of the 30th ACM International Conference on Multimedia, 1057-1067, 2022
152022
Automatic Song Translation for Tonal Languages
F Guo, C Zhang, Z Zhang, Q He, K Zhang, J Xie, J Boyd-Graber
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
152022
FastLR: Non-autoregressive lipreading model with integrate-and-fire
J Liu, Y Ren, Z Zhao, C Zhang, B Huai, J Yuan
Proceedings of the 28th ACM International Conference on Multimedia, 4328-4336, 2020
142020
SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation
C Zhang, Y Ren, K Zhang, S Yan
IEEE Transactions on MultiMedia, 2023
112023
Boosting prompting mechanisms for zero-shot speech synthesis
Z Jiang, J Liu, Y Ren, J He, Z Ye, S Ji, Q Yang, C Zhang, P Wei, C Wang, ...
The Twelfth International Conference on Learning Representations, 2023
82023
Towards Effective Multi-Modal Interchanges in Zero-Resource Sounding Object Localization
Y Zhao, C Zhang, H Huang, H Li, Z Zhao
Advances in Neural Information Processing Systems, 2022
72022
A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.
Cikkek 1–20