Multiple sound sources localization from coarse to fine R Qian, D Hu, H Dinkel, M Wu, N Xu, W Lin Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 159 | 2020 |
Audio caption: Listen and tell M Wu, H Dinkel, K Yu ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 80 | 2019 |
Towards duration robust weakly supervised sound event detection H Dinkel, M Wu, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 887-900, 2021 | 65 | 2021 |
Investigating local and global information for automated audio captioning with transfer learning X Xu, H Dinkel, M Wu, Z Xie, K Yu ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021 | 64 | 2021 |
LLM-empowered Chatbots for Psychiatrist and Patient Simulation: Application and Evaluation S Chen, M Wu, KQ Zhu, LC Kunyao Lan, Zhiling Zhang arXiv preprint arXiv:2305.13614, 2023 | 59* | 2023 |
What does a Car-ssette tape tell? X Xu, H Dinkel, M Wu, K Yu arXiv preprint arXiv:1905.13448v1, 2019 | 59* | 2019 |
Depa: Self-supervised audio embedding for depression detection P Zhang, M Wu, H Dinkel, K Yu Proceedings of the 29th ACM international conference on multimedia, 135-143, 2021 | 55 | 2021 |
A CRNN-GRU Based Reinforcement Learning Approach to Audio Captioning. X Xu, H Dinkel, M Wu, K Yu DCASE, 225-229, 2020 | 52 | 2020 |
Can audio captions be evaluated with image caption metrics? Z Zhou, Z Zhang, X Xu, Z Xie, M Wu, KQ Zhu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 50 | 2022 |
Voice activity detection in the wild: A data-driven approach using teacher-student training H Dinkel, S Wang, X Xu, M Wu, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1542-1555, 2021 | 47 | 2021 |
Building interpretable interaction trees for deep nlp models D Zhang, H Zhang, H Zhou, X Bao, D Huo, R Chen, X Cheng, M Wu, ... Proceedings of the AAAI conference on artificial intelligence 35 (16), 14328 …, 2021 | 39 | 2021 |
The SJTU system for DCASE2022 challenge task 6: Audio captioning with audio-text retrieval pre-training X Xu, Z Xie, M Wu, K Yu Tech. Rep., DCASE2022 Challenge, 2022 | 34 | 2022 |
Voice activity detection in the wild via weakly supervised sound event detection H Dinkel, Y Chen, M Wu, K Yu arXiv preprint arXiv:2003.12222, 2020 | 32 | 2020 |
Text-based depression detection on sparse data H Dinkel, M Wu, K Yu arXiv preprint arXiv:1904.05154, 2019 | 31 | 2019 |
Audio-text retrieval in context S Lou, X Xu, M Wu, K Yu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 29 | 2022 |
Psychiatric scale guided risky post screening for early detection of depression Z Zhang, S Chen, M Wu, KQ Zhu arXiv preprint arXiv:2205.09497, 2022 | 25 | 2022 |
Text-to-audio grounding: Building correspondence between captions and sound events X Xu, H Dinkel, M Wu, K Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 25 | 2021 |
Symptom identification for interpretable detection of multiple mental disorders Z Zhang, S Chen, M Wu, KQ Zhu arXiv preprint arXiv:2205.11308, 2022 | 23 | 2022 |
Climate and weather: Inspecting depression detection via emotion recognition W Wu, M Wu, K Yu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 21 | 2022 |
Decoupled dialogue modeling and semantic parsing for multi-turn text-to-SQL Z Chen, L Chen, H Li, R Cao, D Ma, M Wu, K Yu arXiv preprint arXiv:2106.02282, 2021 | 21 | 2021 |