Desh Raj
Meta AI
Verified email at meta.com
Title
Cited by
Year
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings
S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ...
arXiv preprint arXiv:2004.09249, 2020
Cited by 285, 2020
Probing the information encoded in x-vectors
D Raj, D Snyder, D Povey, S Khudanpur
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
Cited by 103, 2019
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
D Raj, P Denisov, Z Chen, H Erdogan, Z Huang, M He, S Watanabe, J Du, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 897-904, 2021
Cited by 81, 2021
DOVER-Lap: A method for combining overlap-aware diarization outputs
D Raj, LP Garcia-Perera, Z Huang, S Watanabe, D Povey, A Stolcke, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 881-888, 2021
Cited by 66, 2021
Learning local and global contexts using a convolutional recurrent network model for relation classification in biomedical text
D Raj, S Sahu, A Anand
Proceedings of the 21st conference on computational natural language …, 2017
Cited by 46, 2017
Sequential multi-frame neural beamforming for speech separation and enhancement
ZQ Wang, H Erdogan, S Wisdom, K Wilson, D Raj, S Watanabe, Z Chen, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 905-911, 2021
Cited by 40, 2021
The Hitachi-JHU DIHARD III system: Competitive end-to-end neural diarization and x-vector clustering systems combined by DOVER-Lap
S Horiguchi, N Yalta, P Garcia, Y Takashima, Y Xue, D Raj, Z Huang, ...
arXiv preprint arXiv:2102.01363, 2021
Cited by 37, 2021
Multi-class spectral clustering with overlaps for speaker diarization
D Raj, Z Huang, S Khudanpur
2021 IEEE Spoken Language Technology Workshop (SLT), 582-589, 2021
Cited by 32, 2021
Target-speaker voice activity detection with improved i-vector estimation for unknown number of speaker
M He, D Raj, Z Huang, J Du, Z Chen, S Watanabe
arXiv preprint arXiv:2108.03342, 2021
Cited by 28, 2021
Using ASR methods for OCR
A Arora, CC Chang, B Rekabdar, B BabaAli, D Povey, D Etter, D Raj, ...
2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019
Cited by 23, 2019
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios
S Cornell, M Wiesner, S Watanabe, D Raj, X Chang, P Garcia, ...
arXiv preprint arXiv:2306.13734, 2023
Cited by 22, 2023
GPU-accelerated guided source separation for meeting transcription
D Raj, D Povey, S Khudanpur
arXiv preprint arXiv:2212.05271, 2022
Cited by 21, 2022
Uncertain fuzzy self-organization based clustering: interval type-2 approach to adaptive resonance theory
S Majheed, A Gupta, D Raj, FCH Rhee
Information Sciences, 2017
Cited by 21*, 2017
Continuous streaming multi-talker ASR with dual-path transducers
D Raj, L Lu, Z Chen, Y Gaur, J Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 17, 2022
The JHU multi-microphone multi-speaker ASR system for the CHiME-6 challenge
A Arora, D Raj, AS Subramanian, K Li, B Ben-Yair, M Maciejewski, ...
arXiv preprint arXiv:2006.07898, 2020
Cited by 16, 2020
Analysis of Data Generated from Multidimensional Type-1 and Type-2 Fuzzy Membership Functions
D Raj, A Gupta, B Garg, K Tanna, FCH Rhee
IEEE Transactions on Fuzzy Systems
Cited by 12*
Adapting self-supervised models to multi-talker speech recognition using speaker embeddings
Z Huang, D Raj, P García, S Khudanpur
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 11, 2023
Injecting text and cross-lingual supervision in few-shot learning from self-supervised models
M Wiesner, D Raj, S Khudanpur
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by 8, 2022
Low-latency speech separation guided diarization for telephone conversations
G Morrone, S Cornell, D Raj, L Serafini, E Zovato, A Brutti, S Squartini
2022 IEEE Spoken Language Technology Workshop (SLT), 641-646, 2023
Cited by 6, 2023
Joint speaker diarization and speech recognition based on region proposal networks
Z Huang, M Delcroix, LP Garcia, S Watanabe, D Raj, S Khudanpur
Computer Speech & Language 72, 101316, 2022
Cited by 6, 2022