Követés
Minhua Wu
Minhua Wu
E-mail megerősítve itt: amazon.com
Cím
Hivatkozott rá
Hivatkozott rá
Év
Model Compression Applied to Small-Footprint Keyword Spotting.
G Tucker, M Wu, M Sun, S Panchapagesan, G Fu, S Vitaladevuni
INTERSPEECH, 1878-1882, 2016
1052016
MONOPHONE-BASED BACKGROUND MODELING FOR TWO-STAGE ON-DEVICE WAKE WORD DETECTION
M Wu, S Panchapagesan, M Sun, J Gu, R Thomas, SNP Vitaladevuni, ...
852018
Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning
L Mošner, M Wu, A Raju, SHK Parthasarathi, K Kumatani, S Sundaram, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
702019
Direct modeling of raw audio with dnns for wake word detection
K Kumatani, S Panchapagesan, M Wu, M Kim, N Strom, G Tiwari, ...
2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017
572017
Wav2vec-C: A Self-supervised Model for Speech Representation Learning
S Sadhu, D He, CW Huang, SH Mallidi, M Wu, A Rastrow, A Stolcke, ...
arXiv preprint arXiv:2103.08393, 2021
552021
Pronunciation and silence probability modeling for ASR
G Chen, H Xu, M Wu, D Povey, S Khudanpur
Sixteenth Annual Conference of the International Speech Communication …, 2015
542015
Frequency domain multi-channel acoustic modeling for distant speech recognition
W Minhua, K Kumatani, S Sundaram, N Ström, B Hoffmeister
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
512019
Time-delayed bottleneck highway networks using a dft feature for keyword spotting
J Guo, K Kumatani, M Sun, M Wu, A Raju, N Ström, A Mandal
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
442018
Multi-geometry spatial acoustic modeling for distant speech recognition
K Kumatani, W Minhua, S Sundaram, N Ström, B Hoffmeister
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
222019
Deep multi-channel acoustic modeling
A Mandal, K Kumatani, N Strom, M Wu, S Sundaram, B Hoffmeister, ...
US Patent 10,726,830, 2020
172020
Deep multi-channel acoustic modeling
A Mandal, K Kumatani, N Strom, M Wu, S Sundaram, B Hoffmeister, ...
US Patent App. 16/932,049, 2020
122020
An empirical study of cross-lingual transfer learning techniques for small-footprint keyword spotting
M Sun, A Schwarz, M Wu, N Strom, S Matsoukas, S Vitaladevuni
2017 16th IEEE International Conference on Machine Learning and Applications …, 2017
122017
Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End
SN Ray, M Wu, A Raju, P Ghahremani, R Bilgi, M Rao, H Arsikere, ...
arXiv preprint arXiv:2105.07071, 2021
102021
Robust Multi-Channel Speech Recognition Using Frequency Aligned Network
T Park, K Kumatani, M Wu, S Sundaram
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
82020
Deep multi-channel acoustic modeling using multiple microphone array geometries
K Kumatani, M Wu, S Sundaram, N Strom, B Hoffmeister
US Patent 11,574,628, 2023
72023
Monophone-based background modeling for wakeword detection
M Wu, S Panchapagesan, M Sun, SNP Vitaladevuni, B Hoffmeister, ...
US Patent 10,964,315, 2021
52021
Deep multi-channel acoustic modeling using frequency aligned network
M Wu, S Sundaram, TJ Park, K Kumatani
US Patent 11,495,215, 2022
32022
Speech processing optimizations based on microphone array
SK Sundaram, M Wu, A Raju, S Matsoukas, A Mandal, K Kumatani
US Patent 10,679,621, 2020
32020
Multi-channel Acoustic Modeling using Mixed Bitrate OPUS Compression
A Khare, S Sundaram, M Wu
arXiv preprint arXiv:2002.00122, 2020
32020
Fully Learnable Front-End for Multi-Channel Acoustic Modeling Using Semi-Supervised Learning
S Wager, A Khare, M Wu, K Kumatani, S Sundaram
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
22020
A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.
Cikkek 1–20