A fast maximum likelihood feature transformation method for GMM-HMM speaker adaptation, Neurocomputing, vol.128, pp.145-152, 2014. ,
End-to-end accented speech recognition, pp.2140-2144, 2019. ,
Domain and speaker adaptation for Cortana speech recognition, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5984-5988, 2018. ,
KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.7893-7897, 2013. ,
Speaker-adaptation for hybrid HMM-ANN continuous speech recognition system, Eurospeech, pp.2171-2174, 1995. ,
Intermediate-layer DNN adaptation for offline and session-based iterative speaker adaptation, pp.1091-1095, 2015. ,
Hermitian polynomial for speaker adaptation of connectionist speech recognition, IEEE Transactions on Audio, Speech, and Language Processing, vol.21, pp.2152-2161, 2013. ,
Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models, IEEE Spoken Language Technology Workshop (SLT), pp.171-176, 2014. ,
Comparison of BLSTM layer specific affine transformations for speaker adaptation, Interspeech, pp.877-881, 2018. ,
Front-end factor analysis for speaker verification, IEEE Transactions on Audio, Speech and Language Processing, vol.19, issue.4, pp.788-798, 2011. ,
Speaker adaptation of neural network acoustic models using i-vectors, IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), pp.55-59, 2013. ,
Improving DNN speaker independence with i-vector inputs, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.225-229, 2014. ,
Fast DNN acoustic model adaptation by learning hidden unit contribution features, Interspeech, pp.759-763, 2019. ,
Factorized hidden layer adaptation for deep neural network based acoustic modeling, IEEE Transactions on Audio, Speech, and Language Processing, vol.24, issue.12, pp.2241-2250, 2016. ,
Cluster adaptive training for deep neural network, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4325-4329, 2015. ,
Multi-basis adaptive neural network for rapid adaptation in speech recognition, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4315-4319, 2015. ,
Adaptation methods for nonnative speech, Multilinguality in Spoken Language Processing, 2001. ,
Towards acoustic model unification across dialects, IEEE Spoken Language Technology Workshop (SLT), pp.624-628, 2016. ,
Multi-accent speech recognition with hierarchical grapheme based models, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4815-4819, 2017. ,
Improved speaker adaptation by combining i-vector and fMLLR with deep bottleneck networks, International Conference on Speech and Computer (SPECOM, pp.417-426, 2017. ,
Joint modeling of accents and acoustics for multi-accent speech recognition, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.1-5, 2018. ,
Improved accented speech recognition using accent embeddings and multi-task learning, in Interspeech, pp.2454-2458, 2018. ,
Multi-accent deep neural network acoustic model with accent-specific top layer, Interspeech, pp.2977-2981, 2014. ,
Improving deep neural networks based multi-accent mandarin speech recognition using i-vectors and accent-specific top layer, 2015. ,
Crosslanguage knowledge transfer using multilingual deep neural network with shared hidden layers, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.7304-7308, 2013. ,
Ctc regularized model adaptation for improving lstm rnn based multi-accent mandarin speech recognition, Journal of Signal Processing Systems, vol.90, issue.7, pp.985-997, 2018. ,
X-vectors: Robust DNN embeddings for speaker recognition, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5329-5333, 2018. ,
The Kaldi speech recognition toolkit, Tech. Rep, 2011. ,
Phoneme recognition using time-delay neural networks, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.37, issue.3, pp.328-339, 1989. ,
Purely sequence-trained neural networks for ASR based on lattice-free MMI, pp.2751-2755, 2016. ,
Semisupervised training of acoustic models using lattice-free MMI, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4844-4848, 2018. ,
Phondat-Verbmobil speech corpus, European Conference on Speech Communication and Technology, 1995. ,
, Voxforge: an open and free speech corpus for speaker recognition, pp.2020-2023
A highly adaptive acoustic model for accurate multi-dialect speech recognition, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.5716-5720, 2019. ,
A time delay neural network architecture for efficient modeling of long temporal contexts, pp.3214-3218, 2015. ,
Audio augmentation for speech recognition, pp.3586-3589, 2015. ,
Visualizing data using t-SNE, Journal of Machine Learning Research, vol.9, pp.2579-2605, 2008. ,