Librispeech: an ASR corpus based on public domain audio books, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5206-5210, 2015. ,
The design for the Wall Street Journal-based CSR corpus, Proceedings of the workshop on Speech and Natural Language. Association for Computational Linguistics, pp.357-362, 1992. ,
The fisher corpus: a resource for the next generations of speech-to-text, LREC, vol.4, pp.69-71, 2004. ,
Switchboard: Telephone speech corpus for research and development, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol.1, pp.517-520, 1992. ,
Semi-supervised training of deep neural networks, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, pp.267-272, 2013. ,
Semi-supervised end-to-end speech recognition.," in Interspeech, pp.2-6, 2018. ,
Active learning for speech recognition: the power of gradients, 2016. ,
Active and semi-supervised learning in ASR: Benefits on the acoustic and language models, 2019. ,
Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion, Computer Speech & Language, vol.24, pp.433-444, 2010. ,
Transfer learning for speech and language processing, Proc. AP-SIPA Annual Summit and Conf, pp.1225-1237, 2015. ,
Wav2letter: an end-to-end convnetbased speech recognition system, 2016. ,
Making deep neural networks robust to label noise: A loss correction approach, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp.1944-1952, 2017. ,
Label-noise robust generative adversarial networks, CoRR, 2018. ,
Training convolutional networks with noisy labels, 2014. ,
Learning deep networks from noisy labels with dropout regularization, 2016 IEEE 16th International Conference on Data Mining (ICDM), pp.967-972, 2016. ,
Training deep neural-networks using a noise adaptation layer, ICLR, 2017. ,
Learning from massive noisy labeled data for image classification, Proceedings of the IEEE conference on computer vision and pattern recognition, pp.2691-2699, 2015. ,
Learning from binary labels with instance-dependent corruption, 2016. ,
A spelling correction model for end-to-end speech recognition, ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing ,
, IEEE, pp.5651-5655, 2019.
ASR for under-resourced languages from probabilistic transcription, IEEE/ACM Transactions on Audio, Speech and Language Processing, vol.25, issue.1, pp.50-63, 2017. ,
Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks, Proceedings of the 23rd international conference on Machine learning, pp.369-376, 2006. ,
Purely sequencetrained neural networks for asr based on lattice-free mmi, pp.2751-2755, 2016. ,
A fully differentiable beam search decoder, 2019. ,
Wav2letter++: The fastest opensource speech recognition system, 2018. ,
Letter-based speech recognition with gated convnets, 2017. ,
Weight normalization: A simple reparameterization to accelerate training of deep neural networks, Advances in Neural Information Processing Systems, pp.901-909, 2016. ,
Language modeling with gated convolutional networks, Proceedings of the 34th International Conference on Machine Learning, vol.70, pp.933-941, 2017. ,
RWTH ASR systems for librispeech: Hybrid vs attention-w/o data augmentation, 2019. ,
Improving LSTM-CTC based ASR performance in domains with limited training data, 2017. ,