Statistical approach to enhancing esophageal speech based on Gaussian mixture models, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.4250-4253, 2010. ,
DOI : 10.1109/ICASSP.2010.5495676
Enhancement of Esophageal Speech Using Statistical Voice Conversion, APSIPA 2009, pp.805-808, 2009. ,
Enhancement of esophageal speech using formant synthesis, Proc. ICASSP, pp.1831-1834, 1999. ,
Real-time clarification of esophageal speech using a comb filter, Proc. ICDVRAT, 2002. ,
Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.8, pp.2222-2235, 2007. ,
DOI : 10.1109/TASL.2007.907344
Continuous probabilistic transform for voice conversion, IEEE Transactions on Speech and Audio Processing, vol.6, issue.2, pp.131-142, 1998. ,
DOI : 10.1109/89.661472
Application of speech conversion to alaryngeal speech enhancement, IEEE Transactions on Speech and Audio Processing, vol.5, issue.2, pp.97-105, 1997. ,
Characteristics of Voicing Source Waveforms Produced by Esophageal and Tracheoesophageal Speakers, Journal of Speech Language and Hearing Research, vol.38, issue.3, p.536, 1995. ,
DOI : 10.1044/jshr.3803.536
Replacing tracheoesophageal voicing sources using LPC synthesis, The Journal of the Acoustical Society of America, vol.88, issue.3, pp.1228-1235, 1990. ,
DOI : 10.1121/1.399700
Enhancement of female esophageal and tracheoesophageal speech, The Journal of the Acoustical Society of America, vol.98, issue.5, pp.2461-2465, 1995. ,
DOI : 10.1121/1.413279
Continuous Tracheoesophageal Speech Repair, Proc. EUSIPCO, 2006. ,
Reconstruction of Dysphonic Speech by MELP, Iberoamerican Congress on Pattern Recognition, 2008. ,
DOI : 10.1109/ICDSP.1997.628419
Reconstruction of Normal Sounding Speech for Laryngectomy Patients Through a Modified CELP Codec, IEEE Transactions on Biomedical Engineering, vol.57, issue.10, pp.2448-2458, 2010. ,
DOI : 10.1109/TBME.2010.2053369
Application of noise reduction techniques for alaryngeal speech enhancement, TENCON '97 Brisbane, Australia. Proceedings of IEEE TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications (Cat. No.97CH36162), pp.491-494, 1997. ,
DOI : 10.1109/TENCON.1997.648252
Time-spectral technique for esophageal speech regeneration, p.11, 2002. ,
Repairing Tracheoesophageal Speech Duration, Proc. Speech Prosody, 2008. ,
A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Noise Reduction and Statistical Excitation Generation, IEICE Transactions on Information and Systems, vol.97, issue.6, p.14291437, 2014. ,
DOI : 10.1587/transinf.E97.D.1429
Alaryngeal speech enhancement based on one-tomany eigenvoice conversion, IEEE/ACM Transactions on Audio , Speech, and Language Processing, vol.221, pp.172-183, 2014. ,
Sequence error (SE) minimization training of neural network for voice conversion, Proc. Interspeech, 2014. ,
Voice conversion using artificial neural networks, Proc. IEEE Int. Conf. Acoust. Speech Signal Process, pp.3893-3896, 2009. ,
Voice Conversion Using Deep Neural Networks With Layer-Wise Generative Training, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.22, issue.12, pp.1859-1872, 2014. ,
DOI : 10.1109/TASLP.2014.2353991
Voice Conversion Using RNN Pre-Trained by Recurrent Temporal Restricted Boltzmann Machines, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.23, issue.3, pp.580-587, 2015. ,
DOI : 10.1109/TASLP.2014.2379589
Voice conversion in high-order eigen space using deep belief nets, pp.369-372, 2013. ,
Dynamic programming algorithm optimization for spoken word recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.26, issue.1, pp.43-49, 1978. ,
DOI : 10.1109/TASSP.1978.1163055
Système de conversion de voix pour la synthèse de parole, 1993. ,
Deep Learning for Acoustic Modeling in Parametric Speech Generation: A systematic review of existing techniques and future trends, IEEE Signal Processing Magazine, vol.32, issue.3, pp.35-52, 2015. ,
DOI : 10.1109/MSP.2014.2359987
Nearest neighbor searching and applications, 1995. ,
Encyclopedia of Distances, 2009. ,
Spectral Mapping Using Artificial Neural Networks for Voice Conversion, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, issue.5, pp.954-964, 2010. ,
DOI : 10.1109/TASL.2010.2047683
Rectified linear units improve restricted boltzmann machines, Proceedings of the 27th international conference on machine learning (ICML-10, 2010. ,
Rectifier nonlinearities improve neural network acoustic models, Proc. ICML, 2013. ,
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification, 2015 IEEE International Conference on Computer Vision (ICCV), 2015. ,
DOI : 10.1109/ICCV.2015.123
Information retrieval for music and motion, 2007. ,
DOI : 10.1007/978-3-540-74048-3
Real-time KD-tree construction on graphics hardware, ACM Transactions on Graphics, vol.27, issue.5, p.1, 2008. ,
Voice conversion using deep Bidirectional Long Short-Term Memory based Recurrent Neural Networks, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015. ,
DOI : 10.1109/ICASSP.2015.7178896
Dropout: a simple way to prevent neural networks from overfitting, Journal of machine learning research, vol.151, pp.1929-1958, 2014. ,