K. Matui, N. Hara, N. Kobayashi, and H. Hirose, Enhancement of esophageal speech using formant synthesis, Proc. ICASSP, pp.1831-1834, 1999.

A. Hisada and H. Sawada, Real-time clarification of oesophageal speech using a comb filter, International Conference on Disability, Virtual Reality and Associated Technologies, pp.39-46, 2002.

D. Yu and L. Deng, Deep learning and its applications to signal and information processing, IEEE Signal Processing Magazine, pp.145-154, 2011.
DOI : 10.1109/msp.2010.939038

Y. Stylianou, O. Cappé, and E. Moulines, Continuous probabilistic transform for voice conversion, IEEE Transactions on Speech and Audio Processing, vol.6, issue.2, pp.131-142, 1998.
DOI : 10.1109/89.661472

H. Sakoe and S. Chiba, Dynamic programming algorithm optimization for spoken word recognition, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.26, issue.1, pp.43-49, 1978.
DOI : 10.1109/TASSP.1978.1163055

X. Zhu, G. T. Beauregard, and L. L. Wyse, Real-Time Signal Estimation From Modified Short-Time Fourier Transform Magnitude Spectra, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.5, pp.1645-1653, 2007.
DOI : 10.1109/TASL.2007.899236