A. Ehsaneddin, R. Mohammad, and . Mofrad, Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics, PloS one, vol.10, pp.11-0141287, 2015.

B. Dzmitry, C. Kyunghyun, and B. Yoshua, Neural machine translation by jointly learning to align and translate, 2014.

C. Kyunghyun, Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation, 2014.

J. A. Peter and . Cock, Biopython : freely available Python tools for computational molecular biology and bioinformatics, Bioinformatics, vol.2511, pp.1422-1423, 2009.

D. John, H. Elad, and S. Yoram, Adaptive subgradient methods for online learning and stochastic optimization, Journal of Machine Learning Research, vol.12, pp.2121-2159, 2011.

K. Naomi, . Fox, E. Steven, . Brenner, and C. John-marc, SCOPe : Structural Classification of Proteins?extended, integrating SCOP and ASTRAL data and classification of new structures, Nucleic acids research 42, pp.1-304, 2014.

G. Yarin, A theoretically grounded application of dropout in recurrent neural networks " . In : arXiv preprint, 2015.

G. Xavier, B. Antoine, and B. Yoshua, Deep Sparse Rectifier Neural Networks, In : Aistats. T, vol.15, issue.106, p.275, 2011.

H. Thomas and M. Bernard, PDB file parser and structure class implemented in Python, In : Bioinformatics, vol.1917, pp.2308-2310, 2003.

E. Geoffrey, . Hinton, R. Ruslan, and . Salakhutdinov, Reducing the dimensionality of data with neural networks ", In : Science, vol.3135786, pp.504-507, 2006.

H. Geoffrey, S. Nirsh, and S. Kevin, Lecture 6a Overview of mini?batch gradient descent

H. Sepp and S. Jürgen, Long short-term memory, Neural computation 9, pp.1735-1780, 1997.

J. Liu, Predicting protein structural classes with autoencoder neural networks, Control and Decision Conference (CCDC), 2013 25th Chinese. IEEE. 2013, pp.1894-1899

K. Wolfgang and S. Christian, Dictionary of protein secondary structure : pattern recognition of hydrogen-bonded and geometrical features, Biopolymers, vol.2212, pp.2577-2637, 1983.

L. Yann, Gradient-based learning applied to document recognition, Proceedings of the IEEE, pp.2278-2324, 1998.

M. Abadi, TensorFlow : Large-Scale Machine Learning on Heterogeneous Systems Software available from tensorflow.org, 2015.

Q. Yanjun, A unified multitask architecture for predicting local protein properties, PloS one, vol.73, p.32235, 2012.

A. Rami, Theano : A Python framework for fast computation of mathematical expressions In : arXiv e-prints abs/1605, p.2688, 2016.

S. Richard, Semi-supervised recursive autoencoders for predicting sentiment distributions, Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp.151-161, 2011.

S. Matt, J. Eickholt, and C. Jianlin, A Deep Learning Network Approach to Ab Initio Protein Secondary Structure Prediction, IEEE/ACM Trans. Comput. Biol. Bioinformatics, vol.121, pp.103-112, 2015.

S. Nitish, Dropout : a simple way to prevent neural networks from overfitting, In : Journal of Machine Learning Research, vol.151, pp.1929-1958, 2014.

S. Ilya, V. Oriol, V. Quoc, and . Le-ghahramani, Sequence to Sequence Learning with Neural Networks URL : http : / / papers.nips.cc/paper/5346-sequence-to-sequence-learning-with-neural-networks, Advances in Neural Information Processing Systems 27, pp.3104-3112, 2014.

V. Pascal, Extracting and Composing Robust Features with Denoising Autoencoders, Proceedings of the 25th International Conference on Machine Learning. ICML '08, pp.1096-1103, 2008.

V. Pascal, Stacked denoising autoencoders : Learning useful representations in a deep network with a local denoising criterion, Journal of Machine Learning Research, vol.11, pp.3371-3408, 2010.