J. Aucouturier and F. Pachet, The influence of polyphony on the dynamical modelling of musical timbre, Pattern Recognition Letters, vol.28, issue.5, pp.654-661, 2007.
DOI : 10.1016/j.patrec.2006.11.004

M. Cuturi, J. Vert, O. Birkenes, and T. Matsui, A Kernel for Time Series Based on Global Alignments, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, pp.413-416, 2007.
DOI : 10.1109/ICASSP.2007.366260

D. P. Ellis, Sinewave and sinusoid+noise analysis/synthesis in Matlab. Online web resource, 2003.

D. P. Ellis, PLP and RASTA (and MFCC, and inversion) in Matlab. Online web resource, 2005.

A. Eronen, Musical instrument recognition using ICA-based transform of features and discriminatively trained HMMs, Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings., 2003.
DOI : 10.1109/ISSPA.2003.1224833

S. Essid, G. Richard, and B. David, Musical instrument recognition based on class pairwise feature selection, Proc. of ISMIR, 2004.

M. Goto, H. Hashiguchi, T. Nishimura, and R. Oka, RWC Music Database: Music Genre Database and Musical Instrument Sound Database, Proceedings of the 4th International Conference on Music Information Retrieval, 2003.

J. M. Grey and J. A. Moorer, Perceptual evaluations of synthesized musical instrument tones, The Journal of the Acoustical Society of America, vol.62, issue.2, pp.454-462, 1977.
DOI : 10.1121/1.381508

C. Joder, S. Essid, and G. Richard, Temporal Integration for Audio Classification With Application to Musical Instrument Classification, IEEE Transactions on Audio, Speech, and Language Processing, vol.17, issue.1, pp.174-186, 2009.
DOI : 10.1109/TASL.2008.2007613

T. Kitahara, M. Goto, K. Komatani, T. Ogata, and H. G. Okuno, Musical Instrument Recognizer "Instrogram" and Its Application to Music Retrieval Based on Instrumentation Similarity, Eighth IEEE International Symposium on Multimedia (ISM'06), 2006.
DOI : 10.1109/ISM.2006.113

M. Lagrange, Sinusoidal Modeling of Polyphonic Sounds, 2004.
URL : https://hal.archives-ouvertes.fr/hal-00308194

M. Lagrange, A New Dissimilarity Metric For The Clustering Of Partials Using The Common Variation Cue, Proc. ICMC. ICMA, 2005.
URL : https://hal.archives-ouvertes.fr/hal-00308192

M. Lagrange, S. Marchand, and J. Rault, Enhancing the Tracking of Partials for the Sinusoidal Modeling of Polyphonic Sounds, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.5, pp.357-366, 2007.
DOI : 10.1109/TASL.2007.896654

URL : https://hal.archives-ouvertes.fr/hal-00308191

S. Marchand and M. Raspaud, Enhanced Time-Stretching Using Order-2 Sinusoidal Modeling, Proc. DAFx. Federico II University of, pp.76-82, 2004.
URL : https://hal.archives-ouvertes.fr/hal-00308045

J. D. Markel and A. M. Gray, Linear Prediction of Speech, 1976.
DOI : 10.1007/978-3-642-66286-7

A. Martin, G. Doddington, T. Kamm, M. Ordowski, and M. Przybocki, The DET curve in Assessment of Detection Task Performance, Proceedings of EuroSpeech, 1997.

S. Mcadams, Segregation of concurrent sounds. I: Effects of frequency modulation coherence, The Journal of the Acoustical Society of America, vol.86, issue.6, pp.2148-2159, 1989.
DOI : 10.1121/1.398475

URL : https://hal.archives-ouvertes.fr/hal-01105651

S. Mcadams and E. Bigand, Thinking in Sound, 1993.

R. J. Mcaulay and T. F. Quatieri, Speech analysis/Synthesis based on a sinusoidal representation, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.34, issue.4, pp.744-754, 1986.
DOI : 10.1109/TASSP.1986.1164910

M. Mellody and G. Wakefield, The time-frequency characteristics of violin vibrato: Modal distribution analysis and synthesis, The Journal of the Acoustical Society of America, vol.107, issue.1, pp.598-611, 2000.
DOI : 10.1121/1.428326

A. Meng, P. Ahrendt, J. Larsen, and L. Hansen, Temporal Feature Integration for Music Genre Classification, IEEE Transactions on Audio, Speech and Language Processing, vol.15, issue.5, pp.1654-1664, 2007.
DOI : 10.1109/TASL.2007.899293

L. Rabiner and B. H. Juang, Fundamentals of Speech Recognition, 1993.

M. Raspaud, Hierarchical spectral models for sound and applications, 2007.

M. Raspaud, S. Marchand, and L. Girin, A Generalized Polynomial and Sinusoidal Model for Partial Tracking and Time Stretching, 2005.
URL : https://hal.archives-ouvertes.fr/hal-00307987

A. Robel, Adaptive additive modeling with continuous parameter trajectories, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.4, pp.1440-1453, 2006.
DOI : 10.1109/TSA.2005.858529

N. Scaringella and G. Zoia, On the modelling of time information for automatic genre recognition systems in audio signals, Proceedings of the 6th International Conference on Music Information Retrieval (ISMIR), 2005.

H. Shimodaira, K. Noma, M. Nakai, and S. Sagayama, Dynamic timealignment kernel in support vector machine, Advances in Neural Information Processing Systems, 2002.

G. Tzanetakis and P. Cook, Musical genre classification of audio signals, IEEE Transactions on Speech and Audio Processing, vol.10, issue.5, pp.293-302, 2002.
DOI : 10.1109/TSA.2002.800560