Parametric stereo extension of ITU-T G.722 based on a new downmixing scheme

Abstract: In this paper, we present a novel frequency-domain stereo-to-mono downmixing technique, which preserves the energy of spectral components and avoids setting the left or right channel as a phase reference. Based on this downmixing technique, a parametric stereo analysis-synthesis model is described in which subband stereo parameters consist of interchannel level differences and phase differences between the mono signal and one of the stereo channels (left or right). This model is applied to the stereo extension of ITU-T G.722 at 56+8 and 64+16 kbit/s with a frame length of 5 ms. AB test results are provided to assess the quality of the proposed downmixing technique. In addition, the quality of the proposed G.722-based stereo coder is compared against reference coders (G.722.1 at 24 and 32 kbit/s dual mono and G.722 at 64 kbit/s dual mono) for clean speech, noisy speech and music.
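The abstract does not give the downmixing formula itself, so the following is only a minimal sketch of the general idea, under stated assumptions. The function names (`downmix_bin`, `ild_db`, `phase_difference`) are hypothetical, and the specific rule used here (root-mean-square magnitude, phase taken from the sum of both channels) is an illustrative choice, not the scheme defined in the paper. It shows how a per-bin downmix can preserve energy without using either channel as the phase reference, and how the two kinds of stereo parameters named above could be computed.

```python
import cmath
import math


def downmix_bin(left, right):
    """Hypothetical energy-preserving downmix of one complex spectral bin.

    Assumption for illustration only: this is NOT the paper's formula.
    """
    # Magnitude: RMS of the channel magnitudes, so |M|^2 = (|L|^2 + |R|^2) / 2.
    magnitude = math.sqrt((abs(left) ** 2 + abs(right) ** 2) / 2.0)
    # Phase: taken from L + R, which treats both channels symmetrically
    # instead of picking the left or right channel as the reference.
    total = left + right
    phase = cmath.phase(total) if abs(total) > 1e-12 else 0.0
    return cmath.rect(magnitude, phase)


def ild_db(left_bins, right_bins, eps=1e-12):
    """Interchannel level difference of one subband, in dB (left over right)."""
    energy_left = sum(abs(x) ** 2 for x in left_bins)
    energy_right = sum(abs(x) ** 2 for x in right_bins)
    return 10.0 * math.log10((energy_left + eps) / (energy_right + eps))


def phase_difference(channel_bin, mono_bin):
    """Phase of one stereo channel relative to the mono signal, in radians."""
    return cmath.phase(channel_bin * mono_bin.conjugate())


if __name__ == "__main__":
    # Out-of-phase channels: a passive downmix (L + R) / 2 would cancel to zero,
    # whereas the magnitude rule above keeps the average channel energy.
    left, right = 1 + 0j, -1 + 0j
    print(abs((left + right) / 2))
    print(abs(downmix_bin(left, right)))
    print(ild_db([2 + 0j], [1 + 0j]))
```

The out-of-phase case in the usage example is the situation where a passive downmix loses energy, which is the kind of problem an energy-preserving scheme is meant to avoid.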
Document type:
Conference paper
IEEE Multimedia Signal Processing Conference, Oct 2010, Saint-Malo, France. 2010

https://hal.inria.fr/inria-00512646
Contributor: Pascal Scalart
Submitted on: Tuesday, 31 August 2010 - 10:59:43
Last modified on: Thursday, 15 November 2018 - 11:57:39

Identifiers

  • HAL Id: inria-00512646, version 1

Citation

Thi Minh Nguyet Hoang, Stéphane Ragot, Balazs Kövesi, Pascal Scalart. Parametric stereo extension of ITU-T G.722 based on a new downmixing scheme. IEEE Multimedia Signal Processing Conference, Oct 2010, Saint-Malo, France. 2010. 〈inria-00512646〉
