Transfer Learning for Music Genre Classification - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2017

Transfer Learning for Music Genre Classification

Zhijie Wang
  • Fonction : Auteur
  • PersonId : 1033406
Fang Han
  • Fonction : Auteur
  • PersonId : 1033407

Résumé

Modern music information retrieval system provides high-level features (genre, instrument, mood and so on) for searching and recommending conveniently. Among these music tags, genre is the most widely used in practice. Machine learning technique has the ability of cataloguing different genres from raw music. A disadvantage of it is that the final performance heavily depends on the used features. As a powerful learning algorithm, deep neural network can extract useful features automatically and effectively instead of time-consuming feature engineering. But deeper architecture means larger data are needed to train the neural network. In many cases, we may not have enough data to train a deep network. Transfer learning solves the problem by pre-training the network in a similar task which has enough data, then fine-tuning the parameters of the pre-trained network using the target dataset. Magnatagatune dataset is used for pre-training the proposed five-layer Recurrent Neural Network (RNN) with Gated Recurrent Unit (GRU). And in order to reduce the input of the network, scattering transform is used in this paper. Then GTZAN dataset is used as the target dataset of genre classification. Experimental results show the transfer learning way can achieve a higher average classification accuracy (95.8%) than the same deep RNN which initials the parameters randomly (93.5%). In addition, the deep RNN using transfer learning converges to the final accuracy faster than using random initialization.
Fichier principal
Vignette du fichier
978-3-319-68121-4_19_Chapter.pdf (378.83 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01820925 , version 1 (22-06-2018)

Licence

Paternité

Identifiants

Citer

Guangxiao Song, Zhijie Wang, Fang Han, Shenyi Ding. Transfer Learning for Music Genre Classification. 2nd International Conference on Intelligence Science (ICIS), Oct 2017, Shanghai, China. pp.183-190, ⟨10.1007/978-3-319-68121-4_19⟩. ⟨hal-01820925⟩
545 Consultations
373 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More