Incremental Cross-Modality Deep Learning for Pedestrian Recognition

Danut Ovidiu Pop; Alexandrina Rogozan; Fawzi Nashashibi; Abdelaziz Bensrhair

doi:10.1109/IVS.2017.7995771

Communication Dans Un Congrès Année : 2017

Incremental Cross-Modality Deep Learning for Pedestrian Recognition

(1, 2, 3) , (2) , (1) , (2)

1
2
3

Danut Ovidiu Pop

Fonction : Auteur

Robotics & Intelligent Transportation Systems

Institut national des sciences appliquées Rouen Normandie

Babes-Bolyai University [Cluj-Napoca]

Alexandrina Rogozan

Fonction : Auteur

Institut national des sciences appliquées Rouen Normandie

Fawzi Nashashibi

Fonction : Auteur
PersonId : 20861
IdHAL : fawzi-nashashibi
ORCID : 0000-0002-4209-1233
IdRef : 079565948

Robotics & Intelligent Transportation Systems

Abdelaziz Bensrhair

Fonction : Auteur

Institut national des sciences appliquées Rouen Normandie

Résumé

In spite of the large amount of existent methods, pedestrian detection is still an open challenge. In recent years, deep learning classification methods combined with multi-modality images within different fusion schemes achieved the best performance. It was proven that late-fusion scheme out-performs both direct and intermediate integration of modalities for pedestrian recognition. Hence, in this paper, we focus on improving the late-fusion scheme for pedestrian classification on the Daimler stereo vision data set. Each image modality among Intensity, Depth and Flow, is classified by an independent Convolution Neural Network (CNN). The CNN outputs are then fused by a Multi-layer Perceptron (MLP) before the recognition decision. We propose different methods based on Cross-Modality deep learning of CNNs: (1) a correlated model where a unique CNN is learned with Intensity, Depth and respectively Flow images for each frame, (2) an incremental model where a CNN is learned with the first modality images frames, then a second CNN, initialized by transfer learning on the first CNN, is learned on the second modality images frames, and finally a third CNN initialized on the second CNN, is learned on the last modality images frames. The experiments show that the incremental cross-modality deep learning of CNNs allows the improvement of classification performances not only for each independent modality classifier, but also for the multi-modality classifier based on late-fusion. Different learning algorithms were also investigated.

Domaines

Intelligence artificielle [cs.AI] Vision par ordinateur et reconnaissance de formes [cs.CV] Apprentissage [cs.LG]

Fichier principal

IV.pdf (433.45 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Danut Ovidiu Pop : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01501711

Soumis le : jeudi 8 juin 2017-17:10:53

Dernière modification le : vendredi 22 décembre 2023-15:16:05

Archivage à long terme le : samedi 9 septembre 2017-13:22:31

Dates et versions

hal-01501711 , version 1 (04-04-2017)

hal-01501711 , version 2 (05-04-2017)

hal-01501711 , version 3 (08-06-2017)

Identifiants

HAL Id : hal-01501711 , version 3
DOI : 10.1109/IVS.2017.7995771

Citer

Danut Ovidiu Pop, Alexandrina Rogozan, Fawzi Nashashibi, Abdelaziz Bensrhair. Incremental Cross-Modality Deep Learning for Pedestrian Recognition. IV'17 - IEEE Intelligent Vehicles Symposium , Jun 2017, Redondo Beach, CA, United States. ⟨10.1109/IVS.2017.7995771⟩. ⟨hal-01501711v3⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRIA INSA-ROUEN COMUE-NORMANDIE INRIA2 LMI-ROUEN INSA-GROUPE

323 Consultations

485 Téléchargements

Incremental Cross-Modality Deep Learning for Pedestrian Recognition

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager