A generic structure-from-motion algorithm for cross-camera scenarios

Srikumar Ramalingam 1 Suresh K. Lodha 1 Peter Sturm 2
2 MOVI - Modeling, localization, recognition and interpretation in computer vision
GRAVIR - IMAG - Graphisme, Vision et Robotique, Inria Grenoble - Rhône-Alpes, CNRS - Centre National de la Recherche Scientifique : FR71
Abstract : We introduce a generic structure-from-motion approach based on a previously introduced, highly general imaging model, where cameras are modeled as possibly unconstrained sets of projection rays. This allows to describe most existing camera types (at least for those operating in the visible domain), including pinhole cameras, sensors with radial or more general distortions, catadioptric cameras (central or non-central), etc. We introduce a structure-from-motion approach for this general imaging model, that allows to reconstruct scenes from calibrated images, possibly taken by cameras of different types (cross-camera scenarios). Structure-from-motion is naturally handled via camera independent ray intersection problems, solved via linear or simple polynomial equations. We also propose two approaches for obtaining optimal solutions using bundle adjustment, where camera motion, calibration and 3D point coordinates are refined simultaneously. One is relatively straightforward, minimizing distances between 3D points and projection rays. The other minimizes reprojection error; the general imaging model does not provide analytical expressions for the reprojection error and its derivatives, which are desirable for efficient optimization. To achieve this, we propose to approximate the set of projection rays of a general non-central camera by several clusters of central rays, allowing us to formulate an analytical cost function. % that can be minimized using non-linear iteration. We present results for two cross-camera scenarios -- a pinhole used together with an omnidirectional camera and a stereo system used with an omnidirectional camera. Using ground-truth and 3D reconstruction results from classical techniques, we show that our generic algorithm is simple, general and accurate for extensions to various cross-camera and multi-camera scenarios.
Type de document :
Communication dans un congrès
Peter Sturm and Tomas Svoboda and Seth Teller. Workshop on Omnidirectional Vision, Camera Networks and Non-classical Cameras, May 2004, Prague, Czech Republic. 103, pp.175-186, 2004, Computer Vision and Image Understanding. 〈http://cmp.felk.cvut.cz/events/omnivis2004/ramalingam/ramalingam.html〉
Liste complète des métadonnées

https://hal.inria.fr/inria-00524412
Contributeur : Peter Sturm <>
Soumis le : mercredi 25 mai 2011 - 14:40:52
Dernière modification le : jeudi 11 janvier 2018 - 06:20:04
Document(s) archivé(s) le : vendredi 26 août 2011 - 02:21:57

Fichier

RamalingamLodhaSturm-omnivis04...
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00524412, version 1

Collections

INRIA | IMAG | UGA

Citation

Srikumar Ramalingam, Suresh K. Lodha, Peter Sturm. A generic structure-from-motion algorithm for cross-camera scenarios. Peter Sturm and Tomas Svoboda and Seth Teller. Workshop on Omnidirectional Vision, Camera Networks and Non-classical Cameras, May 2004, Prague, Czech Republic. 103, pp.175-186, 2004, Computer Vision and Image Understanding. 〈http://cmp.felk.cvut.cz/events/omnivis2004/ramalingam/ramalingam.html〉. 〈inria-00524412〉

Partager

Métriques

Consultations de la notice

252

Téléchargements de fichiers

172