Learning Grammars for Architecture-Specific Facade Parsing

Raghudeep Gadde 1, 2, 3 Renaud Marlet 2, 3 Nikos Paragios 1, 4
2 IMAGINE [Marne-la-Vallée]
LIGM - Laboratoire d'Informatique Gaspard-Monge, CSTB - Centre Scientifique et Technique du Bâtiment, ENPC - École des Ponts ParisTech
Abstract : Parsing facade images requires optimal handcrafted grammar for a given class of buildings. Such a handcrafted grammar is often designed manually by experts. In this paper, we present a novel framework to learn a compact grammar from a set of ground-truth images. To this end, parse trees of ground-truth annotated images are obtained running existing inference algorithms with a simple, very general grammar. From these parse trees, repeated subtrees are sought and merged together to share derivations and produce a grammar with fewer rules. Furthermore, unsupervised clustering is performed on these rules, so that, rules corresponding to the same complex pattern are grouped together leading to a rich compact grammar. Experimental validation and comparison with the state-of-the-art grammar-based methods on four diff erent datasets show that the learned grammar helps in much faster convergence while producing equal or more accurate parsing results compared to handcrafted grammars as well as grammars learned by other methods. Besides, we release a new dataset of facade images from Paris following the Art-deco style and demonstrate the general applicability and extreme potential of the proposed framework.
Liste complète des métadonnées

Cited literature [71 references]  Display  Hide  Download

https://hal.inria.fr/hal-01069379
Contributor : Raghudeep Gadde <>
Submitted on : Friday, October 2, 2015 - 5:34:53 PM
Last modification on : Tuesday, February 26, 2019 - 10:14:09 AM
Document(s) archivé(s) le : Sunday, January 3, 2016 - 10:56:26 AM

File

paper2.pdf
Files produced by the author(s)

Identifiers

Citation

Raghudeep Gadde, Renaud Marlet, Nikos Paragios. Learning Grammars for Architecture-Specific Facade Parsing. International Journal of Computer Vision, Springer Verlag, 2015, 117 (3), pp.290-316. ⟨https://link.springer.com/article/10.1007/s11263-016-0887-4⟩. ⟨10.1007/s11263-016-0887-4⟩. ⟨hal-01069379v2⟩

Share

Metrics

Record views

762

Files downloads

720