Skip to Main content Skip to Navigation
Conference papers

Extraction of Web Image Information: Semantic or Visual Cues?

Abstract : Text based approaches for web image information retrieval have been exploited for many years, however the noisy textual content of the web pages makes their task challenging. Moreover, text based systems that retrieve information from textual sources such as image file names, anchor texts, existing keywords and, of course, surrounding text often share the inability to correctly assign all relevant text to an image and discard the irrelevant. A novel method for indexing web images is discussed in the present paper. The main concern of the proposed system is to overcome the obstacle of correctly assigning textual information to web images, while disregarding text that is unrelated to them. The proposed system uses visual cues in order to cluster a web page into several regions and compares this method to the use of semantic information and the realization of a k-means clustering. The evaluation reveals the advantages and disadvantages of the different clustering techniques and confirms the validity of the proposed method for web image indexing.
Document type :
Conference papers
Complete list of metadata

Cited literature [9 references]  Display  Hide  Download

https://hal.inria.fr/hal-01521418
Contributor : Hal Ifip <>
Submitted on : Thursday, May 11, 2017 - 5:10:37 PM
Last modification on : Thursday, March 5, 2020 - 5:41:39 PM
Long-term archiving on: : Saturday, August 12, 2017 - 2:07:30 PM

File

978-3-642-33409-2_38_Chapter.p...
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Georgina Tryfou, Nicolas Tsapatsoulis. Extraction of Web Image Information: Semantic or Visual Cues?. 8th International Conference on Artificial Intelligence Applications and Innovations (AIAI), Sep 2012, Halkidiki, Greece. pp.368-373, ⟨10.1007/978-3-642-33409-2_38⟩. ⟨hal-01521418⟩

Share

Metrics

Record views

162

Files downloads

204