Conference papers

Extraction of Web Image Information: Semantic or Visual Cues?

Abstract : Text-based approaches to web image information retrieval have been exploited for many years; however, the noisy textual content of web pages makes the task challenging. Moreover, text-based systems that retrieve information from textual sources such as image file names, anchor texts, existing keywords and, of course, surrounding text often share the same weakness: they cannot correctly assign all relevant text to an image and discard the irrelevant text. This paper discusses a novel method for indexing web images. The main concern of the proposed system is to overcome the obstacle of correctly assigning textual information to web images while disregarding text that is unrelated to them. The proposed system uses visual cues to cluster a web page into several regions, and compares this method with the use of semantic information and the realization of a k-means clustering. The evaluation reveals the advantages and disadvantages of the different clustering techniques and confirms the validity of the proposed method for web image indexing.
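As a rough illustration of the region-clustering idea in the abstract, the sketch below groups the blocks of a page around its images with a plain k-means loop. It is not the authors' implementation: the abstract does not state which features or parameters are used, so the block names, the pixel coordinates, the choice of block centres as features and the seeding of one centroid per image are all assumptions made for this example.

```python
from math import dist

def kmeans(points, centroids, iterations=20):
    """Plain Lloyd's k-means on 2-D points; returns (labels, centroids)."""
    centroids = [tuple(c) for c in centroids]
    labels = []
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda k: dist(p, centroids[k]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        updated = []
        for k, old in enumerate(centroids):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                updated.append(tuple(sum(axis) / len(members) for axis in zip(*members)))
            else:
                updated.append(old)  # keep an empty cluster's centroid in place
        if updated == centroids:
            break  # converged: no centroid moved
        centroids = updated
    return labels, centroids

# Hypothetical page layout: (x, y) centres of rendered blocks, in pixels.
blocks = [
    ("img:beach.jpg", (100, 120)),
    ("text:Sunset at the beach", (100, 200)),
    ("text:Photo taken in July", (110, 230)),
    ("img:banner.png", (700, 50)),
    ("text:Subscribe to our newsletter", (710, 90)),
]

points = [pos for _, pos in blocks]
# Assumption: seed one centroid per image so each region is anchored to an image.
seeds = [pos for name, pos in blocks if name.startswith("img:")]
labels, _ = kmeans(points, seeds)

# Collect the blocks that fall into each region.
regions = {}
for (name, _), lab in zip(blocks, labels):
    regions.setdefault(lab, []).append(name)
```

With this toy layout, the two captions end up in the same region as the beach image and the newsletter text stays with the banner, which is the kind of text-to-image assignment the abstract is concerned with. Real pages would need richer features than block centres.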
Document type : Conference papers

Cited literature [9 references]

Contributor : Hal Ifip
Submitted on : Thursday, May 11, 2017 - 5:10:37 PM
Last modification on : Thursday, March 5, 2020 - 5:41:39 PM
Long-term archiving on : Saturday, August 12, 2017 - 2:07:30 PM


Files produced by the author(s)


Distributed under a Creative Commons Attribution 4.0 International License



Georgina Tryfou, Nicolas Tsapatsoulis. Extraction of Web Image Information: Semantic or Visual Cues?. 8th International Conference on Artificial Intelligence Applications and Innovations (AIAI), Sep 2012, Halkidiki, Greece. pp.368-373, ⟨10.1007/978-3-642-33409-2_38⟩. ⟨hal-01521418⟩


