Service interruption on Monday 11 July from 12:30 to 13:00: all the sites of the CCSD (HAL, EpiSciences, SciencesConf, AureHAL) will be inaccessible (network hardware connection).
Abstract : It has become a challenging work to collect valuable information from fast text streams. In this work, we propose a method which gains useful information effectively and efficiently. Firstly, we maintain an analyzer based on the Trie structure and the dynamic N-Gram tokenizer; secondly, unlike the traditional search engine principle, we consider the documents as a query by building the indexes for the whole query base. The experimental results show that it has the strong adaption ability, low latency and high quality support for the complex query combination compared with the conventional methods.
https://hal.inria.fr/hal-01383321 Contributor : Hal IfipConnect in order to contact the contributor Submitted on : Tuesday, October 18, 2016 - 2:53:44 PM Last modification on : Thursday, March 5, 2020 - 5:41:03 PM
Baoyuan Qi, Gang Ma, Zhongzhi Shi, Wei Wang. Collecting Valuable Information from Fast Text Streams. 8th International Conference on Intelligent Information Processing (IIP), Oct 2014, Hangzhou, China. pp.96-105, ⟨10.1007/978-3-662-44980-6_11⟩. ⟨hal-01383321⟩