Collecting Valuable Information from Fast Text Streams
It has become a challenging work to collect valuable information from fast text streams.In this work, we propose a method which gains useful information effectively and efficiently.Firstly, we maintain an analyzer based on the Trie structure and the dynamic N-Gram tokenizer;secondly, unlike the traditional search engine principle, we consider the documents as a query by building the indexes for the whole query base.The experimental results show that it has the strong adaption ability, low latency and high quality support for the complex query combination compared with the conventional methods.
Fast Text Stream Information Collection Trie N-Gram
Baoyuan Qi Gang Ma Zhongzhi Shi Wei Wang
Key Lab of Intelligent Information Processing, Institute of Computing Technology,CAS, Beijing 100190 Key Lab of Intelligent Information Processing, Institute of Computing Technology,CAS, Beijing 100190 Beijing Lexo Technologies Co., Ltd.Beijing 100080, China
国际会议
8th International Conference on Intelligent Information Processing(2014年IFIP智能信息处理国际会议)
杭州
英文
96-105
2014-10-01(万方平台首次上网日期,不代表论文的发表时间)