Conference Topic

Collecting Valuable Information from Fast Text Streams

  Collecting valuable information from fast text streams has become a challenging task. In this work, we propose a method that gathers useful information effectively and efficiently. First, we maintain an analyzer based on the Trie structure and a dynamic N-Gram tokenizer; second, unlike the traditional search-engine principle, we treat documents as queries by building indexes over the whole query base. The experimental results show that, compared with conventional methods, the method has strong adaptability, low latency, and high-quality support for complex query combinations.
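The two ideas named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, which the abstract does not describe in detail. The names `TermTrie`, `QueryIndex`, `register`, and `match` are hypothetical, and the sketch assumes that every query is a plain conjunction (AND) of terms. The trie is scanned from every character offset, so the matched gram length varies with the registered terms rather than being fixed, and the index is built over the queries so that each incoming document is looked up against them.

```python
from collections import defaultdict

_END = object()  # end-of-term marker; cannot collide with any character key


class TermTrie:
    """Character trie holding every term used by any registered query."""

    def __init__(self):
        self.root = {}

    def add(self, term):
        node = self.root
        for ch in term:
            node = node.setdefault(ch, {})
        node[_END] = term

    def scan(self, text):
        """Return the set of known terms that occur anywhere in text."""
        found = set()
        for start in range(len(text)):
            node = self.root
            # Extend the gram one character at a time until the trie ends.
            for i in range(start, len(text)):
                node = node.get(text[i])
                if node is None:
                    break
                if _END in node:
                    found.add(node[_END])
        return found


class QueryIndex:
    """Index over the query base; documents are matched against it."""

    def __init__(self):
        self.trie = TermTrie()
        self.queries = {}  # query id -> frozenset of required terms
        self.term_to_queries = defaultdict(set)

    def register(self, query_id, terms):
        # Assumption: a query matches only if all of its terms are present.
        terms = frozenset(t.lower() for t in terms)
        self.queries[query_id] = terms
        for term in terms:
            self.trie.add(term)
            self.term_to_queries[term].add(query_id)

    def match(self, document):
        found = self.trie.scan(document.lower())
        candidates = set()
        for term in found:
            candidates |= self.term_to_queries[term]
        return sorted(q for q in candidates if self.queries[q] <= found)


index = QueryIndex()
index.register("q1", ["trie", "n-gram"])
index.register("q2", ["stream"])
index.register("q3", ["trie", "latency"])
print(index.match("A Trie plus a dynamic N-Gram tokenizer for text streams"))
# prints ['q1', 'q2']
```

In this sketch the per-document cost depends on the document length and the depth of the trie, not on the number of registered queries, which is one plausible reading of the low-latency claim.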

Fast Text Stream; Information Collection; Trie; N-Gram

Baoyuan Qi, Gang Ma, Zhongzhi Shi, Wei Wang

Key Lab of Intelligent Information Processing, Institute of Computing Technology, CAS, Beijing 100190; Beijing Lexo Technologies Co., Ltd., Beijing 100080, China

International conference

8th International Conference on Intelligent Information Processing (2014 IFIP International Conference on Intelligent Information Processing)

Hangzhou

English

96-105

2014-10-01 (date first posted online on the Wanfang platform; not the paper's publication date)