Conference Topic

Automatic Facet Extraction based on Multidimensional Semantic Index

  Faceted search on webpages requires accurate facets. However, facets are difficult to extract accurately because webpages are unstructured and lack explicit facet information, which makes facet extraction a key problem for faceted search. This paper proposes a method for automatically extracting facets from unstructured webpages to improve faceted search on the web. A Multidimensional Semantic Index (MDSI) of webpages is constructed by mining the various semantic relations among the words in the webpages, yielding a semantically rich index. Within the MDSI, the semantic indexes of different dimensions are bridged by mining the semantic mappings between them. Facets are then extracted by analyzing these semantic mapping relations. To validate the proposed method, two datasets are constructed; the experimental results show that the method is feasible and comparatively precise.

faceted search; multidimensional semantic index; semantic mapping; facet extraction

Xiao Wei; Xiangfeng Luo; Qing Li

School of Computing Engineering and Science, High Performance Computing Center, Shanghai University; School of Computing Engineering and Science, High Performance Computing Center, Shanghai University; Department of Computer Science, City University of Hong Kong, Hong Kong

International conference

2012 Eighth International Conference on Semantics, Knowledge and Grids (SKG2012)

Beijing

English

64-71

2012-10-22 (date first made available online on the Wanfang platform; not the paper's publication date)