Conference Topic

A Combined Index for Mixed Structured and Unstructured Data

  In the big data era, one of the major challenges is the large volume of mixed structured and unstructured data, which comes from heterogeneous sources. Because of their different forms, structured and unstructured data are often treated separately. However, they may describe the same real-world entities. If a query involves both structured data and its unstructured counterpart, it is inefficient to execute it separately against each. This paper presents a novel index structure tailored to combinations of structured and unstructured data. The combined index is a joint index over a structured database and unstructured documents, based on entity co-occurrences. It is also a semantic index that describes the semantic relationships between entities and their multiple resources. We store the index as RDF graphs, and queries are SPARQL-like. Experiments show that the combined index not only provides relevant information but also executes queries efficiently.
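The abstract does not give the index layout, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a hypothetical toy table and document set, detects entity mentions by naive substring matching, stores entity-to-resource links as plain (subject, predicate, object) tuples standing in for RDF triples, and answers wildcard patterns loosely in the spirit of a SPARQL triple pattern. The names ROWS, DOCS, build_index, match, hasRecord, and mentionedIn are all illustrative assumptions.

```python
# Hypothetical toy data: the paper's schema and corpus are not given in the abstract.
ROWS = [
    {"table": "employee", "id": "e1", "name": "Alice"},
    {"table": "employee", "id": "e2", "name": "Bob"},
]
DOCS = {
    "doc1": "Alice presented the quarterly report.",
    "doc2": "Bob and Alice reviewed the design.",
}


def build_index(rows, docs):
    """Return a set of (subject, predicate, object) triples linking each
    entity to its database row and to the documents that mention it."""
    triples = set()
    for row in rows:
        entity = "entity:" + row["name"]
        triples.add((entity, "hasRecord", f'{row["table"]}/{row["id"]}'))
        for doc_id, text in docs.items():
            # Assumption: naive mention detection by case-insensitive substring match.
            if row["name"].lower() in text.lower():
                triples.add((entity, "mentionedIn", doc_id))
    return triples


def match(triples, s=None, p=None, o=None):
    """Return sorted triples matching a pattern; None acts as a wildcard,
    loosely like a variable in a SPARQL triple pattern."""
    return sorted(
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )


index = build_index(ROWS, DOCS)
# One lookup returns both kinds of resources for an entity.
alice_docs = [o for _, _, o in match(index, s="entity:Alice", p="mentionedIn")]
alice_rows = [o for _, _, o in match(index, s="entity:Alice", p="hasRecord")]
```

In this sketch a single pattern lookup reaches both the database row and the documents for an entity, which is the kind of joint access the abstract motivates; a real system would presumably use an RDF store and proper entity recognition.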

index; structured data; unstructured data; semantic index

Chunying Zhu, Qingzhong Li, Lanju Kong, Song Wei

Computer Science and Technology, Shandong University, Jinan, China; Shandong Hoteam Software Co., Ltd, Jinan, China

International Conference

The 12th Web Information System and Application Conference (WISA2015), the 10th National Symposium on Semantic Web and Ontology (SWON2015), and the 9th National Symposium on E-Government Technology and Applications (EGTA2015)

Jinan

English

217-222

2015-09-11 (date first made available online on the Wanfang platform; not the paper's publication date)