基于图划分的领域本体RDF存储方法A domain ontology RDF storage method based on graph partitioning
王红,王雪君,杨蓉
摘要(Abstract):
针对海量RDF图数据分布式存储无法有效保持数据语义结构完整性的问题,提出一种基于标签传播和标签能量函数的多级图划分方法。该方法首先对领域本体解析所得的RDF图进行顶点ID标识,并为实例数据的主语分配初始标签;利用标签传播方法对各顶点进行标签设置形成语义结构相似的顶点集合;在此基础上通过多级图粗化和标签能量函数限制顶点集合的大小实现数据的语义分区;将该方法应用于民航突发事件领域本体的分布式存储与查询。采用边割率对领域本体数据的分区效果进行了分析与比较,实验表明该方法在减少边割率的基础上,保证了查全率并同时提高了民航突发事件相似案例的查询效率,为大规模领域本体的分布式存储与语义查询提供了进一步的方法支持。
关键词(KeyWords): 标签传播;图划分;领域本体;分布式存储;民航突发事件;相似案例
基金项目(Foundation): 国家自然科学基金资助项目:基于跨媒体网络大数据的民航突发事件应急决策语义服务关键技术研究(U1633110)~~
作者(Author): 王红,王雪君,杨蓉
DOI: 10.16652/j.issn.1004-373x.2018.24.035
参考文献(References):
- [1] BOZSAK E,EHRIG M,HANDSCHUH S,et al. KAON:towards a large scale semantic web[C]//Proceedings of International Conference on Electronic Commerce and Web Technologies. Berlin:Springer-Verlag,2002:304-313.
- [2] RDF Working Group. Resource description framework(RDF)[EB/OL].[2014-02-25]. http://www.w3.org/RDF/.
- [3]崔义童,冯志勇,王鑫,等.基于图聚类算法的大规模RDF数据查询方法研究[J].小型微型计算机系统,2015,36(12):2625-2628.CUI Yitong,FENG Zhiyong,WANG Xin,et al. Research on large-scale RDF data query method based on graph clustering[J]. Journal of Chinese computer systems,2015,36(12):2625-2628.
- [4] MALEWICZ G,AUSTERN M H,BIK A J C,et al. Pregel:a system for large-scale graph processing[C]//Proceedings of ACM SIGMOD International Conference on Management of Data. Indianapolis:ACM,2010:135-146.
- [5] HUANG J,ABADI D J,REN K. Scalable SPARQL querying of large RDF graphs[J]. Proceedings of the Vldb Endowment,2012,4(11):1123-1134.
- [6] Karypis Lab. METIS:serial graph partitioning and fill-reducing Matrix ordering[EB/OL].[2016-10-30]. http://glaros.dtc.umn.edu/gkhome/views/metis.
- [7] WANG L,XIAO Y,SHAO B,et al. How to partition a billionnode graph[C]//Proceedings of IEEE 30th International Conference on Data Engineering. Chicago:IEEE,2014:568-579.
- [8]王红,张青青,蔡伟伟,等.基于Neo4j的领域本体存储方法研究[J].计算机应用研究,2017,34(8):2404-2407.WANG Hong, ZHANG Qingqing, CAI Weiwei, et al. Research on storage method for domain ontology based on Neo4j[J]. Application research of computers,2017,34(8):2404-2407.
- [9]王红,杨璇,王静,等.基于本体的民航应急决策知识表达与推理方法研究[J].计算机工程与科学,2011,33(4):129-133.WANG Hong,YANG Xuan,WANG Jing,et al. Research on ontology-based knowledge presentation and reasoning in civil aviation emergency decision[J]. Computer engineering&science,2011,33(4):129-133.
- [10] PENG P,ZOU L,CHEN L,et al. Processing SPARQL queries over distributed RDF graphs[J]. International journal on very large data bases,2016,25(2):243-268.
- [11] ZOU L,?ZSU M T,CHEN L,et al. gStore:a graph-based SPARQL query engine[J]. International journal on very large data bases,2014,23(4):565-590.