Design of association mining algorithm for large-scale database
HUANG Yu
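Abstract:
To address the low accuracy and poor efficiency of association mining in large databases, an association mining algorithm for large databases based on the Bayesian information criterion (BIC) scoring function is proposed and designed. After the associated data of the large database are acquired, the BIC scoring function is used to preprocess the data and the preprocessing flow is given. The new association rules required for mining are then established, and association mining of the large database is carried out according to these rules. Experimental results show that the improved mining algorithm reaches a mining accuracy of 91.3%, about 35.9% higher than the traditional mining algorithm, which gives it a certain advantage.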
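The abstract does not give the scoring formula, so the following is only a minimal sketch of how a BIC score can indicate whether two attributes are associated. It assumes the standard "higher is better" form, ln L - (k/2) ln n, and compares a joint model of two categorical columns against an independence model. The function names (`bic_score`, `log_likelihood_from_counts`, `association_bic_gain`) are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def bic_score(log_likelihood, num_params, num_samples):
    """Standard BIC in 'higher is better' form: ln L - (k / 2) * ln n."""
    return log_likelihood - 0.5 * num_params * math.log(num_samples)


def log_likelihood_from_counts(counts, n):
    """Maximised log-likelihood of a categorical distribution given its counts."""
    return sum(c * math.log(c / n) for c in counts if c > 0)


def association_bic_gain(pairs):
    """BIC(joint model) - BIC(independence model) for two categorical columns.

    A positive gain suggests the two attributes are associated.
    This is an illustrative assumption, not the paper's procedure.
    """
    n = len(pairs)
    joint = Counter(pairs)
    left = Counter(a for a, _ in pairs)
    right = Counter(b for _, b in pairs)
    r, c = len(left), len(right)

    ll_joint = log_likelihood_from_counts(joint.values(), n)
    ll_indep = (log_likelihood_from_counts(left.values(), n)
                + log_likelihood_from_counts(right.values(), n))

    # Free parameters: r*c - 1 for the joint table, (r-1) + (c-1) for independence.
    bic_joint = bic_score(ll_joint, r * c - 1, n)
    bic_indep = bic_score(ll_indep, (r - 1) + (c - 1), n)
    return bic_joint - bic_indep


# Toy data: one perfectly associated pair of columns, one independent pair.
dependent = [("x", "p")] * 50 + [("y", "q")] * 50
independent = ([("x", "p")] * 25 + [("x", "q")] * 25
               + [("y", "p")] * 25 + [("y", "q")] * 25)
```

On the toy data, `association_bic_gain(dependent)` is positive and `association_bic_gain(independent)` is negative, because the BIC penalty term outweighs the extra parameter of the joint model when the columns carry no shared information.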
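Keywords: large database; association rule; mining algorithm; association mining; scoring function; data preprocessing
Foundation: University Scientific Research Project of the Guangxi Department of Education (KY2015YB314)
Author: HUANG Yu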
DOI: 10.16652/j.issn.1004-373x.2018.20.011