A new attribute selection method of ID3 algorithm for decision tree
WANG Zijing, LIU Yu
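Abstract:
To address the multi-valued attribute bias and the heavy computation of the traditional ID3 algorithm, rough set theory is introduced and a consistency degree of condition attributes is defined. The consistency degree of an attribute is used as the criterion for splitting the data set when constructing the decision tree, which avoids both the logarithm computations and the multi-valued attribute bias of traditional ID3. Simulation experiments on three UCI public data sets show that the proposed attribute selection method achieves higher prediction accuracy.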
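The idea of splitting on a rough-set measure instead of information gain can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the paper's exact definition of the consistency degree, so the sketch substitutes the standard rough-set dependency degree (the fraction of samples lying in the positive region of the decision attribute with respect to one condition attribute). The paper's definition may differ, in particular in how it counters the multi-valued attribute bias. The function names and the toy data are hypothetical.

```python
from collections import defaultdict

def consistency_degree(rows, attr, label):
    """Assumed stand-in measure: share of rows whose value of `attr`
    maps to exactly one value of `label` (rough-set positive region)."""
    labels_by_value = defaultdict(set)
    count_by_value = defaultdict(int)
    for row in rows:
        labels_by_value[row[attr]].add(row[label])
        count_by_value[row[attr]] += 1
    # Count only the blocks of the partition that are pure in the label.
    consistent = sum(
        n for value, n in count_by_value.items()
        if len(labels_by_value[value]) == 1
    )
    return consistent / len(rows)

def choose_split(rows, attrs, label):
    """Pick the attribute with the largest measure; no logarithms involved."""
    return max(attrs, key=lambda a: consistency_degree(rows, a, label))

# Hypothetical toy data, not taken from the paper's experiments.
rows = [
    {"outlook": "sunny",    "windy": "no",  "play": "no"},
    {"outlook": "sunny",    "windy": "yes", "play": "no"},
    {"outlook": "overcast", "windy": "no",  "play": "yes"},
    {"outlook": "rain",     "windy": "no",  "play": "yes"},
    {"outlook": "rain",     "windy": "yes", "play": "no"},
]

best = choose_split(rows, ["outlook", "windy"], "play")
```

On this toy data, `outlook` is pure for three of the five rows and `windy` for two, so `choose_split` selects `outlook`. The measure needs only counting and set membership, which is the kind of saving over entropy-based selection that the abstract describes.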
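Keywords: data mining; decision tree; rough set; ID3 algorithm; big data; algorithm improvement
Foundation: Shaanxi Provincial Industrial Science and Technology Research Project (2016GY-113)
Authors: WANG Zijing, LIU Yu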
DOI: 10.16652/j.issn.1004-373x.2018.23.003