基于复杂句式短文本情感分类研究Research on sentiment classification for complex sentence patterns of short text
李毅捷,段利国,李爱萍
摘要(Abstract):
目前,网络文本中主观内容的情感倾向性识别成为文本信息处理的研究热点。针对汉语中复杂句式的结构特点以及对多种复杂句式的有效分析,基于word2vec进行情感词典的扩建,将扩充后的情感词典、关联词表、否定词表进行特征提取,得到有效的特征词序列,构建新的复杂句式模型并结合SVM进行训练和预测,完成复杂句式情感分类。实验结果表明,提出的复杂句式情感分类模型在处理精度方面比传统的句子级情感分类方法有了明显的提高,获得良好的情感分析效果。
关键词(KeyWords): 文本信息处理;情感分析;复杂句式;word2vec;情感分类模型;SVM
基金项目(Foundation): 武汉大学软件工程国家重点实验室开放课题(SKLSE2012-09-30);; 山西省自然科学基金资助项目(2013011015-2)~~
作者(Author): 李毅捷,段利国,李爱萍
DOI: 10.16652/j.issn.1004-373x.2018.22.045
参考文献(References):
- [1]赵妍妍,秦兵,刘挺.文本情感分析[J].软件学报,2010,21(8):1834-1848.ZHAO Yanyan,QIN Bing,LIN Ting. Text sentiment analysis[J]. Journal of software,2010,21(8):1834-1848.
- [2] PANG B,LEE L,VAITHYANATHAN S. Thumbs up? sentiment classification using machine learning techniques[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing. Philadelphia:Association for Computational Linguistics,2002:79-86.
- [3]吴晓吟.中文复杂句型的情感分析研究[EB/OL].[2013-03-15].http://www.doc88.com/p-1738770331623.html.WU Xiaoyin. Sentiment analysis of complex sentences for Chinese document[EB/OL].[2013-03-15]. http://www.doc88.com/p-1738770331623.html.
- [4]杨富平,黄志勇.基于SVM和复杂句式的中文微博情感分析[EB/OL].[2016-01-12].http://www.doc88.com/p-3317610703317.html.YANG Fuping,HUANG Zhiyong. Chinese micro-blog sentiment classification based on SVM and complex phrasing[EB/OL].[2016-01-12]. http://www.doc88.com/p-3317610703317.html.
- [5]宋锐,林鸿飞,常富洋.中文比较句识别及比较关系抽取[J].中文信息学报,2009,23(2):102-107.SONG Rui,LIN Hongfei,CHANG Fuyang. Chinese comparative sentences identification and comparative relations extraction[J]. Journal of Chinese information processing,2009,23(2):102-107.
- [6] NARAYANAN R,LIU B,CHOUDHARY A. Sentiment analysis of conditional sentences[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing. Singapore:Association for Computational Linguistics,2009:180-189.
- [7]李爱萍,邸鹏,段利国.基于句子情感加权算法的篇章情感分析[J].小型微型计算机系统,2015,36(10):2252-2256.LI Aiping,DI Peng,DUAN Liguo. Document sentiment orientation analysis based on sentence weighted algorithm[J]. Journal of Chinese computer systems,2015,36(10):2252-2256.
- [8] BACCIANELLA S,ESULI A,SEBASTIANI F. SentiWordNet3.0:an enhanced lexical resource for sentiment analysis and opinion mining[C]//Proceedings of the International Conference on Language Resources and Evaluation. Valletta:European Language Resources Association,2010:2200-2204.
- [9] LILLEBERG J,ZHU Y,ZHANG Y. Support vector machines and word2vec for text classification with semantic features[C]//Proceedings of 14th International Conference on Cognitive Informatics&Cognitive Computing. Beijing:IEEE,2015:136-140.
- [10]江敏,肖诗斌,王弘蔚,等.一种改进的基于《知网》的词语语义相似度计算[J].中文信息学报,2008,22(5):84-89.JIANG Min,XIAO Shibin,WANG Hongwei,et al. An improved word similarity computing method based on HowNet[J]. Journal of Chinese information processing,2008,22(5):84-89.
- [11]邸鹏,段利国.基于复杂句式的文本情感倾向性分析[J].计算机应用与软件,2015,32(11):57-61.DI Peng, DUAN Liguo. Text sentiment polarity analysis based on complex sentences[J]. Computer applications and software,2015,32(11):57-61.