[ "徐康庭(2000- ),男,北方工业大学信息学院在读,主要研究方向为机器学习、自然语言处理。" ]
[ "宋威(1980- ),男,博士,北方工业大学信息学院教授、博士生导师,主要研究方向为数据挖掘、推荐系统。" ]
网络首发:2022-05,
纸质出版:2022-05-15
移动端阅览
徐康庭, 宋威. 结合语言知识和深度学习的中文文本情感分析方法[J]. 大数据, 2022,8(3):115-127.
Kangting XU, Wei Song. A Chinese text sentiment analysis method combining language knowledge and deep learning[J]. Big data research, 2022, 8(3): 115-127.
徐康庭, 宋威. 结合语言知识和深度学习的中文文本情感分析方法[J]. 大数据, 2022,8(3):115-127. DOI: 10.11959/j.issn.2096-0271.2022026.
Kangting XU, Wei Song. A Chinese text sentiment analysis method combining language knowledge and deep learning[J]. Big data research, 2022, 8(3): 115-127. DOI: 10.11959/j.issn.2096-0271.2022026.
在目前的中文文本情感分析研究中,基于语义规则和情感词典的方法通常需要人工设置情感阈值;而基于深度学习的方法由于未能运用语义规则和情感词典等语言知识,不能充分提取情感特征。针对这两种方法的缺点,提出了一种将语言知识和深度学习结合的文本情感分析方法。该方法首先根据语义规则提取文本中的关键情感片段,再根据情感词典从关键情感片段中抽取出情感更加明确的情感词来构建情感集合,然后利用深度学习模型分别从原始文本、关键情感片段、情感集合中抽取深层次特征,最后对提取的特征进行加权融合,并利用分类器实现情感极性的判断。实验结果表明,与未引入语言知识的深度学习模型相比,该方法的情感极性分类能力有明显提升。
At present
in the research of Chinese text emotion analysis
the method based on semantic rules and emotion dictionary usually needs to set the emotional threshold manually.However
the method based on deep learning can’t fully extract emotional features because it fails to use language knowledge such as semantic rules and emotional dictionary.As to shortcomings of two methods
a text emotion analysis method combining language knowledge and deep learning was proposed.Firstly
the key emotional segments in the text were extracted according to the semantic rules.Secondly
more explicit emotion words were extracted from the key emotional segments according to the emotional dictionary to construct the emotion set.Thirdly
the deep level features were extracted from the original text
key emotional segments and emotional set by using the deep learning model.Finally
the features were weighted and fused
and the classifier was used to judge the emotional polarity.The experimental results show that compared with the deep learning model without language knowledge
this method has significantly improved the ability of emotional polarity classification.
HATZIVASSILOGLOU V , MCKEOWN K R . Predicting the semantic orientation of adjectives [C ] // Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics . Stroudsburg:Association for Computational Linguistics , 1997 : 174 - 181 .
WU J S , LU K , SU S Z , et al . Chinese micro-blog sentiment analysis based on multiple sentiment dictionaries and semantic rule sets [J ] . IEEE Access , 2019 , 7 : 183924 - 183939 .
赵妍妍 , 秦兵 , 石秋慧 , 等 . 大规模情感词典的构建及其在情感分类中的应用 [J ] . 中文信息学报 , 2017 , 31 ( 2 ): 187 - 193 .
ZHAO Y Y , QIN B , SHI Q H , et al . Large-scale sentiment lexicon collection and its application in sentiment classification [J ] . Journal of Chinese Information Processing , 2017 , 31 ( 2 ): 187 - 193 .
XU G X , YU Z H , YAO H S , et al . Chinese text sentiment analysis based on extended sentiment dictionary [J ] . IEEE Access , 2019 , 7 : 43749 - 43762 .
KESHAVARZ H , ABADEH M S . ALGA:adaptive lexicon learning using genetic algorithm for sentiment analysis of microblogs [J ] . Knowledge-Based Systems , 2017 , 122 : 1 - 16 .
李继东 , 王移芝 . 基于扩展词典与语义规则的中文微博情感分析 [J ] . 计算机与现代化 , 2018 ( 2 ): 89 - 95 .
LI J D , WANG Y Z . Sentiment analysis of Chinese microblog based on expand-dictionary and semantic rule [J ] . Computer and Modernization , 2018 ( 2 ): 89 - 95 .
ZHANG S X , WEI Z L , WANG Y , et al . Sentiment analysis of Chinese microblog text based on extended sentiment dictionary [J ] . Future Generation Computer Systems , 2018 , 81 : 395 - 403 .
PANG B , LEE L , VAITHYANATHAN S . Thumbs up?:sentiment classification using machine learning techniques [C ] // Proceedings of the ACL-02 Conference on Empirical methods in Natural Language Processing . Stroudsburg:Association for Computational Linguistics , 2002 : 79 - 86 .
苏莹 , 张勇 , 胡珀 , 等 . 基于朴素贝叶斯与潜在狄利克雷分布相结合的情感分析 [J ] . 计算机应用 , 2016 , 36 ( 6 ): 1613 - 1618 .
SU Y , ZHANG Y , HU P , et al . Sentiment analysis research based on combination of naive Bayes and latent Dirichlet allocation [J ] . Journal of Computer Applications , 2016 , 36 ( 6 ): 1613 - 1618 .
刘栋军 , 王宇涵 , 凌文芬 , 等 . 基于脑机协同智能的情绪识别 [J ] . 智能科学与技术学报 , 2021 , 3 ( 1 ): 65 - 75 .
LIU D J , WANG Y H , LING W F , et al . Emotion recognition based on brain and machine collaborative intelligence [J ] . Chinese Journal of Intelligent Science and Technology , 2021 , 3 ( 1 ): 65 - 75 .
KIM Y , . Convolutional neural networks for sentence classification [C ] // Proceedings of 2014 Conference on Empirical Methods in Natural Language Processing . Stroudsburg:Association for Computational Linguistics , 2014 : 1746 - 1751 .
胡荣磊 , 芮璐 , 齐筱 , 等 . 基于循环神经网络和注意力模型的文本情感分析 [J ] . 计算机应用研究 , 2019 , 36 ( 11 ): 3282 - 3285 .
HU R L , RUI L , QI X , et al . Text sentiment analysis based on recurrent neural networks and attention model [J ] . Application Research of Computers , 2019 , 36 ( 11 ): 3282 - 3285 .
李洋 , 董红斌 . 基于CNN和BiLSTM网络特征融合的文本情感分析 [J ] . 计算机应用 , 2018 , 38 ( 11 ): 3075 - 3080 .
LI Y , DONG H B . Text sentiment analysis based on feature fusion of convolution neural network and bidirectional long short-term memory network [J ] . Journal of Computer Applications , 2018 , 38 ( 11 ): 3075 - 3080 .
宋婷 , 陈战伟 , 杨海峰 . 基于分层注意力网络的方面情感分析 [J ] . 大数据 , 2020 , 6 ( 5 ): 82 - 91 .
SONG T , CHEN Z W , YANG H F . Aspect sentiment analysis based on a hierarchical attention network [J ] . Big Data Research , 2020 , 6 ( 5 ): 82 - 91 .
谢润忠 , 李烨 . 基于BERT和双通道注意力的文本情感分类模型 [J ] . 数据采集与处理 , 2020 , 35 ( 4 ): 642 - 652 .
XIE R Z , LI Y . Text sentiment classification model based on BERT and dual channel attention [J ] . Journal of Data Acquisition and Processing , 2020 , 35 ( 4 ): 642 - 652 .
邱宁佳 , 王晓霞 , 王鹏 , 等 . 融合语法规则的双通道中文情感模型分析 [J ] . 计算机应用 , 2021 , 41 ( 2 ): 318 - 323 .
QIU N J , WANG X X , WANG P , et al . Analysis of double-channel Chinese sentiment model integrating grammar rules [J ] . Journal of Computer Applications , 2021 , 41 ( 2 ): 318 - 323 .
CHUNG Y A , ZHU C G , ZENG M . SPLAT:speech-language joint pre-training for spoken language understanding [C ] // Proceedings of 2021 Conference of the North American Chapter of Association for Computational Linguistics:Human Language Technologies . Stroudsburg:Association for Computational Linguistics , 2021 : 1897 - 1907 .
HOCHREITER S , SCHMIDHUBER J . Long short-term memory [J ] . Neural Computation , 1997 , 9 ( 8 ): 1735 - 1780 .
GRAVES A . Supervised sequence labelling with recurrent neural networks [M ] . Heidelberg : Springer , 2012 .
CUI Z C , CHEN W L , CHEN Y X . Multiscale convolutional neural network for time series classification [J ] . arXiv preprint , 2016 ,arXiv:1603.06995.
0
浏览量
778
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621