您的位置:山东大学 -> 科技期刊社 -> 《山东大学学报(工学版)》

山东大学学报 (工学版) ›› 2019, Vol. 49 ›› Issue (2): 102-106.doi: 10.6040/j.issn.1672-3961.0.2018.189

• 机器学习与数据挖掘 • 上一篇    下一篇

基于C-GRU的微博谣言事件检测方法

李力钊1(),蔡国永1,潘角2   

  1. 1. 桂林电子科技大学计算机与信息安全学院, 广西 桂林 541004
    2. 桂林凯歌信息科技有限公司, 广西 桂林 541004
  • 收稿日期:2018-05-25 出版日期:2019-04-20 发布日期:2019-04-19
  • 作者简介:李力钊(1993—),男,山西长治人,硕士研究生,主要研究方向为数据挖掘,谣言检测.E-mail:786225251@qq.com
  • 基金资助:
    桂林市科学研究与技术开发计划项目(20170113-6)

A microblog rumor events detection method based on C-GRU

Lizhao LI1(),Guoyong CAI1,Jiao PAN2   

  1. 1. School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, Guangxi, China
    2. Guilin Kaige Information Technology Co., Ltd., Guilin 541004, Guangxi, China
  • Received:2018-05-25 Online:2019-04-20 Published:2019-04-19
  • Supported by:
    桂林市科学研究与技术开发计划项目(20170113-6)

摘要:

提出基于卷积-门控循环单元(convolution-gated recurrent unit, C-GRU)的微博谣言事件检测模型。结合卷积神经网络(convolutional neural networks, CNN)和门控循环单元(gated recurrent unit, GRU)的优点,将微博事件博文句向量化,通过CNN中的卷积层学习微博窗口的特征表示,将微博窗口特征按时间顺序拼接成窗口特征序列,将窗口特征序列输入GRU中学习序列特征表示进行谣言事件检测。在真实数据集上的试验结果表明,相比基于传统机器学习方法、CNN和GRU的谣言检测模型,该模型有更好的谣言识别能力。

关键词: 谣言事件检测, 深度学习, 卷积-门控循环单元, 窗口特征序列

Abstract:

A microblog rumor events detection model based on convolution-gated recurrent unit(C-GRU) was proposed. Combining the advantages of CNN and GRU, the microblog event′s posts was vectorized. By learning the features representation of the microblog windows through the convolution layer of CNN, the features of microblog windows was spliced into a sequence of window feature according to the time order, and the sequence of window feature was put into the GRU to learn feature representation of sequence for rumor events detection. Experimental results from real data sets showed that this model had better ability to rumor detection than other models based on traditional machine learning, CNN or RNN.

Key words: rumor events detection, deep learning, convolution-gated recurrent unit, window feature sequence

中图分类号: 

  • TP391.1

图1

基于C-GRU的谣言事件检测模型"

图2

卷积提取窗口特征"

图3

窗口特征拼接及窗口特征序列构建"

图4

GRU学习序列特征并输出结果"

表1

各方法准确率对比结果"

方法 Ac/%
SVM-RBF 79.75
DTC 81.25
RNN 87.25
1-LSTM 89.75
1-GRU 90.25
2-GRU 90.75
CNN 95.25
C-GRU 95.75

表2

准确率比较"

过滤器长度 滤器个数 Ac/%
2 180 92.50
3 180 93.25
4 180 94.25
5 180 93.75
2, 3 90 94.00
3, 4 90 94.50
4, 5 90 94.75
2, 3, 4 60 94.25
3, 4, 5 60 95.75
3, 4, 5 50 95.50
3, 4, 5 70 90.25
1 霍恩比. 牛津高阶英语词典[M]. 9版 北京: 商务印书馆, 2018.
2 QAZVINIAN V, ROSENGREN E, RADEV D R, et al. Rumor has it: identifying misinformation in microblogs[C]//Proceedings of the Conference on Empirical Methods in Natural Language Processing. Scotland, UK: Association for Computational Linguistics, 2011: 1589-1599.
3 HASSAN A, QAZVINIAN V, RADEV D. What's with the attitude?: identifying sentences with attitude in online discussions[C]//Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing. Massachusetts, USA: Association for Computational Linguistics. ACM, 2010: 1245-1255.
4 MA Ben , LIN Dazhen , CAO Donglin . Content representation for microblog rumor detection[M]. Advances in Computational Intelligence Systems. Lancaster, UK: Springer International Publishing, 2017: 245- 251.
5 CASTILLO C, MENDOZA M, POBLET B. Information credibility on Twitter[C]//Proceedings of the 20th international conference on World wide web. Hyderabad, India: ACM, 2011: 675-684.
6 MORRIS M R, COUNTS S, ROSEWAY A, et al. Tweeting is believing? understanding microblog credibility perceptions[C]//Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work. Washington, USA: ACM, 2012: 441-450.
7 LIANG Gang , HE Wenbo , XU Chun , et al. Rumor identification in microblogging systems based on users' behavior[J]. IEEE Transactions on Computational Social Systems, 2015, 2 (3): 99- 108.
doi: 10.1109/TCSS.2016.2517458
8 MENDOZA M, POBLETE B, CASTILLO C. Twitter under crisis: can we trust what we RT?[C]//Proceedings of the First Workshop on Social Media Analytics. Washington, USA: ACM, 2010: 71-79.
9 CAI Guoyong, BI Mengying, LIU Jianxing. A novel rumor detection method based on labeled cascade propagation tree[C]//Proceedings of the 2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery. Changsha, China: ACM, 2017.
10 BAO Yuanyuan, YI Chengqi, XUE Yibo, et al. A new rumor propagation model and control strategy on social networks[C]//Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. Ontario, Canada: ACM, 2013: 1472-1473.
11 KWON S, CHA M, JUNG K, et al. Prominent features of rumor propagation in online social media[C]//Data Mining (ICDM), 2013 IEEE 13th International Conference. Dallas, TX, USA: IEEE, 2013: 1103-1108.
12 MA Jing, GAO Wei, WEI Zhongyu, et al. Detect rumors using time series of social context information on microblogging websites[C]//Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. Melbourne, Australia: ACM, 2015: 1751-1754.
13 毛二松, 陈刚, 刘欣, 等. 基于深层特征和集成分类器的微博谣言检测研究[J]. 计算机应用研究, 2016, (11): 3369- 3373.
MAO Ersong , CHEN Gang , LIU Xin , et al. Research on detecting micro-blog rumors based on deep features and ensemble classifier[J]. Application Research of Computers, 2016, (11): 3369- 3373.
14 MA Jing, GAO Wei, MITRA P, et al. Detecting rumors from microblogs with recurrent neural networks[C]//Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence. New York, USA: AAAI Press, 2016: 3818-3824.
15 CHEN Tong, WU Lin, LI Xue, et al. Call attention to rumors: deep attention based recurrent neural networks for early rumor detection[J]. arXiv Preprint, 2017. https://arxiv.org/pdf/1704.05973.pdf
16 RUCHANSKY N, SEO S, Liu Y. CSI: a hybrid deep model for fake news[C]. Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. Singapore: ACM, 2017: 797-806.
17 刘政, 卫志华, 张韧弦. 基于卷积神经网络的谣言检测[J]. 计算机应用, 2017, 37 (11): 3053- 3056.
LIU Zheng , WEI Zhihua , ZHANG Renxian . Rumor detection based on convolution neural network[J]. Journal of Computer Applications, 2017, 37 (11): 3053- 3056.
18 ZHOU Chunting, SUN Chonglin, LIU Zhiyuan, et al. A C-LSTM neural network for text classification[J]. Computer Science, 2015. https://arxiv.org/pdf/1704.05973.pdf?tdsourcetag=s_pcqq_aiomsg.
19 CHO K, MERRIENBOER B V, BAHDANAU D, et al. On the properties of neural machine translation: Encoder-decoder approaches[C]//Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation. Doha, Qatar: ACL, 2014, 103-111.
20 KINGMA D, BA J. ADAM: A Method for stochastic optimization[C]//Proceedings of the 3rd International Conference on Learning Representation. San Diego, USA: ICLR, 2015
21 YANG Fan, YU Xiaohui, LIU Yang, et al. Automatic detection of rumor on Sina Weibo[C]//Proceedings of the ACM SIGKDD Workshop on Mining Data Semantics. Beijing, China: ACM, 2012.
[1] 李常刚,李宝亮,曹永吉,王佳颖. 人工智能在电力系统潮流计算中的应用综述及展望[J]. 山东大学学报 (工学版), 2025, 55(5): 1-17.
[2] 周群颖,隋家成,张继,王洪元. 基于自监督卷积和无参数注意力机制的工业品表面缺陷检测[J]. 山东大学学报 (工学版), 2025, 55(4): 40-47.
[3] 薛冰冰,王勇,杨维浩,王川,于迪,王旭. 基于ETC收费数据的高速公路交通流数据修复及实时预测[J]. 山东大学学报 (工学版), 2025, 55(3): 58-71.
[4] 董明书,陈俐企,马川义,张珠皓,孙仁娟,管延华,庄培芝. 沥青路面内部裂缝雷达图像智能判识算法研究[J]. 山东大学学报 (工学版), 2025, 55(3): 72-79.
[5] 常新功,苏敏惠,周志刚. 基于进化集成的图神经网络解释方法[J]. 山东大学学报 (工学版), 2024, 54(4): 1-12.
[6] 索大翔,李波. 基于Gromov-Wasserstein最优传输的输电线路小目标检测方法[J]. 山东大学学报 (工学版), 2024, 54(3): 22-29.
[7] 宋辉,张轶哲,张功萱,孟元. 基于类权重和最小化预测熵的测试时集成方法[J]. 山东大学学报 (工学版), 2024, 54(3): 36-43.
[8] 刘新,刘冬兰,付婷,王勇,常英贤,姚洪磊,罗昕,王睿,张昊. 基于联邦学习的时间序列预测算法[J]. 山东大学学报 (工学版), 2024, 54(3): 55-63.
[9] 聂秀山,巩蕊,董飞,郭杰,马玉玲. 短视频场景分类方法综述[J]. 山东大学学报 (工学版), 2024, 54(3): 1-11.
[10] 李璐,张志军,范钰敏,王星,袁卫华. 面向冷启动用户的元学习与图转移学习序列推荐[J]. 山东大学学报 (工学版), 2024, 54(2): 69-79.
[11] 高泽文,王建,魏本征. 基于混合偏移轴向自注意力机制的脑胶质瘤分割算法[J]. 山东大学学报 (工学版), 2024, 54(2): 80-89.
[12] 陈成,董永权,贾瑞,刘源. 基于交互序列特征相关性的可解释知识追踪[J]. 山东大学学报 (工学版), 2024, 54(1): 100-108.
[13] 李家春,李博文,常建波. 一种高效且轻量的RGB单帧人脸反欺诈模型[J]. 山东大学学报 (工学版), 2023, 53(6): 1-7.
[14] 王旭晴,魏伟波,杨光宇,宋金涛,吕婷,潘振宽. 基于算法展开的图像盲去模糊深度学习网络[J]. 山东大学学报 (工学版), 2023, 53(6): 35-46.
[15] 王碧瑶,韩毅,崔航滨,刘毅超,任铭然,高维勇,陈姝廷,刘嘉巍,崔洋. 基于图像的道路语义分割检测方法[J]. 山东大学学报 (工学版), 2023, 53(5): 37-47.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] 王素玉,艾兴,赵军,李作丽,刘增文 . 高速立铣3Cr2Mo模具钢切削力建模及预测[J]. 山东大学学报(工学版), 2006, 36(1): 1 -5 .
[2] 陈瑞,李红伟,田靖. 磁极数对径向磁轴承承载力的影响[J]. 山东大学学报(工学版), 2018, 48(2): 81 -85 .
[3] 孔维涛,张庆范,张承慧 . 基于DSP的空间矢量脉宽调制(SVPWM)的实现[J]. 山东大学学报(工学版), 2008, 38(3): 81 -84 .
[4] 张迎春 王佐勋 王桂娟. 基于神经网络控制器的高压电缆测温系统[J]. 山东大学学报(工学版), 2009, 39(5): 62 -67 .
[5] 孙媛媛 徐衍亮 姚之宁. 旁磁制动单相感应电动机制动力的分析与计算[J]. 山东大学学报(工学版), 2009, 39(5): 120 -123 .
[6] 朱向彩,栾云才,徐健 . 基于VB及FTA的城市交通评价系统[J]. 山东大学学报(工学版), 2007, 37(4): 89 -92 .
[7] 孙亮. 瞬变电磁对含水层的超前探测效果分析[J]. 山东大学学报(工学版), 2009, 39(4): 50 -52 .
[8] 张宁 李术才 李明田 杨磊. 新型岩石相似材料的研制[J]. 山东大学学报(工学版), 2009, 39(4): 149 -154 .
[9] 茹淼焱,王明刚, , 鲁成学, 张洪林 . 淀粉酶催化反应的最适温度的微量量热法[J]. 山东大学学报(工学版), 2008, 38(1): 113 -115 .
[10] 李春晓 岳钦艳 卢磊 高宝玉 杨忠莲 司晓慧 倪寿清 王元芳. 疏水缔合阳离子聚丙烯酰胺的合成与应用[J]. 山东大学学报(工学版), 2008, 38(6): 99 -104 .