JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE) ›› 2011, Vol. 41 ›› Issue (4): 7-12.

A semi-supervised learning method based on information entropy to extract the domain entity relation

GUO Jian-yi1,2, LEI Chun-ya1, YU Zheng-tao1,2, SU Lei1,2, ZHAO Jun1, TIAN Wei1   

  1. School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650051, China; 2. Key Laboratory of Intelligent Information Processing, Kunming University of Science and Technology, Kunming 650051, China
  • Received: 2011-02-14 Online: 2011-08-16 Published: 2011-02-14

Abstract:

To overcome the dependence of supervised learning methods on large-scale labeled corpora, a semi-supervised method based on information entropy was proposed to extract entity relations using small-scale training data. First, small-scale training data were selected with the help of a domain vocabulary, and an initial maximum entropy classifier of a certain accuracy was constructed to predict new candidate instances from unlabeled data. Second, by applying information entropy with different entropy thresholds over multiple cycles, new instances of higher credibility were selected from the candidate instances to expand the training data. Finally, the classifier was retrained iteratively on the expanded training data until its performance stabilized, at which point iteration terminated, thereby achieving domain entity relation extraction. Experimental results showed that the semi-supervised learning method based on information entropy achieved better learning results than the other methods compared.
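
The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`entropy`, `select_confident`, `self_train`), the `train` and `predict_proba` callbacks, and the single fixed threshold are all assumptions made for illustration. The sketch treats low Shannon entropy of the classifier's predicted label distribution as a proxy for high credibility, and it stops when no candidate passes the threshold or after a fixed number of rounds, which is a simplification of the "performance stabilized" criterion in the abstract.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a discrete probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def select_confident(candidates, threshold):
    """Split candidates by the entropy of their predicted distribution.

    candidates: list of (instance, probs) pairs, where probs is the
    classifier's predicted distribution over relation labels.
    Returns (selected, remaining): selected holds (instance, label) pairs
    whose entropy is at or below the threshold; remaining holds the
    instances that stay unlabeled.
    """
    selected, remaining = [], []
    for instance, probs in candidates:
        if entropy(probs) <= threshold:
            # Take the most probable label as the pseudo-label.
            label = max(range(len(probs)), key=probs.__getitem__)
            selected.append((instance, label))
        else:
            remaining.append(instance)
    return selected, remaining


def self_train(labeled, unlabeled, train, predict_proba, threshold, max_rounds=10):
    """Hypothetical self-training loop around an arbitrary classifier.

    train(labeled) -> model and predict_proba(model, x) -> list of
    probabilities are caller-supplied callbacks (assumed interfaces).
    """
    model = train(labeled)
    for _ in range(max_rounds):
        candidates = [(x, predict_proba(model, x)) for x in unlabeled]
        selected, unlabeled = select_confident(candidates, threshold)
        if not selected:
            break  # nothing credible enough to add; stop iterating
        labeled = labeled + selected
        model = train(labeled)
    return model, labeled
```

In practice the `train` callback could wrap any maximum entropy (multinomial logistic regression) implementation, and the threshold could be varied across rounds, as the abstract suggests with its different entropy values.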
 

Key words: information entropy, semi-supervised, maximum entropy classifier, unlabeled, credibility
