JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE) ›› 2010, Vol. 40 ›› Issue (5): 117-122.

• Articles • Previous Articles     Next Articles

Word-position-based tagging for Chinese word segmentation

YU Jiang-de1, SUI Dan1, FAN Xiao-zhong2   

  1. 1. School of Computer and Information Engineering, Anyang Normal University, Anyang 455002, China;
    2. School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
  • Received:2010-01-30 Online:2010-10-16 Published:2010-01-30

Abstract:

 The performance of Chinese word segmentation has been greatly improved by word-position-based approaches in recent years. This approach treats Chinese word segmentation as a wordposition tagging problem. With the help of powerful sequence tagging model, word-position-based method quickly rose as a mainstream technique in this field. Feature template selection is crucial in this method. We further studied this technique via using four wordpositions and conditional random fields. Closed evaluations are performed on corpus from the third and the fourth international Chinese word segmentation Bakeoff, and comparative experiments are performed on different feature templates. Experimental results show that the feature template set: TMPT-10′  is much better performance than the traditional template set.
 

Key words: Chinese word segmentation, conditional random fields, word-position tagging, feature template

[1] YU Jiang-de1, ZHOU Hong-yu1, YU Zheng-tao2. Feature engineering for Chinese part-of-speech tagging [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2011, 41(6): 12-17.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!