JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE) ›› 2014, Vol. 44 ›› Issue (1): 13-18.doi: 10.6040/j.issn.1672-3961.2.2013.274

• Articles • Previous Articles     Next Articles

A method of gender discrimination based on character feature of Chinese names

YU Jiang-de1, ZHAO Hong-dan1, ZHENG Bo-ju1, YU Zheng-tao2   

  1. 1. School of Computer and Information Engineering, Anyang Normal University, Anyang   455000, China;
    2. School of Information Engineering and Automation, Kunming University of Science and Technology,
     Kunming 650051, China
  • Received:2013-06-28 Online:2014-02-20 Published:2013-06-28

Abstract:

Based on the strong gender discrimination of Chinese names, a method of gender discrimination based on character feature of Chinese names using nave Bayes classifier was presented. In this method, the first character of each Chinese name (Zi1), the second character (Zi2), the first and the second characters (Zi1Zi2) were regarded as distinguishing features. The nave Bayes classification method was used for gender discrimination of Chinese names. Training and testing were done on 412775 Chinese names corpus using 10 fold cross validation method, and comparative experiments were done according to the different feature combinations, they were  Zi1, Zi2, Zi1+Zi2, Zi1+Zi1Zi2, Zi2+Zi1Zi2, Zi1+Zi2+Zi1Zi2(all the distinguishing features). The average accuracy were as followings in turn, 72.75%,86.92%, 88.84%, 87.37%, 89.35%, 90.06%, of which the best average accuracy was 90.06%.

Key words: character feature, gender discrimination, feature combination, distinguishing feature, Chinese names, nave Bayes classification

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!