JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE) ›› 2011, Vol. 41 ›› Issue (6): 1-6.

• Articles •     Next Articles

A method of feature selection for continuous attributes

LI Guo-he1,2, YUE Xiang1,2, LI Xue3, WU Wei-jiang1,2, LI Hong-qi1   

  1. 1. College of Geophysics and Information Engineering, China University of Petroleum, Beijing 102249, China;
    2.  PanPass Institute of Digital Identification Management and Internet of Things,  Beijing 100029, China;
    3.  School of Information Technology and Electrical Engineering, the University of Queensland, Brisbane 4072, Australia
  • Received:2011-04-15 Online:2011-12-16 Published:2011-04-15

Abstract:

Feature selection is one of the methods for reduction of data sets, which improves efficiency and effectivity of machine learning. In terms of the distribution of objects and their classification labels, the continuous feature space was partitioned into a variety of subspaces, each one with a clear edge and unique classification label. After the projection of all the subspaces for  each feature, the quality of each feature was  estimated for a subspace opposite all  the other subspaces with different classification labels by means of statistical significance. Through construction of a matrix by all the estimate qualities of all features of  the subspaces, all  features were ranked from the highest classifying power to the lowest on the matrix for the feature space. After the information gain function was defined by the subset of features, the feature subset was optimally determined on the basis of ranked features by gradually adding features. Experiments on the data sets from UCI(University of California Irvine) repository by the feature selection obtained feature subsets,  by which the performance and classification accuracy of machine learning were improved, illustrating that the feature selection was feasible.

Key words: data reduction, feature selection, continuous attributes, decision table

CLC Number: 

  • TP181
[1] MOU Lianming. Weighted k sub-convex-hull classifier based on adaptive feature selection [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2018, 48(5): 32-37.
[2] LI Sushu, WANG Shitong, LI Tao. A feature selection method based on LS-SVM and fuzzy supplementary criterion [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2017, 47(3): 34-42.
[3] FANG Hao, LI Yun. Random undersampling and POSS method for software defect prediction [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2017, 47(1): 15-21.
[4] MO Xiaoyong, PAN Zhisong, QIU Junyang, YU Yajun, JIANG Mingchu. Anomaly detection in network traffic based on online feature selection [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2016, 46(4): 21-27.
[5] JING Yunge, LI Tianrui. An incremental approach for reduction based on knowledge granularity [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2016, 46(1): 1-9.
[6] WEI Xiaomin, XU Bin, GUAN Jihong. Prediction of protein energy hot spots based on recursion feature elimination [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2014, 44(2): 12-20.
[7] ZHANG Guo-dong1,2, ZHANG Hua-xiang1,2*. Text categorization algorithm based on non-linear manifold learning and k-NN [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2013, 43(1): 28-33.
[8] CHEN Yu-ming, WU Ke-shou, XIE Rong-sheng. Reduction for decision table based on relative knowledge granularity [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2012, 42(6): 8-12.
[9] WU Ke-shou, CHEN Yu-ming, ZENG Zhi-qiang. Decision table reduction based on neighborhood relation [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2012, 42(2): 7-10.
[10] PAN Dong-yin, ZHU Fa, XU Sheng, YE Ning*. Feature selection of gene expression profiles of colon cancer [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2012, 42(2): 23-29.
[11] LI Xia1, WANG Lian-xi2, JIANG Sheng-yi1. Ensemble learning based feature selection for imbalanced problems [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2011, 41(3): 7-11.
[12] DAI Ping, LI Ning*. A fast SVM-based feature selection method [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2010, 40(5): 60-65.
[13] TAN Tai-zhe, LIANG Ying-yi, LIU Fu-chun. Application of ReliefF feature evaluation in un-supervised manifold learning [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2010, 40(5): 66-71.
[14] YOU Ming-yu, CHEN Yan, LI Guo-zheng. Im-IG: A novel feature selection method for imbalanced problems [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2010, 40(5): 123-128.
[15] YANG Ai-min1, ZHOU Yong-mei1, DENG He2, ZHOU Jian-feng3. Method of feature generation and selection for network traffic classification [J]. JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE), 2010, 40(5): 1-7.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!