JOURNAL OF SHANDONG UNIVERSITY (ENGINEERING SCIENCE) ›› 2011, Vol. 41 ›› Issue (2): 96-101.

• Articles •

A classification method for class-imbalanced data

CHEN Jintan1, 2, KANG Hengzheng3*, YANG Yan3, ZHOU Weixiong 4   

    1. School of Management, Huazhong University of Science and Technology, Wuhan 410000, China;
     2. Guangdong Provincial Highway Administration, Guangzhou 510075, China;
    3. School of Information Science & Technology, Southwest Jiaotong University, Chengdu 610031, China;
    4. Foshan Highway Administration, Foshan 528000, China
  • Received:2011-03-12 Online:2011-04-16 Published:2011-03-12

Abstract:

To improve classification performance on the minority class of an unbalanced dataset, an improved AdaBoost algorithm (the UnAdaBoost algorithm) is proposed. The algorithm first modifies the base classifier to raise classification efficiency on the minority class, at the cost of some accuracy on the majority class. It then combines the base classifiers into an ensemble to make up for that loss of accuracy on the majority class. As a result, performance on the minority class improves without sacrificing accuracy on the majority class. In this study, an improved NaiveBayes algorithm serves as the base classifier, and the base classifiers are fused by the AdaBoost algorithm with an improved voting weight. Experimental results show that the UnAdaBoost algorithm is effective on unbalanced datasets when compared with the AdaBoost algorithm.
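
The general idea of boosting a minority-biased naive Bayes classifier can be sketched as follows. This is only an illustration under stated assumptions, not the UnAdaBoost algorithm itself: the abstract does not specify how NaiveBayes or the voting weight were improved, so the sketch uses a hypothetical `minority_boost` factor that scales the prior of the minority class (label 1), and it keeps the standard AdaBoost voting weight. All function and parameter names are made up for this example.

```python
import math

def fit_weighted_nb(X, y, w, minority_boost=1.0):
    """Fit a sample-weighted Gaussian naive Bayes model for labels 0 and 1.

    minority_boost is a hypothetical knob (not from the paper) that scales
    the prior of class 1 to bias the base classifier toward the minority.
    Assumes both classes occur in y.
    """
    model = {}
    for c in (0, 1):
        idx = [i for i, label in enumerate(y) if label == c]
        wc = sum(w[i] for i in idx)  # total sample weight of class c
        stats = []
        for j in range(len(X[0])):
            mean = sum(w[i] * X[i][j] for i in idx) / wc
            var = sum(w[i] * (X[i][j] - mean) ** 2 for i in idx) / wc + 1e-6
            stats.append((mean, var))
        prior = wc * (minority_boost if c == 1 else 1.0)
        model[c] = (math.log(prior), stats)
    return model

def predict_nb(model, x):
    """Return the label with the larger (unnormalized) log posterior."""
    def score(c):
        log_prior, stats = model[c]
        return log_prior + sum(
            -0.5 * math.log(2 * math.pi * var) - (v - mean) ** 2 / (2 * var)
            for v, (mean, var) in zip(x, stats)
        )
    return 1 if score(1) > score(0) else 0

def fit_boosted(X, y, rounds=5, minority_boost=2.0):
    """Standard discrete AdaBoost loop over the biased base classifier."""
    n = len(X)
    w = [1.0 / n] * n
    ensemble = []
    for _ in range(rounds):
        model = fit_weighted_nb(X, y, w, minority_boost)
        pred = [predict_nb(model, x) for x in X]
        err = sum(wi for wi, p, t in zip(w, pred, y) if p != t)
        if err >= 0.5:  # no better than chance: stop adding classifiers
            break
        err = max(err, 1e-10)  # avoid division by zero for a perfect round
        alpha = 0.5 * math.log((1 - err) / err)  # standard voting weight
        ensemble.append((alpha, model))
        # Raise the weight of misclassified samples, lower the rest.
        w = [wi * math.exp(alpha if p != t else -alpha)
             for wi, p, t in zip(w, pred, y)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble

def predict_boosted(ensemble, x):
    """Weighted vote: class 1 counts as +1, class 0 as -1."""
    vote = sum(alpha * (1 if predict_nb(m, x) == 1 else -1)
               for alpha, m in ensemble)
    return 1 if vote > 0 else 0
```

Biasing the prior is just one simple way to trade majority-class accuracy for minority-class recall in the base classifier; the ensemble vote is what is then expected to recover the lost majority-class accuracy.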

Key words:  imbalanced class, AdaBoost algorithm, accuracy
