CBR-PSO: cost-based rough particle swarm optimization approach for high-dimensional imbalanced problems


Aydogan E. K. , ÖZMEN M., Delice Y.

NEURAL COMPUTING & APPLICATIONS, cilt.31, ss.6345-6363, 2019 (SCI İndekslerine Giren Dergi) identifier identifier

  • Cilt numarası: 31 Konu: 10
  • Basım Tarihi: 2019
  • Doi Numarası: 10.1007/s00521-018-3469-2
  • Dergi Adı: NEURAL COMPUTING & APPLICATIONS
  • Sayfa Sayısı: ss.6345-6363

Özet

Datasets, which have a considerably larger number of attributes compared to samples, face a serious classification challenge. This issue becomes even harder when such high-dimensional datasets are also imbalanced. Recently, such datasets have attracted the interest of both industry and academia and thereby have become a very attractive research area. In this paper, a new cost-sensitive classification method, the CBR-PSO, is presented for such high-dimensional datasets with different imbalance ratios and number of classes. The CBR-PSO is based on particle swarm optimization and rough set theory. The robustness of the algorithm is based on the simultaneously applying attribute reduction and classification; in addition, these two stages are also sensitive to misclassification cost. Algorithm efficiency is examined in publicly available datasets and compared to well-known attribute reduction and cost-sensitive classification algorithms. The statistical analysis and experiments showed that the CBR-PSO can be better than or comparable to the other algorithms, in terms of MAUC values.