Information gain ratio-based subfeature grouping empowers particle swarm optimization for feature selection

Jinrui Gao, Ziqian Wang, Ting Jin, Jiujun Cheng*, Zhenyu Lei, Shangce Gao

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

37 Scopus citations

Abstract

Feature selection is a critical preprocessing step in machine learning with significant real-world applications. Despite the widespread use of particle swarm optimization (PSO) for feature selection, owing to its robust global search capabilities, developing an effective PSO method for this task is still a substantial challenge. This study introduces a novel PSO variant, ISPSO, which integrates the information gain ratio for assessing feature importance. ISPSO's feature selection process involves partitioning features into distinct groups to establish the initial population. Recognizing that feature selection tasks are inherently binary, ISPSO replaces the traditional PSO velocity concept with a probabilistic approach. In addition, introducing a penalty term enhances the algorithm's ability to achieve superior results. Experimental evaluations on 16 datasets consistently show that ISPSO surpasses compared algorithms, highlighting its efficiency in eliminating redundant and irrelevant features.

Original languageEnglish
Article number111380
JournalKnowledge-Based Systems
Volume286
DOIs
StatePublished - 2024/02/28

Keywords

  • Classification
  • Feature selection
  • Information gain ratio
  • Particle swarm optimization

ASJC Scopus subject areas

  • Software
  • Management Information Systems
  • Information Systems and Management
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Information gain ratio-based subfeature grouping empowers particle swarm optimization for feature selection'. Together they form a unique fingerprint.

Cite this