Abstract
Feature selection is a critical preprocessing step in machine learning with significant real-world applications. Despite the widespread use of particle swarm optimization (PSO) for feature selection, owing to its robust global search capabilities, developing an effective PSO method for this task is still a substantial challenge. This study introduces a novel PSO variant, ISPSO, which integrates the information gain ratio for assessing feature importance. ISPSO's feature selection process involves partitioning features into distinct groups to establish the initial population. Recognizing that feature selection tasks are inherently binary, ISPSO replaces the traditional PSO velocity concept with a probabilistic approach. In addition, introducing a penalty term enhances the algorithm's ability to achieve superior results. Experimental evaluations on 16 datasets consistently show that ISPSO surpasses compared algorithms, highlighting its efficiency in eliminating redundant and irrelevant features.
Original language | English |
---|---|
Article number | 111380 |
Journal | Knowledge-Based Systems |
Volume | 286 |
DOIs | |
State | Published - 2024/02/28 |
Keywords
- Classification
- Feature selection
- Information gain ratio
- Particle swarm optimization
ASJC Scopus subject areas
- Software
- Management Information Systems
- Information Systems and Management
- Artificial Intelligence