TY - JOUR
T1 - Information gain ratio-based subfeature grouping empowers particle swarm optimization for feature selection
AU - Gao, Jinrui
AU - Wang, Ziqian
AU - Jin, Ting
AU - Cheng, Jiujun
AU - Lei, Zhenyu
AU - Gao, Shangce
N1 - Publisher Copyright:
© 2024 Elsevier B.V.
PY - 2024/2/28
Y1 - 2024/2/28
N2 - Feature selection is a critical preprocessing step in machine learning with significant real-world applications. Despite the widespread use of particle swarm optimization (PSO) for feature selection, owing to its robust global search capabilities, developing an effective PSO method for this task is still a substantial challenge. This study introduces a novel PSO variant, ISPSO, which integrates the information gain ratio for assessing feature importance. ISPSO's feature selection process involves partitioning features into distinct groups to establish the initial population. Recognizing that feature selection tasks are inherently binary, ISPSO replaces the traditional PSO velocity concept with a probabilistic approach. In addition, introducing a penalty term enhances the algorithm's ability to achieve superior results. Experimental evaluations on 16 datasets consistently show that ISPSO surpasses compared algorithms, highlighting its efficiency in eliminating redundant and irrelevant features.
AB - Feature selection is a critical preprocessing step in machine learning with significant real-world applications. Despite the widespread use of particle swarm optimization (PSO) for feature selection, owing to its robust global search capabilities, developing an effective PSO method for this task is still a substantial challenge. This study introduces a novel PSO variant, ISPSO, which integrates the information gain ratio for assessing feature importance. ISPSO's feature selection process involves partitioning features into distinct groups to establish the initial population. Recognizing that feature selection tasks are inherently binary, ISPSO replaces the traditional PSO velocity concept with a probabilistic approach. In addition, introducing a penalty term enhances the algorithm's ability to achieve superior results. Experimental evaluations on 16 datasets consistently show that ISPSO surpasses compared algorithms, highlighting its efficiency in eliminating redundant and irrelevant features.
KW - Classification
KW - Feature selection
KW - Information gain ratio
KW - Particle swarm optimization
UR - http://www.scopus.com/inward/record.url?scp=85182276665&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2024.111380
DO - 10.1016/j.knosys.2024.111380
M3 - 学術論文
AN - SCOPUS:85182276665
SN - 0950-7051
VL - 286
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
M1 - 111380
ER -