Instance selection for multi-label learning based on a scalable evolutionary algorithm

View/ Open
Author
Romero-del-Castillo, Juan A.
Ortiz-Boyer, Domingo
García Pedrajas, Nicolás
Publisher
IEEEDate
2021Subject
Instance selectionEvolutionary algorithms
Classification models
Multi-label learning
Multi-Label kNN
METS:
Mostrar el registro METSPREMIS:
Mostrar el registro PREMISMetadata
Show full item recordAbstract
Multi-label classification has recently attracted
greater research interest as a data mining task. Many current ap-
plications in data mining address problems having instances that
belong to more than one class, which requires the development
of new efficient methods.
Instance-based classification models, such as the k-nearest
neighbor rule, are among the highest performing methods on
any classification task and have also been successfully applied
to multi-label problems. Despite their simplicity, they achieve
comparable performance compared to considerably more com-
plex methods. One of the challenges associated with instance-
based classification models is their requirement for storing
all training instances in memory. To ameliorate this problem,
instance selection methods have been proposed. However, their
application to multi-label problems is problematic because the
adaptation of most of their concepts to multi-label problems is
difficult.
In this paper, we propose a scalable evolutionary algorithm for
instance selection for multi-label problems. As our evolutionary
algorithm is solely based on the performance of a subset of
selected instances, it is able to handle multi-label datasets.
On a set of 12 real-world problems, our approach performs
comparably to methods that use all instances while achieving
a large reduction in the size of the training set.
Index Terms—Instance selection; evolutionary algorithms;
classification models; multi-label learning; multi-Label kNN.
Description
Embargado hasta 01/01/2100