TY - JOUR
T1 - A new principal component analysis by particle swarm optimization with an environmental application for data science
AU - Ramirez-Figueroa, John A.
AU - Martin-Barreiro, Carlos
AU - Nieto-Librero, Ana B.
AU - Leiva, Victor
AU - Galindo-Villardón, M. Purificación
N1 - Publisher Copyright:
© 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH, DE part of Springer Nature.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2021/10
Y1 - 2021/10
N2 - In this paper, we propose a new method for disjoint principal component analysis based on an intelligent search. The method consists of a principal component analysis with constraints, allowing us to determine components that are linear combinations of disjoint subsets of the original variables. The effectiveness of the proposed method contributes to solve one of the crucial problems of multivariate analysis, that is, the interpretation of the vectorial subspaces in the reduction of the dimensionality. The method selects the variables that contribute the most to each of the principal components in a clear and direct way. Numerical results are provided to confirm the quality of the solutions attained by the proposed method. This method avoids a local optimum and obtains a high success rate when reaching the best solution, which occurs in all the cases of our simulation study. An illustration with environmental real data shows the good performance of the method and its potential applications.
AB - In this paper, we propose a new method for disjoint principal component analysis based on an intelligent search. The method consists of a principal component analysis with constraints, allowing us to determine components that are linear combinations of disjoint subsets of the original variables. The effectiveness of the proposed method contributes to solve one of the crucial problems of multivariate analysis, that is, the interpretation of the vectorial subspaces in the reduction of the dimensionality. The method selects the variables that contribute the most to each of the principal components in a clear and direct way. Numerical results are provided to confirm the quality of the solutions attained by the proposed method. This method avoids a local optimum and obtains a high success rate when reaching the best solution, which occurs in all the cases of our simulation study. An illustration with environmental real data shows the good performance of the method and its potential applications.
KW - Constrained binary particle swarm optimization
KW - Data mining
KW - Disjoint principal components
KW - Evolutionary computation
KW - R software
KW - Singular value decomposition
UR - http://www.scopus.com/inward/record.url?scp=85098701291&partnerID=8YFLogxK
U2 - 10.1007/s00477-020-01961-3
DO - 10.1007/s00477-020-01961-3
M3 - Article
AN - SCOPUS:85098701291
SN - 1436-3240
VL - 35
SP - 1969
EP - 1984
JO - Stochastic Environmental Research and Risk Assessment
JF - Stochastic Environmental Research and Risk Assessment
IS - 10
ER -