TY - JOUR
T1 - Disjoint and functional principal component analysis for infected cases and deaths due to covid-19 in south american countries with sensor-related data
AU - Martin-Barreiro, Carlos
AU - Ramirez-Figueroa, John A.
AU - Cabezas, Xavier
AU - Leiva, Víctor
AU - Galindo-Villardón, M. Purificación
N1 - Publisher Copyright:
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https:// creativecommons.org/licenses/by/ 4.0/).
PY - 2021/6/2
Y1 - 2021/6/2
N2 - In this paper, we group South American countries based on the number of infected cases and deaths due to COVID-19. The countries considered are: Argentina, Bolivia, Brazil, Chile, Colombia, Ecuador, Peru, Paraguay, Uruguay, and Venezuela. The data used are collected from a database of Johns Hopkins University, an institution that is dedicated to sensing and monitoring the evolution of the COVID-19 pandemic. A statistical analysis, based on principal components with modern and recent techniques, is conducted. Initially, utilizing the correlation matrix, standard components and varimax rotations are calculated. Then, by using disjoint components and functional components, the countries are grouped. An algorithm that allows us to keep the principal component analysis updated with a sensor in the data warehouse is designed. As reported in the conclusions, this grouping changes depending on the number of components considered, the type of principal component (standard, disjoint or functional) and the variable to be considered (infected cases or deaths). The results obtained are compared to the k-means technique. The COVID-19 cases and their deaths vary in the different countries due to diverse reasons, as reported in the conclusions.
AB - In this paper, we group South American countries based on the number of infected cases and deaths due to COVID-19. The countries considered are: Argentina, Bolivia, Brazil, Chile, Colombia, Ecuador, Peru, Paraguay, Uruguay, and Venezuela. The data used are collected from a database of Johns Hopkins University, an institution that is dedicated to sensing and monitoring the evolution of the COVID-19 pandemic. A statistical analysis, based on principal components with modern and recent techniques, is conducted. Initially, utilizing the correlation matrix, standard components and varimax rotations are calculated. Then, by using disjoint components and functional components, the countries are grouped. An algorithm that allows us to keep the principal component analysis updated with a sensor in the data warehouse is designed. As reported in the conclusions, this grouping changes depending on the number of components considered, the type of principal component (standard, disjoint or functional) and the variable to be considered (infected cases or deaths). The results obtained are compared to the k-means technique. The COVID-19 cases and their deaths vary in the different countries due to diverse reasons, as reported in the conclusions.
KW - Data science
KW - Disjoint and functional components
KW - Infectious diseases
KW - K-means clustering
KW - Multivariate statistical methods
KW - R software
KW - SARS-Cov2
KW - Sensing and data extraction
UR - http://www.scopus.com/inward/record.url?scp=85107721678&partnerID=8YFLogxK
U2 - 10.3390/s21124094
DO - 10.3390/s21124094
M3 - Article
AN - SCOPUS:85107721678
SN - 1424-8220
VL - 21
JO - Sensors
JF - Sensors
IS - 12
M1 - 4094
ER -