Intervención de la analítica de datos en el análisis de la vacunación covid-19 en la provincia Ica.
Fecha
2025
Asesor
Título de la revista
ISSN de la revista
Título del volumen
Editor
Universidad Nacional San Luis Gonzaga
Resumen
El presente trabajo de tesis tuvo como objetivo Evaluar la INTERVENCION DE LA ANALÍTICA
DE DATOS EN EL ANÁLISIS DE LA VACUNACIÓN COVID-19 EN LA PROVINCIA ICA, la
misma que fue ampliado el análisis a toda la región por existir los datos abiertos del MINSA. La
metodología del tipo aplicada tecnológica, del nivel descriptivo, con un enfoque cuantitativo, del tipo
retrospectivo, la muestra estuvo conformada por todos los datos extraídos del portal del MINSA y
filtrados para la región Ica con un total de 1048,554 registros; el archivo original fue reducido por
medio de la herramienta de RStudio por ser demasiado grande; el procesamiento requirió del uso de
la plataforma de Google colaboratory, con el lenguaje Python y las librerías de Pandas, Numpy,
Sweetviz para el análisis básico de los datos. se utilizó igualmente para el análisis el software para
minería de datos Orange Datamining. El método aplicado consistió en la extracción de los datos del
portal del MINSA, reducción y filtrado de los datos con software RStudio, preparación y análisis de
los datos en la plataforma de Google colaboratory y Orange Data Mining. Los resultados muestran
que la población mayormente vacunada está en el rango de edades de 20 a 59 años con cerca del 70%,
y las vacunas mayormente empleadas fueron de los fabricantes Pfizer y Sinopharm con más del 80%
y la población mayormente vacunada se encuentra en la provincia de Ica con 48%. Existe una
población similar en vacunación según el sexo con aproximadamente 50%. Se concluye que la
analítica de datos ha sido positiva en obtener información valiosa sobre la vacunación del COVID
19.
The objective of this thesis work was to evaluate the INTERVENTION OF DATA ANALYTICS IN THE ANALYSIS OF COVID-19 VACCINATION IN THE PROVINCE OF ICA, which was extended to the whole region due to the existence of MINSA's open data. The methodology of the applied technological type, descriptive level, with a quantitative approach, retrospective type, the sample consisted of all data extracted from the MINSA portal and filtered for the Ica region with a total of 1048,554 records; the original file was reduced through the RStudio tool for being too large; the processing required the use of the Google Collaboratory platform, with the Python language and the Pandas, Numpy, Sweetviz libraries for the basic analysis of the data. The data mining software Orange Datamining was also used for the analysis. The method applied consisted of extracting data from the MINSA portal, reducing and filtering the data with RStudio software, preparing and analyzing the data in the Google Collaboratory and Orange Data Mining platform. The results show that the population mostly vaccinated is in the age range of 20 to 59 years with about 70%, and the vaccines mostly used were of the manufacturers Pfizer and Sinopharm with more than 80% and the population mostly vaccinated is located in the province of Ica with 48%. There is a similar vaccination population according to sex with approximately 50%. It is concluded that the data analysis has been positive in obtaining valuable information on COVID-19 vaccination.
The objective of this thesis work was to evaluate the INTERVENTION OF DATA ANALYTICS IN THE ANALYSIS OF COVID-19 VACCINATION IN THE PROVINCE OF ICA, which was extended to the whole region due to the existence of MINSA's open data. The methodology of the applied technological type, descriptive level, with a quantitative approach, retrospective type, the sample consisted of all data extracted from the MINSA portal and filtered for the Ica region with a total of 1048,554 records; the original file was reduced through the RStudio tool for being too large; the processing required the use of the Google Collaboratory platform, with the Python language and the Pandas, Numpy, Sweetviz libraries for the basic analysis of the data. The data mining software Orange Datamining was also used for the analysis. The method applied consisted of extracting data from the MINSA portal, reducing and filtering the data with RStudio software, preparing and analyzing the data in the Google Collaboratory and Orange Data Mining platform. The results show that the population mostly vaccinated is in the age range of 20 to 59 years with about 70%, and the vaccines mostly used were of the manufacturers Pfizer and Sinopharm with more than 80% and the population mostly vaccinated is located in the province of Ica with 48%. There is a similar vaccination population according to sex with approximately 50%. It is concluded that the data analysis has been positive in obtaining valuable information on COVID-19 vaccination.
Descripción
Palabras clave
Analítica de datos, COVID-19, Librerías, Orange Data Mining, Data analytics
