Automated cleansing and harmonization of international trade data

Large volumes of data are becoming increasingly available and can be very valuable for the analysis of different phenomena. These data can originate from multiple sources and be recorded in diverse formats, requiring preliminary scrutiny in order to be further used in scientific analyses. This first...

Bibliographic Details
Main authors: Sandra Oliveira, César Capinha, Jorge Rocha
Format: article
Language: EN
Published: Elsevier 2021
Subjects:
Q
Online access: https://doaj.org/article/067251aaedcd4829a492d9659cffeddf
id oai:doaj.org-article:067251aaedcd4829a492d9659cffeddf
record_format dspace
spelling oai:doaj.org-article:067251aaedcd4829a492d9659cffeddf
2021-11-10T04:27:42Z
Automated cleansing and harmonization of international trade data
2215-0161
10.1016/j.mex.2021.101567
https://doaj.org/article/067251aaedcd4829a492d9659cffeddf
2021-01-01T00:00:00Z
http://www.sciencedirect.com/science/article/pii/S2215016121003575
https://doaj.org/toc/2215-0161
MethodsX, Vol 8, Iss , Pp 101567- (2021)
institution DOAJ
collection DOAJ
language EN
topic Automated screening
Data harmonization
Time-series analysis
R software
Science
Q
description Large volumes of data are becoming increasingly available and can be very valuable for the analysis of different phenomena. These data can originate from multiple sources and be recorded in diverse formats, requiring preliminary scrutiny in order to be further used in scientific analyses. This first crucial phase of filtering and cleansing data is usually a cumbersome and time-consuming task, but automated routines can be developed to help researchers. A routine created with the R language is here presented, to screen, harmonize and aggregate international trade data, representing the trade flows between countries for specific products, in a timeframe that covers monthly flows for at least 15 years for most countries. The R script implementing these routines is provided, being easily adapted to other datasets with similar issues.
• A step-by-step procedure for cleansing and harmonizing international trade data, using R programming language, is presented
• Automated routines are very effective in obtaining robust and filtered data inputs to integrate in scientific models
• Spatial and temporal patterns of worldwide trade relations can be explored to enhance our understanding of various associated phenomena
format article
author Sandra Oliveira
César Capinha
Jorge Rocha
title Automated cleansing and harmonization of international trade data
publisher Elsevier
publishDate 2021
url https://doaj.org/article/067251aaedcd4829a492d9659cffeddf
_version_ 1718440621572620288
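
The record describes a routine that screens, harmonizes and aggregates monthly trade flows between countries. The authors' routine is an R script that is not reproduced in this record, so the block below is only a minimal Python sketch of the general idea, not their method. Everything in it is an assumption made for illustration: the field names (`exporter`, `importer`, `product`, `period`, `quantity`), the `COUNTRY_ALIASES` lookup, the product code `0602`, the comma as a thousands separator, and the rule that a repeated exporter/importer/product/month combination counts as a duplicate.

```python
from collections import defaultdict

# Hypothetical lookup mapping spelling variants to one harmonized label.
COUNTRY_ALIASES = {
    "usa": "United States",
    "united states of america": "United States",
    "united states": "United States",
    "portugal": "Portugal",
    "brasil": "Brazil",
    "brazil": "Brazil",
}


def harmonize_country(name):
    """Return the harmonized country label, or None if the name is unknown."""
    return COUNTRY_ALIASES.get(name.strip().lower())


def parse_quantity(raw):
    """Convert a raw quantity to a non-negative float, or None if unusable."""
    try:
        # Assumption: commas are thousands separators, e.g. "1,200".
        value = float(str(raw).replace(",", ""))
    except ValueError:
        return None
    # NaN and negative values both fail this comparison and are rejected.
    return value if value >= 0 else None


def clean_and_aggregate(rows):
    """Screen rows, harmonize country names, and sum monthly flows per year.

    Returns (totals, rejected), where totals maps
    (exporter, importer, product, year) to the summed quantity.
    """
    totals = defaultdict(float)
    seen = set()
    rejected = 0
    for row in rows:
        exporter = harmonize_country(row["exporter"])
        importer = harmonize_country(row["importer"])
        quantity = parse_quantity(row["quantity"])
        if exporter is None or importer is None or quantity is None:
            rejected += 1
            continue
        period = row["period"]  # assumed format: "YYYY-MM"
        key = (exporter, importer, row["product"], period)
        if key in seen:  # same flow reported twice for one month
            rejected += 1
            continue
        seen.add(key)
        totals[(exporter, importer, row["product"], period[:4])] += quantity
    return dict(totals), rejected


sample = [
    {"exporter": "USA", "importer": "Portugal", "product": "0602",
     "period": "2010-01", "quantity": "1,200"},
    {"exporter": "United States of America", "importer": "portugal ",
     "product": "0602", "period": "2010-02", "quantity": "800"},
    {"exporter": "USA", "importer": "Portugal", "product": "0602",
     "period": "2010-02", "quantity": "800"},   # repeated month
    {"exporter": "Brasil", "importer": "Portugal", "product": "0602",
     "period": "2010-03", "quantity": "n/a"},   # unusable quantity
    {"exporter": "Atlantis", "importer": "Portugal", "product": "0602",
     "period": "2010-03", "quantity": "50"},    # unknown country
]

totals, rejected = clean_and_aggregate(sample)
print(totals)
print("rejected rows:", rejected)
```

In this sketch the rejected rows are only counted. A real screening routine would more likely log them so that dropped records can be audited.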