Big Data...a few Outliers = Big Mistakes. Un nuovo processo per l'individuazione di outliers (A new process for identifying outliers)

Bibliographic Details
Main author: Maurizio Rosina
Format: article
Language: EN, IT
Published: mediaGEO soc. coop. 2018
Online access: https://doaj.org/article/08698e4e118f4e2baea7e541e8450d77
Description
Summary: Searching for and identifying outliers is a fundamental step, generally carried out before the processing that is meant to produce consistent results. The new approach devised for identifying outliers in the plane (R2) relies on geometric and statistical techniques that are largely independent of the type of data distribution. It rests on four methodological pillars: clustering, the convex hull peeling technique, a specific metric, and Chebyshev's inequality, which holds for any univariate distribution of values with finite variance. The modularity and generality of the approach, together with outlier detection based on strictly statistical parameters, make it a practical everyday tool for anyone who needs to process bivariate data and wants confidence that outliers have been identified beforehand.
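
To make two of the four pillars concrete, the following is a minimal Python sketch of how convex hull peeling and Chebyshev's inequality could be combined. It is not the procedure from the article: the record does not give the clustering step or the article's specific metric, so the sketch treats the data as a single cluster and assumes plain Euclidean distance from a center taken as the mean of the innermost hull layer. The function names and the default k = 3 are illustrative choices. Chebyshev's inequality states that P(|X - mu| >= k*sigma) <= 1/k^2 for any distribution with finite variance, so no more than a fraction 1/k^2 of the distances can be flagged.

```python
import math
from statistics import mean, pstdev


def convex_hull(points):
    """Andrew's monotone chain. Returns the hull vertices in counter-clockwise
    order; points lying on a hull edge are not reported as vertices."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def peel_layers(points):
    """Convex hull peeling: strip the hull vertices repeatedly.
    layers[0] is the outermost layer, layers[-1] the innermost."""
    remaining = set(points)
    layers = []
    while remaining:
        hull = convex_hull(list(remaining))
        layers.append(hull)
        remaining -= set(hull)
    return layers


def chebyshev_outliers(points, k=3.0):
    """Flag points whose distance from the center exceeds mean + k * std.

    Assumptions of this sketch (not taken from the article):
    - the center is the mean of the innermost peeling layer;
    - the metric is Euclidean distance to that center.
    """
    innermost = peel_layers(points)[-1]
    cx = mean(p[0] for p in innermost)
    cy = mean(p[1] for p in innermost)
    dist = [math.hypot(p[0] - cx, p[1] - cy) for p in points]
    mu, sigma = mean(dist), pstdev(dist)
    if sigma == 0:
        return []  # all points are equidistant from the center
    return [p for p, d in zip(points, dist) if d - mu > k * sigma]
```

Taking the center from the innermost layer rather than from the plain centroid keeps a distant point from pulling the reference location toward itself. A smaller k flags more points, and the 1/k^2 bound is only informative for k > 1.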