Big Data...a few Outliers = Big Mistakes. Un nuovo processo per l'individuazione di outliers

The search and identification of outliers is a fundamental step, generally preparatory to the elaborations aimed at obtaining consistent results. The new approach devised for the identification of outliers in space R2 benefits from geometric / statistical techniques largely independent from the typ...

Description complète

Enregistré dans:
Détails bibliographiques
Auteur principal: Maurizio Rosina
Format: article
Langue:EN
IT
Publié: mediaGEO soc. coop. 2018
Sujets:
Accès en ligne:https://doaj.org/article/08698e4e118f4e2baea7e541e8450d77
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
Description
Résumé:The search and identification of outliers is a fundamental step, generally preparatory to the elaborations aimed at obtaining consistent results. The new approach devised for the identification of outliers in space R2 benefits from geometric / statistical techniques largely independent from the type of data distribution, and is based on four methodological pillars: clustering, the convex hull peeling technique, a specific metric and Chebyshev's inequality, which is valid for any type of univariate distribution of values. The modularity and the generality of the approach, coupled to the research and identification of outliers based on strictly statistical parameters, make the approach presented a useful and daily tool for those who need to process bivariate data with the security of being able to previously identify outliers.