Big Data...a few Outliers = Big Mistakes. Un nuovo processo per l'individuazione di outliers
Main author: | |
---|---|
Format: | article |
Language: | EN IT |
Published: | mediaGEO soc. coop., 2018 |
Subjects: | |
Online access: | https://doaj.org/article/08698e4e118f4e2baea7e541e8450d77 |
Summary: | The search for and identification of outliers is a fundamental step, generally a preliminary to any processing aimed at obtaining consistent results. The new approach devised for identifying outliers in R² relies on geometric and statistical techniques that are largely independent of the type of data distribution, and it rests on four methodological pillars: clustering, the convex hull peeling technique, a specific metric, and Chebyshev's inequality, which is valid for any type of univariate distribution of values. The modularity and generality of the approach, coupled with the search for and identification of outliers based on strictly statistical parameters, make it a useful everyday tool for those who need to process bivariate data with the assurance that outliers can be identified beforehand. |
---|---|
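
To make the pillars named in the summary more concrete, here is a minimal Python sketch of how convex hull peeling and Chebyshev's inequality might be combined on bivariate data. It is not the article's method: the record does not describe the clustering step or the "specific metric", so clustering is omitted and plain Euclidean distance to the centroid is used as a stand-in. The function names and the parameters `alpha` and `max_layers` are hypothetical. Chebyshev's inequality, P(|X - mu| >= k*sigma) <= 1/k^2, holds for any distribution with finite variance, which is what makes the cutoff distribution-free.

```python
import math
import statistics


def cross(o, a, b):
    """z-component of (a - o) x (b - o); positive for a counter-clockwise turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points):
    """Andrew's monotone chain; returns hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def peel_layers(points, max_layers=None):
    """Convex hull peeling: strip hull vertices repeatedly, outermost layer first."""
    remaining = set(points)
    layers = []
    while remaining and (max_layers is None or len(layers) < max_layers):
        hull = convex_hull(list(remaining))
        layers.append(hull)
        remaining -= set(hull)
    return layers


def chebyshev_k(alpha):
    """Smallest k with 1/k**2 <= alpha, so that P(|X - mu| >= k*sigma) <= alpha."""
    return 1.0 / math.sqrt(alpha)


def flag_outliers(points, alpha=0.05, max_layers=2):
    """Flag points on the outer hull layers whose distance to the centroid
    exceeds the Chebyshev cutoff mu + k*sigma.

    Assumption: Euclidean distance to the centroid stands in for the
    article's metric, which the catalog record does not specify.
    """
    pts = list(set(points))
    cx = sum(x for x, _ in pts) / len(pts)
    cy = sum(y for _, y in pts) / len(pts)
    dist = {p: math.hypot(p[0] - cx, p[1] - cy) for p in pts}
    values = list(dist.values())
    cutoff = statistics.fmean(values) + chebyshev_k(alpha) * statistics.pstdev(values)
    candidates = [p for layer in peel_layers(pts, max_layers) for p in layer]
    return {p for p in candidates if dist[p] > cutoff}


# Usage: an 11 x 11 integer grid as the bulk of the data, plus three planted far points.
grid = [(x, y) for x in range(-5, 6) for y in range(-5, 6)]
planted = [(50, 50), (-60, 40), (55, -45)]
flagged = flag_outliers(grid + planted)
print(sorted(flagged))
```

Restricting the candidates to the outer hull layers means only geometrically extreme points are ever tested, while the Chebyshev cutoff decides which of them are far enough out to be flagged. Because the mean and standard deviation here are computed with the outliers included, the cutoff is conservative; a robust location and scale estimate would be a natural refinement.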