A fuzzy proximity relation approach for outlier detection in the mixed dataset by using rough entropy-based weighted density method

Data mining is an emerging technology where researchers explore innovative ideas in different domains, particularly detecting anomalies. Instances in the dataset which considerably deviate from others by their common patterns are known as anomalies. The state of being ambiguous and not affording cer...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: T. Sangeetha, Geetha Mary A
Formato: article
Lenguaje:EN
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://doaj.org/article/a824f40922d94837bf595332b7694a55
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:a824f40922d94837bf595332b7694a55
record_format dspace
spelling oai:doaj.org-article:a824f40922d94837bf595332b7694a552021-11-26T04:40:48ZA fuzzy proximity relation approach for outlier detection in the mixed dataset by using rough entropy-based weighted density method2666-222110.1016/j.socl.2021.100027https://doaj.org/article/a824f40922d94837bf595332b7694a552021-12-01T00:00:00Zhttp://www.sciencedirect.com/science/article/pii/S2666222121000162https://doaj.org/toc/2666-2221Data mining is an emerging technology where researchers explore innovative ideas in different domains, particularly detecting anomalies. Instances in the dataset which considerably deviate from others by their common patterns are known as anomalies. The state of being ambiguous and not affording certainty of data exists in this world of nature. Rough Set Theory is a proven methodology which deals with ambiguity and uncertainty of data. Research works that have been done until this point were focused on numeric or categorical type, which fails when the attributes are mixed type. By using fuzzy proximity and ordering relations, the numerical data has been converted to categorical data. This article presented an idea for detecting outliers in mixed data where the weighted density values of attributes and objects are calculated. The proposed approach has been compared with existing outlier detection methods by taking the hiring dataset as an example and benchmarked with Harvard dataverse datasets to prove its efficiency and performance.T. SangeethaGeetha Mary AElsevierarticleData MiningEntropyFuzzy ProximityMixed DataRough setsWeighted DensityInformation technologyT58.5-58.64Electronic computers. Computer scienceQA75.5-76.95ENSoft Computing Letters, Vol 3, Iss , Pp 100027- (2021)
institution DOAJ
collection DOAJ
language EN
topic Data Mining
Entropy
Fuzzy Proximity
Mixed Data
Rough sets
Weighted Density
Information technology
T58.5-58.64
Electronic computers. Computer science
QA75.5-76.95
spellingShingle Data Mining
Entropy
Fuzzy Proximity
Mixed Data
Rough sets
Weighted Density
Information technology
T58.5-58.64
Electronic computers. Computer science
QA75.5-76.95
T. Sangeetha
Geetha Mary A
A fuzzy proximity relation approach for outlier detection in the mixed dataset by using rough entropy-based weighted density method
description Data mining is an emerging technology where researchers explore innovative ideas in different domains, particularly detecting anomalies. Instances in the dataset which considerably deviate from others by their common patterns are known as anomalies. The state of being ambiguous and not affording certainty of data exists in this world of nature. Rough Set Theory is a proven methodology which deals with ambiguity and uncertainty of data. Research works that have been done until this point were focused on numeric or categorical type, which fails when the attributes are mixed type. By using fuzzy proximity and ordering relations, the numerical data has been converted to categorical data. This article presented an idea for detecting outliers in mixed data where the weighted density values of attributes and objects are calculated. The proposed approach has been compared with existing outlier detection methods by taking the hiring dataset as an example and benchmarked with Harvard dataverse datasets to prove its efficiency and performance.
format article
author T. Sangeetha
Geetha Mary A
author_facet T. Sangeetha
Geetha Mary A
author_sort T. Sangeetha
title A fuzzy proximity relation approach for outlier detection in the mixed dataset by using rough entropy-based weighted density method
title_short A fuzzy proximity relation approach for outlier detection in the mixed dataset by using rough entropy-based weighted density method
title_full A fuzzy proximity relation approach for outlier detection in the mixed dataset by using rough entropy-based weighted density method
title_fullStr A fuzzy proximity relation approach for outlier detection in the mixed dataset by using rough entropy-based weighted density method
title_full_unstemmed A fuzzy proximity relation approach for outlier detection in the mixed dataset by using rough entropy-based weighted density method
title_sort fuzzy proximity relation approach for outlier detection in the mixed dataset by using rough entropy-based weighted density method
publisher Elsevier
publishDate 2021
url https://doaj.org/article/a824f40922d94837bf595332b7694a55
work_keys_str_mv AT tsangeetha afuzzyproximityrelationapproachforoutlierdetectioninthemixeddatasetbyusingroughentropybasedweighteddensitymethod
AT geethamarya afuzzyproximityrelationapproachforoutlierdetectioninthemixeddatasetbyusingroughentropybasedweighteddensitymethod
AT tsangeetha fuzzyproximityrelationapproachforoutlierdetectioninthemixeddatasetbyusingroughentropybasedweighteddensitymethod
AT geethamarya fuzzyproximityrelationapproachforoutlierdetectioninthemixeddatasetbyusingroughentropybasedweighteddensitymethod
_version_ 1718409799692976128