Revisiting the Dissimilarity Representation in the Context of Regression
In machine learning, a natural way to represent an instance is by using a feature vector. However, several studies have shown that this representation may not accurately characterize an object. For classification problems, the dissimilarity paradigm has been proposed as an alternative to the standar...
Guardado en:
Autores principales: | , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
IEEE
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/bcf6a3c7b90c46798ec66106b046576c |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:bcf6a3c7b90c46798ec66106b046576c |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:bcf6a3c7b90c46798ec66106b046576c2021-12-02T00:00:45ZRevisiting the Dissimilarity Representation in the Context of Regression2169-353610.1109/ACCESS.2021.3130127https://doaj.org/article/bcf6a3c7b90c46798ec66106b046576c2021-01-01T00:00:00Zhttps://ieeexplore.ieee.org/document/9624949/https://doaj.org/toc/2169-3536In machine learning, a natural way to represent an instance is by using a feature vector. However, several studies have shown that this representation may not accurately characterize an object. For classification problems, the dissimilarity paradigm has been proposed as an alternative to the standard feature-based approach. Encoding each object by pairwise dissimilarities has been demonstrated to improve the data quality because it mitigates some complexities such as class overlap, small disjuncts, and low-sample size. However, its suitability and performance when applied to regression problems have not been fully explored. This study redefines the dissimilarity representation for regression. To this end, we have carried out an extensive experimental evaluation on 34 datasets using two linear regression models. The results show that the dissimilarity approach decreases the error rates of both the traditional linear regression and the linear model with elastic net regularization, and it also reduces the complexity of most regression datasets.Vicente GarciaJ. Salvador SanchezRafael Martinez-PelaezLuis C. Mendez-GonzalezIEEEarticleData complexitydissimilarity representationlinear modelsregressionElectrical engineering. Electronics. Nuclear engineeringTK1-9971ENIEEE Access, Vol 9, Pp 157043-157051 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Data complexity dissimilarity representation linear models regression Electrical engineering. Electronics. Nuclear engineering TK1-9971 |
spellingShingle |
Data complexity dissimilarity representation linear models regression Electrical engineering. Electronics. Nuclear engineering TK1-9971 Vicente Garcia J. Salvador Sanchez Rafael Martinez-Pelaez Luis C. Mendez-Gonzalez Revisiting the Dissimilarity Representation in the Context of Regression |
description |
In machine learning, a natural way to represent an instance is by using a feature vector. However, several studies have shown that this representation may not accurately characterize an object. For classification problems, the dissimilarity paradigm has been proposed as an alternative to the standard feature-based approach. Encoding each object by pairwise dissimilarities has been demonstrated to improve the data quality because it mitigates some complexities such as class overlap, small disjuncts, and low-sample size. However, its suitability and performance when applied to regression problems have not been fully explored. This study redefines the dissimilarity representation for regression. To this end, we have carried out an extensive experimental evaluation on 34 datasets using two linear regression models. The results show that the dissimilarity approach decreases the error rates of both the traditional linear regression and the linear model with elastic net regularization, and it also reduces the complexity of most regression datasets. |
format |
article |
author |
Vicente Garcia J. Salvador Sanchez Rafael Martinez-Pelaez Luis C. Mendez-Gonzalez |
author_facet |
Vicente Garcia J. Salvador Sanchez Rafael Martinez-Pelaez Luis C. Mendez-Gonzalez |
author_sort |
Vicente Garcia |
title |
Revisiting the Dissimilarity Representation in the Context of Regression |
title_short |
Revisiting the Dissimilarity Representation in the Context of Regression |
title_full |
Revisiting the Dissimilarity Representation in the Context of Regression |
title_fullStr |
Revisiting the Dissimilarity Representation in the Context of Regression |
title_full_unstemmed |
Revisiting the Dissimilarity Representation in the Context of Regression |
title_sort |
revisiting the dissimilarity representation in the context of regression |
publisher |
IEEE |
publishDate |
2021 |
url |
https://doaj.org/article/bcf6a3c7b90c46798ec66106b046576c |
work_keys_str_mv |
AT vicentegarcia revisitingthedissimilarityrepresentationinthecontextofregression AT jsalvadorsanchez revisitingthedissimilarityrepresentationinthecontextofregression AT rafaelmartinezpelaez revisitingthedissimilarityrepresentationinthecontextofregression AT luiscmendezgonzalez revisitingthedissimilarityrepresentationinthecontextofregression |
_version_ |
1718404011026022400 |