Enhancing the pattern recognition capacity of machine learning techniques: The importance of feature positioning
We design several algorithms representing evaluation processes of different complexity, ranging from basic environments based on a predetermined number of features to complex structures involving alternatives defined through decision trees whose number of nodes is determined by the cardinality of th...
Guardado en:
Autores principales: | , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Elsevier
2022
|
Materias: | |
Acceso en línea: | https://doaj.org/article/befbd61f2c784f028f2de8f0c3ebb23d |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:befbd61f2c784f028f2de8f0c3ebb23d |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:befbd61f2c784f028f2de8f0c3ebb23d2021-11-14T04:36:00ZEnhancing the pattern recognition capacity of machine learning techniques: The importance of feature positioning2666-827010.1016/j.mlwa.2021.100196https://doaj.org/article/befbd61f2c784f028f2de8f0c3ebb23d2022-03-01T00:00:00Zhttp://www.sciencedirect.com/science/article/pii/S2666827021000980https://doaj.org/toc/2666-8270We design several algorithms representing evaluation processes of different complexity, ranging from basic environments based on a predetermined number of features to complex structures involving alternatives defined through decision trees whose number of nodes is determined by the cardinality of the respective power sets. The sequential structure of these evaluation processes builds on the information retrieval behavior of users in online search environments. The algorithms generate two strings of data, namely, numerical evaluations determining the retrieval behavior of users and the subsequent choices made by the latter. The way the output obtained from the algorithms is inputted within the vectors summarizing the complexity of the evaluation processes conditions the capacity of machine learning techniques to categorize them correctly. The main purpose of the research is to illustrate numerically two main results. First, machine learning techniques categorize processes correctly even if their characteristic features are presented in a way that prevents their identification using standard statistical techniques. Second, the accuracy of the categorization capacities of these techniques can be substantially enhanced by describing the retrieval processes in the way required to implement standard statistical analyses. We perform a battery of tests using machine learning techniques to demonstrate and analyze these results. Their applicability to classification and prediction problems in medical environments, particularly those constrained by the quality of the data available, is emphasized.Debora Di CaprioFrancisco J. Santos-ArteagaElsevierarticleFeature positioningInformation retrievalPattern recognitionMachine learningDecision treesCyberneticsQ300-390Electronic computers. Computer scienceQA75.5-76.95ENMachine Learning with Applications, Vol 7, Iss , Pp 100196- (2022) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Feature positioning Information retrieval Pattern recognition Machine learning Decision trees Cybernetics Q300-390 Electronic computers. Computer science QA75.5-76.95 |
spellingShingle |
Feature positioning Information retrieval Pattern recognition Machine learning Decision trees Cybernetics Q300-390 Electronic computers. Computer science QA75.5-76.95 Debora Di Caprio Francisco J. Santos-Arteaga Enhancing the pattern recognition capacity of machine learning techniques: The importance of feature positioning |
description |
We design several algorithms representing evaluation processes of different complexity, ranging from basic environments based on a predetermined number of features to complex structures involving alternatives defined through decision trees whose number of nodes is determined by the cardinality of the respective power sets. The sequential structure of these evaluation processes builds on the information retrieval behavior of users in online search environments. The algorithms generate two strings of data, namely, numerical evaluations determining the retrieval behavior of users and the subsequent choices made by the latter. The way the output obtained from the algorithms is inputted within the vectors summarizing the complexity of the evaluation processes conditions the capacity of machine learning techniques to categorize them correctly. The main purpose of the research is to illustrate numerically two main results. First, machine learning techniques categorize processes correctly even if their characteristic features are presented in a way that prevents their identification using standard statistical techniques. Second, the accuracy of the categorization capacities of these techniques can be substantially enhanced by describing the retrieval processes in the way required to implement standard statistical analyses. We perform a battery of tests using machine learning techniques to demonstrate and analyze these results. Their applicability to classification and prediction problems in medical environments, particularly those constrained by the quality of the data available, is emphasized. |
format |
article |
author |
Debora Di Caprio Francisco J. Santos-Arteaga |
author_facet |
Debora Di Caprio Francisco J. Santos-Arteaga |
author_sort |
Debora Di Caprio |
title |
Enhancing the pattern recognition capacity of machine learning techniques: The importance of feature positioning |
title_short |
Enhancing the pattern recognition capacity of machine learning techniques: The importance of feature positioning |
title_full |
Enhancing the pattern recognition capacity of machine learning techniques: The importance of feature positioning |
title_fullStr |
Enhancing the pattern recognition capacity of machine learning techniques: The importance of feature positioning |
title_full_unstemmed |
Enhancing the pattern recognition capacity of machine learning techniques: The importance of feature positioning |
title_sort |
enhancing the pattern recognition capacity of machine learning techniques: the importance of feature positioning |
publisher |
Elsevier |
publishDate |
2022 |
url |
https://doaj.org/article/befbd61f2c784f028f2de8f0c3ebb23d |
work_keys_str_mv |
AT deboradicaprio enhancingthepatternrecognitioncapacityofmachinelearningtechniquestheimportanceoffeaturepositioning AT franciscojsantosarteaga enhancingthepatternrecognitioncapacityofmachinelearningtechniquestheimportanceoffeaturepositioning |
_version_ |
1718429921291796480 |