Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery
Abstract Quantitative structure–activity relationship modeling using machine learning techniques constitutes a complex computational problem, where the identification of the most informative molecular descriptors for predicting a specific target property plays a critical role. Two main general appro...
Guardado en:
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Nature Portfolio
2017
|
Materias: | |
Acceso en línea: | https://doaj.org/article/d95973b8023d4de68ded65e398e04d19 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:d95973b8023d4de68ded65e398e04d19 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:d95973b8023d4de68ded65e398e04d192021-12-02T15:05:05ZHybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery10.1038/s41598-017-02114-32045-2322https://doaj.org/article/d95973b8023d4de68ded65e398e04d192017-05-01T00:00:00Zhttps://doi.org/10.1038/s41598-017-02114-3https://doaj.org/toc/2045-2322Abstract Quantitative structure–activity relationship modeling using machine learning techniques constitutes a complex computational problem, where the identification of the most informative molecular descriptors for predicting a specific target property plays a critical role. Two main general approaches can be used for this modeling procedure: feature selection and feature learning. In this paper, a performance comparative study of two state-of-art methods related to these two approaches is carried out. In particular, regression and classification models for three different issues are inferred using both methods under different experimental scenarios: two drug-like properties, such as blood-brain-barrier and human intestinal absorption, and enantiomeric excess, as a measurement of purity used for chiral substances. Beyond the contrastive analysis of feature selection and feature learning methods as competitive approaches, the hybridization of these strategies is also evaluated based on previous results obtained in material sciences. From the experimental results, it can be concluded that there is not a clear winner between both approaches because the performance depends on the characteristics of the compound databases used for modeling. Nevertheless, in several cases, it was observed that the accuracy of the models can be improved by combining both approaches when the molecular descriptor sets provided by feature selection and feature learning contain complementary information.Ignacio PonzoniVíctor Sebastián-PérezCarlos Requena-TrigueroCarlos RocaMaría J. MartínezFiorella CraveroMónica F. DíazJuan A. PáezRamón Gómez ArrayásJavier AdrioNuria E. CampilloNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 7, Iss 1, Pp 1-19 (2017) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Medicine R Science Q |
spellingShingle |
Medicine R Science Q Ignacio Ponzoni Víctor Sebastián-Pérez Carlos Requena-Triguero Carlos Roca María J. Martínez Fiorella Cravero Mónica F. Díaz Juan A. Páez Ramón Gómez Arrayás Javier Adrio Nuria E. Campillo Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery |
description |
Abstract Quantitative structure–activity relationship modeling using machine learning techniques constitutes a complex computational problem, where the identification of the most informative molecular descriptors for predicting a specific target property plays a critical role. Two main general approaches can be used for this modeling procedure: feature selection and feature learning. In this paper, a performance comparative study of two state-of-art methods related to these two approaches is carried out. In particular, regression and classification models for three different issues are inferred using both methods under different experimental scenarios: two drug-like properties, such as blood-brain-barrier and human intestinal absorption, and enantiomeric excess, as a measurement of purity used for chiral substances. Beyond the contrastive analysis of feature selection and feature learning methods as competitive approaches, the hybridization of these strategies is also evaluated based on previous results obtained in material sciences. From the experimental results, it can be concluded that there is not a clear winner between both approaches because the performance depends on the characteristics of the compound databases used for modeling. Nevertheless, in several cases, it was observed that the accuracy of the models can be improved by combining both approaches when the molecular descriptor sets provided by feature selection and feature learning contain complementary information. |
format |
article |
author |
Ignacio Ponzoni Víctor Sebastián-Pérez Carlos Requena-Triguero Carlos Roca María J. Martínez Fiorella Cravero Mónica F. Díaz Juan A. Páez Ramón Gómez Arrayás Javier Adrio Nuria E. Campillo |
author_facet |
Ignacio Ponzoni Víctor Sebastián-Pérez Carlos Requena-Triguero Carlos Roca María J. Martínez Fiorella Cravero Mónica F. Díaz Juan A. Páez Ramón Gómez Arrayás Javier Adrio Nuria E. Campillo |
author_sort |
Ignacio Ponzoni |
title |
Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery |
title_short |
Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery |
title_full |
Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery |
title_fullStr |
Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery |
title_full_unstemmed |
Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery |
title_sort |
hybridizing feature selection and feature learning approaches in qsar modeling for drug discovery |
publisher |
Nature Portfolio |
publishDate |
2017 |
url |
https://doaj.org/article/d95973b8023d4de68ded65e398e04d19 |
work_keys_str_mv |
AT ignacioponzoni hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery AT victorsebastianperez hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery AT carlosrequenatriguero hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery AT carlosroca hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery AT mariajmartinez hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery AT fiorellacravero hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery AT monicafdiaz hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery AT juanapaez hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery AT ramongomezarrayas hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery AT javieradrio hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery AT nuriaecampillo hybridizingfeatureselectionandfeaturelearningapproachesinqsarmodelingfordrugdiscovery |
_version_ |
1718388986656849920 |