Pseudoabsence generation strategies for species distribution models.

<h4>Background</h4>Species distribution models require selection of species, study extent and spatial unit, statistical methods, variables, and assessment metrics. If absence data are not available, another important consideration is pseudoabsence generation. Different strategies for pse...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Brice B Hanberry, Hong S He, Brian J Palik
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2012
Materias:
R
Q
Acceso en línea:https://doaj.org/article/be87cd49db2f436dab33b1d9c6684da8
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:be87cd49db2f436dab33b1d9c6684da8
record_format dspace
spelling oai:doaj.org-article:be87cd49db2f436dab33b1d9c6684da82021-11-18T07:06:49ZPseudoabsence generation strategies for species distribution models.1932-620310.1371/journal.pone.0044486https://doaj.org/article/be87cd49db2f436dab33b1d9c6684da82012-01-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22952985/pdf/?tool=EBIhttps://doaj.org/toc/1932-6203<h4>Background</h4>Species distribution models require selection of species, study extent and spatial unit, statistical methods, variables, and assessment metrics. If absence data are not available, another important consideration is pseudoabsence generation. Different strategies for pseudoabsence generation can produce varying spatial representation of species.<h4>Methodology</h4>We considered model outcomes from four different strategies for generating pseudoabsences. We generating pseudoabsences randomly by 1) selection from the entire study extent, 2) a two-step process of selection first from the entire study extent, followed by selection for pseudoabsences from areas with predicted probability <25%, 3) selection from plots surveyed without detection of species presence, 4) a two-step process of selection first for pseudoabsences from plots surveyed without detection of species presence, followed by selection for pseudoabsences from the areas with predicted probability <25%. We used Random Forests as our statistical method and sixteen predictor variables to model tree species with at least 150 records from Forest Inventory and Analysis surveys in the Laurentian Mixed Forest province of Minnesota.<h4>Conclusions</h4>Pseudoabsence generation strategy completely affected the area predicted as present for species distribution models and may be one of the most influential determinants of models. All the pseudoabsence strategies produced mean AUC values of at least 0.87. More importantly than accuracy metrics, the two-step strategies over-predicted species presence, due to too much environmental distance between the pseudoabsences and recorded presences, whereas models based on random pseudoabsences under-predicted species presence, due to too little environmental distance between the pseudoabsences and recorded presences. Models using pseudoabsences from surveyed plots produced a balance between areas with high and low predicted probabilities and the strongest relationship between density and area with predicted probabilities ≥75%. Because of imperfect accuracy assessment, the best assessment currently may be evaluation of whether the species has been sufficiently but not excessively predicted to occur.Brice B HanberryHong S HeBrian J PalikPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 7, Iss 8, p e44486 (2012)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Brice B Hanberry
Hong S He
Brian J Palik
Pseudoabsence generation strategies for species distribution models.
description <h4>Background</h4>Species distribution models require selection of species, study extent and spatial unit, statistical methods, variables, and assessment metrics. If absence data are not available, another important consideration is pseudoabsence generation. Different strategies for pseudoabsence generation can produce varying spatial representation of species.<h4>Methodology</h4>We considered model outcomes from four different strategies for generating pseudoabsences. We generating pseudoabsences randomly by 1) selection from the entire study extent, 2) a two-step process of selection first from the entire study extent, followed by selection for pseudoabsences from areas with predicted probability <25%, 3) selection from plots surveyed without detection of species presence, 4) a two-step process of selection first for pseudoabsences from plots surveyed without detection of species presence, followed by selection for pseudoabsences from the areas with predicted probability <25%. We used Random Forests as our statistical method and sixteen predictor variables to model tree species with at least 150 records from Forest Inventory and Analysis surveys in the Laurentian Mixed Forest province of Minnesota.<h4>Conclusions</h4>Pseudoabsence generation strategy completely affected the area predicted as present for species distribution models and may be one of the most influential determinants of models. All the pseudoabsence strategies produced mean AUC values of at least 0.87. More importantly than accuracy metrics, the two-step strategies over-predicted species presence, due to too much environmental distance between the pseudoabsences and recorded presences, whereas models based on random pseudoabsences under-predicted species presence, due to too little environmental distance between the pseudoabsences and recorded presences. Models using pseudoabsences from surveyed plots produced a balance between areas with high and low predicted probabilities and the strongest relationship between density and area with predicted probabilities ≥75%. Because of imperfect accuracy assessment, the best assessment currently may be evaluation of whether the species has been sufficiently but not excessively predicted to occur.
format article
author Brice B Hanberry
Hong S He
Brian J Palik
author_facet Brice B Hanberry
Hong S He
Brian J Palik
author_sort Brice B Hanberry
title Pseudoabsence generation strategies for species distribution models.
title_short Pseudoabsence generation strategies for species distribution models.
title_full Pseudoabsence generation strategies for species distribution models.
title_fullStr Pseudoabsence generation strategies for species distribution models.
title_full_unstemmed Pseudoabsence generation strategies for species distribution models.
title_sort pseudoabsence generation strategies for species distribution models.
publisher Public Library of Science (PLoS)
publishDate 2012
url https://doaj.org/article/be87cd49db2f436dab33b1d9c6684da8
work_keys_str_mv AT bricebhanberry pseudoabsencegenerationstrategiesforspeciesdistributionmodels
AT hongshe pseudoabsencegenerationstrategiesforspeciesdistributionmodels
AT brianjpalik pseudoabsencegenerationstrategiesforspeciesdistributionmodels
_version_ 1718423939235971072