Design flood estimation for global river networks based on machine learning models

<p>Design flood estimation is a fundamental task in hydrology. In this research, we propose a machine-learning-based approach to estimate design floods globally. This approach involves three stages: (i) estimating at-site flood frequency curves for global gauging stations using the Anderson–Da...

Full description

Saved in:

Bibliographic Details
Main Authors:	G. Zhao, P. Bates, J. Neal, B. Pang
Format:	article
Language:	EN
Published:	Copernicus Publications 2021
Subjects:	Technology T Environmental technology. Sanitary engineering TD1-1066 Geography. Anthropology. Recreation G Environmental sciences GE1-350
Online Access:	https://doaj.org/article/cc686fbb87c74b4680dbf28a08067fb8
Tags:	Add Tag No Tags, Be the first to tag this record!

id	oai:doaj.org-article:cc686fbb87c74b4680dbf28a08067fb8
record_format	dspace
spelling	oai:doaj.org-article:cc686fbb87c74b4680dbf28a08067fb82021-11-22T09:30:22ZDesign flood estimation for global river networks based on machine learning models10.5194/hess-25-5981-20211027-56061607-7938https://doaj.org/article/cc686fbb87c74b4680dbf28a08067fb82021-11-01T00:00:00Zhttps://hess.copernicus.org/articles/25/5981/2021/hess-25-5981-2021.pdfhttps://doaj.org/toc/1027-5606https://doaj.org/toc/1607-7938<p>Design flood estimation is a fundamental task in hydrology. In this research, we propose a machine-learning-based approach to estimate design floods globally. This approach involves three stages: (i) estimating at-site flood frequency curves for global gauging stations using the Anderson–Darling test and a Bayesian Markov chain Monte Carlo (MCMC) method; (ii) clustering these stations into subgroups using a <span class="inline-formula"><i>K</i></span>-means model based on 12 globally available catchment descriptors; and (iii) developing a regression model in each subgroup for regional design flood estimation using the same descriptors. A total of 11 793 stations globally were selected for model development, and three widely used regression models were compared for design flood estimation. The results showed that (1) the proposed approach achieved the highest accuracy for design flood estimation when using all 12 descriptors for clustering; and the performance of the regression was improved by considering more descriptors during training and validation; (2) a support vector machine regression provided the highest prediction performance amongst all regression models tested, with a root mean square normalised error of 0.708 for 100-year return period flood estimation; (3) 100-year design floods in tropical, arid, temperate, cold and polar climate zones could be reliably estimated (i.e. <span class="inline-formula"><i><</i>±</span>25 % error), with relative mean bias (RBIAS) values of <span class="inline-formula">−</span>0.199, <span class="inline-formula">−</span>0.233, <span class="inline-formula">−</span>0.169, 0.179 and <span class="inline-formula">−</span>0.091 respectively; (4) the machine-learning-based approach developed in this paper showed considerable improvement over the index-flood-based method introduced by Smith et al. (2015, <a href="https://doi.org/10.1002/2014WR015814">https://doi.org/10.1002/2014WR015814</a>) for design flood estimation at global scales; and (5) the average RBIAS in estimation is less than 18 % for 10-, 20-, 50- and 100-year design floods. We conclude that the proposed approach is a valid method to estimate design floods anywhere on the global river network, improving our prediction of the flood hazard, especially in ungauged areas.</p>G. ZhaoP. BatesP. BatesJ. NealJ. NealB. PangCopernicus PublicationsarticleTechnologyTEnvironmental technology. Sanitary engineeringTD1-1066Geography. Anthropology. RecreationGEnvironmental sciencesGE1-350ENHydrology and Earth System Sciences, Vol 25, Pp 5981-5999 (2021)
institution	DOAJ
collection	DOAJ
language	EN
topic	Technology T Environmental technology. Sanitary engineering TD1-1066 Geography. Anthropology. Recreation G Environmental sciences GE1-350
spellingShingle	Technology T Environmental technology. Sanitary engineering TD1-1066 Geography. Anthropology. Recreation G Environmental sciences GE1-350 G. Zhao P. Bates P. Bates J. Neal J. Neal B. Pang Design flood estimation for global river networks based on machine learning models
description	<p>Design flood estimation is a fundamental task in hydrology. In this research, we propose a machine-learning-based approach to estimate design floods globally. This approach involves three stages: (i) estimating at-site flood frequency curves for global gauging stations using the Anderson–Darling test and a Bayesian Markov chain Monte Carlo (MCMC) method; (ii) clustering these stations into subgroups using a <span class="inline-formula"><i>K</i></span>-means model based on 12 globally available catchment descriptors; and (iii) developing a regression model in each subgroup for regional design flood estimation using the same descriptors. A total of 11 793 stations globally were selected for model development, and three widely used regression models were compared for design flood estimation. The results showed that (1) the proposed approach achieved the highest accuracy for design flood estimation when using all 12 descriptors for clustering; and the performance of the regression was improved by considering more descriptors during training and validation; (2) a support vector machine regression provided the highest prediction performance amongst all regression models tested, with a root mean square normalised error of 0.708 for 100-year return period flood estimation; (3) 100-year design floods in tropical, arid, temperate, cold and polar climate zones could be reliably estimated (i.e. <span class="inline-formula"><i><</i>±</span>25 % error), with relative mean bias (RBIAS) values of <span class="inline-formula">−</span>0.199, <span class="inline-formula">−</span>0.233, <span class="inline-formula">−</span>0.169, 0.179 and <span class="inline-formula">−</span>0.091 respectively; (4) the machine-learning-based approach developed in this paper showed considerable improvement over the index-flood-based method introduced by Smith et al. (2015, <a href="https://doi.org/10.1002/2014WR015814">https://doi.org/10.1002/2014WR015814</a>) for design flood estimation at global scales; and (5) the average RBIAS in estimation is less than 18 % for 10-, 20-, 50- and 100-year design floods. We conclude that the proposed approach is a valid method to estimate design floods anywhere on the global river network, improving our prediction of the flood hazard, especially in ungauged areas.</p>
format	article
author	G. Zhao P. Bates P. Bates J. Neal J. Neal B. Pang
author_facet	G. Zhao P. Bates P. Bates J. Neal J. Neal B. Pang
author_sort	G. Zhao
title	Design flood estimation for global river networks based on machine learning models
title_short	Design flood estimation for global river networks based on machine learning models
title_full	Design flood estimation for global river networks based on machine learning models
title_fullStr	Design flood estimation for global river networks based on machine learning models
title_full_unstemmed	Design flood estimation for global river networks based on machine learning models
title_sort	design flood estimation for global river networks based on machine learning models
publisher	Copernicus Publications
publishDate	2021
url	https://doaj.org/article/cc686fbb87c74b4680dbf28a08067fb8
work_keys_str_mv	AT gzhao designfloodestimationforglobalrivernetworksbasedonmachinelearningmodels AT pbates designfloodestimationforglobalrivernetworksbasedonmachinelearningmodels AT pbates designfloodestimationforglobalrivernetworksbasedonmachinelearningmodels AT jneal designfloodestimationforglobalrivernetworksbasedonmachinelearningmodels AT jneal designfloodestimationforglobalrivernetworksbasedonmachinelearningmodels AT bpang designfloodestimationforglobalrivernetworksbasedonmachinelearningmodels
_version_	1718417821009969152

Design flood estimation for global river networks based on machine learning models

Similar Items