Performance of machine learning methods in predicting water quality index based on irregular data set: application on Illizi region (Algerian southeast)

Abstract Groundwater quality appraisal is one of the most crucial tasks to ensure safe drinking water sources. Concurrently, a water quality index (WQI) requires some water quality parameters. Conventionally, WQI computation consumes time and is often found with various errors during subindex calcul...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Saber Kouadri, Ahmed Elbeltagi, Abu Reza Md. Towfiqul Islam, Samir Kateb
Formato: article
Lenguaje:EN
Publicado: SpringerOpen 2021
Materias:
Acceso en línea:https://doaj.org/article/899f15adc69d42de8663ca7c9dfb3cb3
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:899f15adc69d42de8663ca7c9dfb3cb3
record_format dspace
spelling oai:doaj.org-article:899f15adc69d42de8663ca7c9dfb3cb32021-11-07T12:22:34ZPerformance of machine learning methods in predicting water quality index based on irregular data set: application on Illizi region (Algerian southeast)10.1007/s13201-021-01528-92190-54872190-5495https://doaj.org/article/899f15adc69d42de8663ca7c9dfb3cb32021-11-01T00:00:00Zhttps://doi.org/10.1007/s13201-021-01528-9https://doaj.org/toc/2190-5487https://doaj.org/toc/2190-5495Abstract Groundwater quality appraisal is one of the most crucial tasks to ensure safe drinking water sources. Concurrently, a water quality index (WQI) requires some water quality parameters. Conventionally, WQI computation consumes time and is often found with various errors during subindex calculation. To this end, 8 artificial intelligence algorithms, e.g., multilinear regression (MLR), random forest (RF), M5P tree (M5P), random subspace (RSS), additive regression (AR), artificial neural network (ANN), support vector regression (SVR), and locally weighted linear regression (LWLR), were employed to generate WQI prediction in Illizi region, southeast Algeria. Using the best subset regression, 12 different input combinations were developed and the strategy of work was based on two scenarios. The first scenario aims to reduce the time consumption in WQI computation, where all parameters were used as inputs. The second scenario intends to show the water quality variation in the critical cases when the necessary analyses are unavailable, whereas all inputs were reduced based on sensitivity analysis. The models were appraised using several statistical metrics including correlation coefficient (R), mean absolute error (MAE), root mean square error (RMSE), relative absolute error (RAE), and root relative square error (RRSE). The results reveal that TDS and TH are the key drivers influencing WQI in the study area. The comparison of performance evaluation metric shows that the MLR model has the higher accuracy compared to other models in the first scenario in terms of 1, 1.4572*10–08, 2.1418*10–08, 1.2573*10–10%, and 3.1708*10–08% for R, MAE, RMSE, RAE, and RRSE, respectively. The second scenario was executed with less error rate by using the RF model with 0.9984, 1.9942, 3.2488, 4.693, and 5.9642 for R, MAE, RMSE, RAE, and RRSE, respectively. The outcomes of this paper would be of interest to water planners in terms of WQI for improving sustainable management plans of groundwater resources.Saber KouadriAhmed ElbeltagiAbu Reza Md. Towfiqul IslamSamir KatebSpringerOpenarticleArtificial intelligenceWater quality indexModellingSensitivity analysisRandom forestWater supply for domestic and industrial purposesTD201-500ENApplied Water Science, Vol 11, Iss 12, Pp 1-20 (2021)
institution DOAJ
collection DOAJ
language EN
topic Artificial intelligence
Water quality index
Modelling
Sensitivity analysis
Random forest
Water supply for domestic and industrial purposes
TD201-500
spellingShingle Artificial intelligence
Water quality index
Modelling
Sensitivity analysis
Random forest
Water supply for domestic and industrial purposes
TD201-500
Saber Kouadri
Ahmed Elbeltagi
Abu Reza Md. Towfiqul Islam
Samir Kateb
Performance of machine learning methods in predicting water quality index based on irregular data set: application on Illizi region (Algerian southeast)
description Abstract Groundwater quality appraisal is one of the most crucial tasks to ensure safe drinking water sources. Concurrently, a water quality index (WQI) requires some water quality parameters. Conventionally, WQI computation consumes time and is often found with various errors during subindex calculation. To this end, 8 artificial intelligence algorithms, e.g., multilinear regression (MLR), random forest (RF), M5P tree (M5P), random subspace (RSS), additive regression (AR), artificial neural network (ANN), support vector regression (SVR), and locally weighted linear regression (LWLR), were employed to generate WQI prediction in Illizi region, southeast Algeria. Using the best subset regression, 12 different input combinations were developed and the strategy of work was based on two scenarios. The first scenario aims to reduce the time consumption in WQI computation, where all parameters were used as inputs. The second scenario intends to show the water quality variation in the critical cases when the necessary analyses are unavailable, whereas all inputs were reduced based on sensitivity analysis. The models were appraised using several statistical metrics including correlation coefficient (R), mean absolute error (MAE), root mean square error (RMSE), relative absolute error (RAE), and root relative square error (RRSE). The results reveal that TDS and TH are the key drivers influencing WQI in the study area. The comparison of performance evaluation metric shows that the MLR model has the higher accuracy compared to other models in the first scenario in terms of 1, 1.4572*10–08, 2.1418*10–08, 1.2573*10–10%, and 3.1708*10–08% for R, MAE, RMSE, RAE, and RRSE, respectively. The second scenario was executed with less error rate by using the RF model with 0.9984, 1.9942, 3.2488, 4.693, and 5.9642 for R, MAE, RMSE, RAE, and RRSE, respectively. The outcomes of this paper would be of interest to water planners in terms of WQI for improving sustainable management plans of groundwater resources.
format article
author Saber Kouadri
Ahmed Elbeltagi
Abu Reza Md. Towfiqul Islam
Samir Kateb
author_facet Saber Kouadri
Ahmed Elbeltagi
Abu Reza Md. Towfiqul Islam
Samir Kateb
author_sort Saber Kouadri
title Performance of machine learning methods in predicting water quality index based on irregular data set: application on Illizi region (Algerian southeast)
title_short Performance of machine learning methods in predicting water quality index based on irregular data set: application on Illizi region (Algerian southeast)
title_full Performance of machine learning methods in predicting water quality index based on irregular data set: application on Illizi region (Algerian southeast)
title_fullStr Performance of machine learning methods in predicting water quality index based on irregular data set: application on Illizi region (Algerian southeast)
title_full_unstemmed Performance of machine learning methods in predicting water quality index based on irregular data set: application on Illizi region (Algerian southeast)
title_sort performance of machine learning methods in predicting water quality index based on irregular data set: application on illizi region (algerian southeast)
publisher SpringerOpen
publishDate 2021
url https://doaj.org/article/899f15adc69d42de8663ca7c9dfb3cb3
work_keys_str_mv AT saberkouadri performanceofmachinelearningmethodsinpredictingwaterqualityindexbasedonirregulardatasetapplicationonilliziregionalgeriansoutheast
AT ahmedelbeltagi performanceofmachinelearningmethodsinpredictingwaterqualityindexbasedonirregulardatasetapplicationonilliziregionalgeriansoutheast
AT aburezamdtowfiqulislam performanceofmachinelearningmethodsinpredictingwaterqualityindexbasedonirregulardatasetapplicationonilliziregionalgeriansoutheast
AT samirkateb performanceofmachinelearningmethodsinpredictingwaterqualityindexbasedonirregulardatasetapplicationonilliziregionalgeriansoutheast
_version_ 1718443500431736832