File Download

There are no files associated with this item.

Supplementary

Conference Paper: Uncertainties of machine learning in predicting the hydrological responses of LID practices

TitleUncertainties of machine learning in predicting the hydrological responses of LID practices
Authors
KeywordsLow impact development
Sustainable drainage system
Machine learning
Stormwater management
Uncertainty analysis
Issue Date2020
PublisherIAHR.
Citation
22nd Congress of the International Association for Hydro-Environment Engineering and Research-Asia Pacific Division (IAHR-APD), Hokkaido, Japan, 14-17 September 2020. In Proceedings of the 22nd IAHR APD Congress 2020, Sapporo, Japan How to Cite?
AbstractLow impact development (LID) practices, such as green roofs and bioretention cells, are regarded as environmentally-friendly alternatives to the conventional drainage infrastructures. It is essential to accurately predict the hydrological responses of LID practices for assessing and optimizing LID designs. However, the accuracy of the commonly used process-based hydrological models is sometimes affected by model assumptions and the availability of field measurements for calibration. Machine learning methods can potentially avoid these issues by directly modeling the correlation between the input (e.g., rainfall time series) and the response (e.g., outflow hydrograph) of a system. However, considerable uncertainties are involved when training machine learning models. As a case study, the correlation between rainfall time series and outflow rates in an LID site in the U.S. is modeled using 11 commonly used machine learning models, including random forest, k-nearest neighbors, and gradient boosting machine. These models are trained on high temporal resolution data using formal machine learning procedures, which include feature engineering, pre-processing, model tuning, and resampling. Different methods are used in the training procedures for assessing the involved uncertainties. For example, in feature engineering, the original high-resolution time series is transformed into different sets of features (e.g., mean and peak rainfall intensity in the past two hours) which are used as input to machine learning models, and different types of transformations are used to pre- process these features. The results show that some machine learning models can achieve comparable or better prediction accuracy when compared to process-based models, and performance of different machine learning models can vary significantly. The feature engineering and the resampling procedures are found to have significant impacts on the quality of the trained models. Evaluating multiple machine learning models and using various methods in model training are crucial for assessing the uncertainties involved in machine learning.
DescriptionSession 6: Hydroinformatics
Persistent Identifierhttp://hdl.handle.net/10722/320357

 

DC FieldValueLanguage
dc.contributor.authorYang, Y-
dc.contributor.authorChui, TFM-
dc.date.accessioned2022-10-21T07:51:47Z-
dc.date.available2022-10-21T07:51:47Z-
dc.date.issued2020-
dc.identifier.citation22nd Congress of the International Association for Hydro-Environment Engineering and Research-Asia Pacific Division (IAHR-APD), Hokkaido, Japan, 14-17 September 2020. In Proceedings of the 22nd IAHR APD Congress 2020, Sapporo, Japan-
dc.identifier.urihttp://hdl.handle.net/10722/320357-
dc.descriptionSession 6: Hydroinformatics-
dc.description.abstractLow impact development (LID) practices, such as green roofs and bioretention cells, are regarded as environmentally-friendly alternatives to the conventional drainage infrastructures. It is essential to accurately predict the hydrological responses of LID practices for assessing and optimizing LID designs. However, the accuracy of the commonly used process-based hydrological models is sometimes affected by model assumptions and the availability of field measurements for calibration. Machine learning methods can potentially avoid these issues by directly modeling the correlation between the input (e.g., rainfall time series) and the response (e.g., outflow hydrograph) of a system. However, considerable uncertainties are involved when training machine learning models. As a case study, the correlation between rainfall time series and outflow rates in an LID site in the U.S. is modeled using 11 commonly used machine learning models, including random forest, k-nearest neighbors, and gradient boosting machine. These models are trained on high temporal resolution data using formal machine learning procedures, which include feature engineering, pre-processing, model tuning, and resampling. Different methods are used in the training procedures for assessing the involved uncertainties. For example, in feature engineering, the original high-resolution time series is transformed into different sets of features (e.g., mean and peak rainfall intensity in the past two hours) which are used as input to machine learning models, and different types of transformations are used to pre- process these features. The results show that some machine learning models can achieve comparable or better prediction accuracy when compared to process-based models, and performance of different machine learning models can vary significantly. The feature engineering and the resampling procedures are found to have significant impacts on the quality of the trained models. Evaluating multiple machine learning models and using various methods in model training are crucial for assessing the uncertainties involved in machine learning.-
dc.languageeng-
dc.publisherIAHR.-
dc.relation.ispartofProceedings of the 22nd IAHR APD Congress 2020, Sapporo, Japan-
dc.subjectLow impact development-
dc.subjectSustainable drainage system-
dc.subjectMachine learning-
dc.subjectStormwater management-
dc.subjectUncertainty analysis-
dc.titleUncertainties of machine learning in predicting the hydrological responses of LID practices-
dc.typeConference_Paper-
dc.identifier.emailYang, Y: yyang28@hku.hk-
dc.identifier.emailChui, TFM: maychui@hku.hk-
dc.identifier.authorityChui, TFM=rp01696-
dc.identifier.hkuros339805-
dc.identifier.volumeSession 6 Hydroinformatics-
dc.publisher.placeJapan-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats