File Download
  Links for fulltext
     (May Require Subscription)
Supplementary

Article: An evaluation of eight machine learning regression algorithms for forest aboveground biomass estimation from multiple satellite data products

TitleAn evaluation of eight machine learning regression algorithms for forest aboveground biomass estimation from multiple satellite data products
Authors
KeywordsEnsemble algorithm
Forest biomass
Machine learning algorithms
Model comparison
Satellite data product
Issue Date2020
Citation
Remote Sensing, 2020, v. 12, n. 24, article no. 4015 How to Cite?
AbstractThis study provided a comprehensive evaluation of eight machine learning regression algorithms for forest aboveground biomass (AGB) estimation from satellite data based on leaf area index, canopy height, net primary production, and tree cover data, as well as climatic and topographical data. Some of these algorithms have not been commonly used for forest AGB estimation such as the extremely randomized trees, stochastic gradient boosting, and categorical boosting (CatBoost) regression. For each algorithm, its hyperparameters were optimized using grid search with cross-validation, and the optimal AGB model was developed using the training dataset (80%) and AGB was predicted on the test dataset (20%). Performance metrics, feature importance as well as overestimation and underestimation were considered as indicators for evaluating the performance of an algorithm. To reduce the impacts of the random training-test data split and sampling method on the performance, the above procedures were repeated 50 times for each algorithm under the random sampling, the stratified sampling, and separate modeling scenarios. The results showed that five tree-based ensemble algorithms performed better than the three nonensemble algorithms (multivariate adaptive regression splines, support vector regression, and multilayer perceptron), and the CatBoost algorithm outperformed the other algorithms for AGB estimation. Compared with the random sampling scenario, the stratified sampling scenario and separate modeling did not significantly improve the AGB estimates, but modeling AGB for each forest type separately provided stable results in terms of the contributions of the predictor variables to the AGB estimates. All the algorithms showed forest AGB were underestimated when the AGB values were larger than 210 Mg/ha and overestimated when the AGB values were less than 120 Mg/ha. This study highlighted the capability of ensemble algorithms to improve AGB estimates and the necessity of improving AGB estimates for high and low AGB levels in future studies.
Persistent Identifierhttp://hdl.handle.net/10722/321915
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorZhang, Yuzhen-
dc.contributor.authorMa, Jun-
dc.contributor.authorLiang, Shunlin-
dc.contributor.authorLi, Xisheng-
dc.contributor.authorLi, Manyao-
dc.date.accessioned2022-11-03T02:22:19Z-
dc.date.available2022-11-03T02:22:19Z-
dc.date.issued2020-
dc.identifier.citationRemote Sensing, 2020, v. 12, n. 24, article no. 4015-
dc.identifier.urihttp://hdl.handle.net/10722/321915-
dc.description.abstractThis study provided a comprehensive evaluation of eight machine learning regression algorithms for forest aboveground biomass (AGB) estimation from satellite data based on leaf area index, canopy height, net primary production, and tree cover data, as well as climatic and topographical data. Some of these algorithms have not been commonly used for forest AGB estimation such as the extremely randomized trees, stochastic gradient boosting, and categorical boosting (CatBoost) regression. For each algorithm, its hyperparameters were optimized using grid search with cross-validation, and the optimal AGB model was developed using the training dataset (80%) and AGB was predicted on the test dataset (20%). Performance metrics, feature importance as well as overestimation and underestimation were considered as indicators for evaluating the performance of an algorithm. To reduce the impacts of the random training-test data split and sampling method on the performance, the above procedures were repeated 50 times for each algorithm under the random sampling, the stratified sampling, and separate modeling scenarios. The results showed that five tree-based ensemble algorithms performed better than the three nonensemble algorithms (multivariate adaptive regression splines, support vector regression, and multilayer perceptron), and the CatBoost algorithm outperformed the other algorithms for AGB estimation. Compared with the random sampling scenario, the stratified sampling scenario and separate modeling did not significantly improve the AGB estimates, but modeling AGB for each forest type separately provided stable results in terms of the contributions of the predictor variables to the AGB estimates. All the algorithms showed forest AGB were underestimated when the AGB values were larger than 210 Mg/ha and overestimated when the AGB values were less than 120 Mg/ha. This study highlighted the capability of ensemble algorithms to improve AGB estimates and the necessity of improving AGB estimates for high and low AGB levels in future studies.-
dc.languageeng-
dc.relation.ispartofRemote Sensing-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subjectEnsemble algorithm-
dc.subjectForest biomass-
dc.subjectMachine learning algorithms-
dc.subjectModel comparison-
dc.subjectSatellite data product-
dc.titleAn evaluation of eight machine learning regression algorithms for forest aboveground biomass estimation from multiple satellite data products-
dc.typeArticle-
dc.description.naturepublished_or_final_version-
dc.identifier.doi10.3390/rs12244015-
dc.identifier.scopuseid_2-s2.0-85097586063-
dc.identifier.volume12-
dc.identifier.issue24-
dc.identifier.spagearticle no. 4015-
dc.identifier.epagearticle no. 4015-
dc.identifier.eissn2072-4292-
dc.identifier.isiWOS:000603177100001-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats