Assessing the predictive capability of ensemble tree methods for landslide susceptibility mapping using XGBoost, gradient boosting machine, and random forest

dc.authorid0000-0002-9830-8585en_US
dc.contributor.authorŞahin, Emrehan Kutluğ
dc.date.accessioned2021-06-23T19:54:08Z
dc.date.available2021-06-23T19:54:08Z
dc.date.issued2020
dc.departmentBAİBÜ, Mühendislik Fakültesi, İnşaat Mühendisliği Bölümüen_US
dc.description.abstractDecision tree-based classifier ensemble methods are a machine learning (ML) technique that combines several tree models to produce an effective or optimum predictive model, and that allows well-predictive performance especially compared to a single model. Thus, selecting a proper ML algorithm help us to understand possible future occurrences by analyzing the past more accurate. The main purpose of this study is to produce landslide susceptibility map of the Ayancik district of Sinop province, situated in the Black Sea region of Turkey using three featured regression tree-based ensemble methods including gradient boosting machines (GBM), extreme gradient boosting (XGBoost), and random forest (RF). Fifteen landslide causative factors and 105 landslide locations occurred in the region were used. The landslide inventory map was randomly divided into training (70%) and testing (30%) dataset to construct the RF, XGBoost and GBM prediction models. Symmetrical uncertainty measure was utilized to determine the most important causative factors, and then the selected features were used to construct susceptibility prediction models. The performance of the ensemble models was validated using different accuracy metrics including Area under the curve (AUC), overall accuracy (OA), Root mean square error (RMSE), and Kappa coefficient. Also, the Wilcoxon signed-rank test was used to assess differences between optimum models. The accuracy results showed that the model of XgBoost_Opt model (the model created by optimum factor combination) has the highest prediction capability (OA = 0.8501 and AUC = 0.8976), followed by the RF_opt (OA = 0.8336 and AUC = 0.8860) and GBM_Opt (OA = 0.8244 and AUC = 0.8796). When the Wilcoxon sign-rank test results were analyzed, XgBoost_Opt model, which is the best subset combinations, were confirmed to be statistically significant considering other models. The results showed that, the XGBoost method according to optimum model achieved lower prediction error and higher accuracy results than the other ensemble methods.en_US
dc.identifier.doi10.1007/s42452-020-3060-1
dc.identifier.issn2523-3963
dc.identifier.issn2523-3971
dc.identifier.issue7en_US
dc.identifier.scopus2-s2.0-85096668611en_US
dc.identifier.scopusqualityQ2en_US
dc.identifier.urihttps://doi.org/10.1007/s42452-020-3060-1
dc.identifier.urihttps://hdl.handle.net/20.500.12491/10433
dc.identifier.volume2en_US
dc.identifier.wosWOS:000546987400002en_US
dc.identifier.wosqualityN/Aen_US
dc.indekslendigikaynakWeb of Scienceen_US
dc.indekslendigikaynakScopusen_US
dc.institutionauthorŞahin, Emrehan Kutluğ
dc.language.isoenen_US
dc.publisherSpringer International Publishing Agen_US
dc.relation.ispartofSn Applied Sciencesen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectEnsemble Treeen_US
dc.subjectMachine Learningen_US
dc.subjectExtreme Gradient Boostingen_US
dc.subjectSymmetrical Uncertaintyen_US
dc.subjectFeature Selectionen_US
dc.titleAssessing the predictive capability of ensemble tree methods for landslide susceptibility mapping using XGBoost, gradient boosting machine, and random foresten_US
dc.typeArticleen_US

Dosyalar

Orijinal paket
Listeleniyor 1 - 1 / 1
Küçük Resim Yok
İsim:
emrehan-kutlug-sahin.pdf
Boyut:
3.18 MB
Biçim:
Adobe Portable Document Format
Açıklama:
Tam Metin/Full Text