Comparative analysis of gradient boosting algorithms for landslide susceptibility mapping

Şahin, Emrehan Kutlug

dc.authorid	0000-0002-9830-8585	en_US
dc.contributor.author	Şahin, Emrehan Kutlug
dc.date.accessioned	2021-06-23T19:17:09Z
dc.date.available	2021-06-23T19:17:09Z
dc.date.issued	2020
dc.department	BAİBÜ, Faculty of Engineering, Department of Civil Engineering	en_US
dc.description.abstract	The aim of the study is to compare four recent gradient boosting algorithms, namely Gradient Boosting Machine (GBM), Categorical Boosting (CatBoost), Extreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM), for modelling landslide susceptibility (LS). In the first step of the study, a geodatabase including the landslide inventory map and the landslide conditioning factors was constructed. In the second step, a chi-square (CHI) statistic-based feature selection (FS) technique was used to compute the importance of the landslide causative factors. In the third step, tree-based ensemble learning algorithms were applied to predict the potential distribution of landslide susceptibility. The prediction performance of these methods was also compared with that of the Random Forest (RF) ensemble method. Finally, the prediction capabilities of the methods were assessed using overall accuracy (Acc), area under the receiver operating characteristic curve (AUC), kappa index, root mean square error (RMSE), and F score measures. For further evaluation, McNemar's test was used to assess the statistical significance of the differences between the four gradient boosting models. The accuracy results indicated that the CatBoost model had the highest prediction capability (Acc = 0.8503 and AUC = 0.8975), followed by XGBoost (Acc = 0.8336 and AUC = 0.8860), LightGBM (Acc = 0.8244 and AUC = 0.8796), and GBM (Acc = 0.8080 and AUC = 0.8685). On the other hand, the accuracy measures considered in this study showed that the RF method had the lowest prediction capability compared with the others. Although the individual performances of the methods were found to be at an acceptable level, the CatBoost method showed superior performance compared with the others with respect to the AUC and Acc values estimated in this study.
The results of the study confirmed that these relatively new ensemble learning techniques are efficient and robust for producing LS maps; furthermore, it is probable that these algorithms will be preferred more often in future studies because of their robustness.	en_US
dc.identifier.doi	10.1080/10106049.2020.1831623
dc.identifier.issn	1010-6049
dc.identifier.scopus	2-s2.0-85092690763	en_US
dc.identifier.scopusquality	Q1	en_US
dc.identifier.uri	https://doi.org/10.1080/10106049.2020.1831623
dc.identifier.uri	https://hdl.handle.net/20.500.12491/5250
dc.identifier.wos	WOS:000577651100001	en_US
dc.identifier.wosquality	Q1	en_US
dc.indekslendigikaynak	Web of Science	en_US
dc.indekslendigikaynak	Scopus	en_US
dc.institutionauthor	Şahin, Emrehan Kutlug
dc.language.iso	en	en_US
dc.publisher	Taylor & Francis Ltd	en_US
dc.relation.ispartof	Geocarto International	en_US
dc.relation.publicationcategory	Article - International Refereed Journal - Institutional Faculty Member	en_US
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.subject	Landslide Susceptibility	en_US
dc.subject	CatBoost	en_US
dc.subject	XGBoost	en_US
dc.subject	LightGBM	en_US
dc.subject	Ensemble Tree Methods	en_US
dc.title	Comparative analysis of gradient boosting algorithms for landslide susceptibility mapping	en_US
dc.type	Article	en_US
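
The abstract reports that McNemar's test was used to assess whether the differences between the gradient boosting models are statistically significant. A minimal Python sketch of that kind of pairwise comparison follows. It assumes the chi-square form of the test with continuity correction, applied to per-sample correctness on a shared test set; this record does not state which variant the study used, and the function name `mcnemar_test` is hypothetical.

```python
import math


def mcnemar_test(y_true, pred_a, pred_b):
    """McNemar's chi-square test (continuity-corrected) for two classifiers.

    Assumption: both models were evaluated on the same test samples.
    Returns (statistic, p_value).
    """
    # b: samples model A classified correctly and model B misclassified
    b = sum(1 for t, a, m in zip(y_true, pred_a, pred_b) if a == t and m != t)
    # c: samples model A misclassified and model B classified correctly
    c = sum(1 for t, a, m in zip(y_true, pred_a, pred_b) if a != t and m == t)

    if b + c == 0:
        # No discordant pairs: the two models never disagree in correctness.
        return 0.0, 1.0

    statistic = (abs(b - c) - 1) ** 2 / (b + c)
    # Survival function of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value
```

Under this sketch, a p-value below the chosen significance level (commonly 0.05) would suggest that the two models' error patterns differ by more than chance; only the discordant pairs contribute to the statistic.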

Files

Original bundle

Name: emrehan-kutlug-sahin.pdf
Size: 8.6 MB
Format: Adobe Portable Document Format
Description: Full text

Collection

WoS Indexed Publications Collection
Department of Civil Engineering Collection
Scopus Indexed Publications Collection