An investigation of feature selection methods for soil liquefaction prediction based on tree-based ensemble algorithms using AdaBoost, gradient boosting, and XGBoost

dc.authorid0000-0003-2520-4395en_US
dc.authorid0000-0002-9830-8585en_US
dc.contributor.authorDemir, Selçuk
dc.contributor.authorŞahin, Emrehan Kutluğ
dc.date.accessioned2023-11-13T13:21:32Z
dc.date.available2023-11-13T13:21:32Z
dc.date.issued2023en_US
dc.departmentBAİBÜ, Mühendislik Fakültesi, İnşaat Mühendisliği Bölümüen_US
dc.description.abstractPrevious major earthquake events have revealed that soils susceptible to liquefaction are one of the factors causing significant damages to the structures. Therefore, accurate prediction of the liquefaction phenomenon is an important task in earthquake engineering. Over the past decade, several researchers have been extensively applied machine learning (ML) methods to predict soil liquefaction. This paper presents the prediction of soil liquefaction from the SPT dataset by using relatively new and robust tree-based ensemble algorithms, namely Adaptive Boosting, Gradient Boosting Machine, and eXtreme Gradient Boosting (XGBoost). The innovation points introduced in this paper are presented briefly as follows. Firstly, Stratified Random Sampling was utilized to ensure equalized sampling between each class selection. Secondly, feature selection methods such as Recursive Feature Elimination, Boruta, and Stepwise Regression were applied to develop models with a high degree of accuracy and minimal complexity by selecting the variables with significant predictive features. Thirdly, the performance of ML algorithms with feature selection methods was compared in terms of four performance metrics, Overall Accuracy, Precision, Recall, and F-measure to select the best model. Lastly, the best predictive model was determined using a statistical significance test called Wilcoxon's sign rank test. Furthermore, computational cost analyses of the tree-based ensemble algorithms were performed based on parallel and non-parallel processing. The results of the study suggest that all developed tree-based ensemble models could reliably estimate soil liquefaction. In conclusion, according to both validation and statistical results, the XGBoost with the Boruta model achieved the most stable and better prediction performance than the other models in all considered cases.en_US
dc.identifier.citationDemir, S., & Sahin, E. K. (2023). An investigation of feature selection methods for soil liquefaction prediction based on tree-based ensemble algorithms using AdaBoost, gradient boosting, and XGBoost. Neural Computing and Applications, 35(4), 3173-3190.en_US
dc.identifier.doi10.1007/s00521-022-07856-4
dc.identifier.endpage3190en_US
dc.identifier.issn0941-0643
dc.identifier.issn1433-3058
dc.identifier.issue4en_US
dc.identifier.scopus2-s2.0-85139676985en_US
dc.identifier.scopusqualityQ1en_US
dc.identifier.startpage3173en_US
dc.identifier.urihttp://dx.doi.org/10.1007/s00521-022-07856-4
dc.identifier.urihttps://hdl.handle.net/20.500.12491/11830
dc.identifier.volume35en_US
dc.identifier.wosWOS:000865154800006en_US
dc.identifier.wosqualityQ2en_US
dc.indekslendigikaynakWeb of Scienceen_US
dc.indekslendigikaynakScopusen_US
dc.institutionauthorDemir, Selçuk
dc.institutionauthorŞahin, Emrehan Kutluğ
dc.language.isoenen_US
dc.publisherSpringer London Ltden_US
dc.relation.ispartofNeural Computing and Applicationsen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectAdaBoosten_US
dc.subjectBorutaen_US
dc.subjectLiquefactionen_US
dc.subjectSupport Vector Machinesen_US
dc.subjectDeterministic Assessmenten_US
dc.subjectGene Selectionen_US
dc.titleAn investigation of feature selection methods for soil liquefaction prediction based on tree-based ensemble algorithms using AdaBoost, gradient boosting, and XGBoosten_US
dc.typeArticleen_US

Dosyalar

Orijinal paket
Listeleniyor 1 - 1 / 1
Küçük Resim Yok
İsim:
selcuk-demir.pdf
Boyut:
2.54 MB
Biçim:
Adobe Portable Document Format
Açıklama:
Tam Metin/Full Text
Lisans paketi
Listeleniyor 1 - 1 / 1
Küçük Resim Yok
İsim:
license.txt
Boyut:
1.44 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: