Prediction and interpretation of pathogenic bacteria occurrence at a recreational beach using data-driven algorithms
- Authors
- Jang, Jiyi; Abbas, Ather; Kim, Hyein; Rhee, Chaeyoung; Shin, Seung Gu; Chun, Jong Ahn; Baek, Sangsoo; Cho, Kyung Hwa
- Issue Date
- Dec-2023
- Publisher
- Elsevier B.V.
- Keywords
- Cumulative importance features; Deep-learning algorithms; Interpretable models; Simulation; Strategy modeling
- Citation
- Ecological Informatics, v.78
- Indexed
- SCIE
SCOPUS
- Journal Title
- Ecological Informatics
- Volume
- 78
- URI
- https://scholarworks.gnu.ac.kr/handle/sw.gnu/68952
- DOI
- 10.1016/j.ecoinf.2023.102370
- ISSN
- 1574-9541
1878-0512
- Abstract
- Recreational beaches face a threat from pathogenic bacteria that harbor antibiotic resistance genes (ARGs). To predict bacterial occurrence and comprehend their non-linear relationship with hydrometeorological features, advanced machine- and deep-learning algorithms were employed. These algorithms include regression trees (RT), as well as interpretable deep-learning algorithms such as the ‘Input Attention-Long Short-Term Memory (IA-LSTM)’ and ‘Temporal Fusion Transformer (TFT)’. Our focus was on predicting the occurrence of Prevotella, a prevalent pathogenic bacterium found at the beaches. Utilizing model-dependent and model-agnostic interpretation methods, which encompass sensitivity analysis, permutation, and the SHapley Additive exPlanations (SHAP) importance, we evaluated model behavior. RT-based algorithms exhibited predictive capabilities comparable to those of IA-LSTM and TFT, achieving validation Nash–Sutcliffe efficiencies of 0.93, 0.94, and 0.96, respectively. However, the deep-learning algorithms (IA-LSTM and TFT) are surpassed in terms of interpretability. The model-dependent interpretation method identified heavy precipitation as a pivotal hydrometeorological feature linked to increased Prevotella occurrence. Notably, the IA-LSTM identified Prevotella as a potential host for the sulfonamide resistance gene (sul1), suggesting the potential of Prevotella as an indicator for sul1. This research, leveraging interpretable data-driven models, advances our understanding of the hydrometeorological features influencing the occurrence of pathogenic bacteria and the prevalence of ARGs at the beach, and enhances predictive capabilities for bacterial occurrence. © 2023
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - 융합기술공과대학 > Department of Energy Engineering > Journal Articles
![qrcode](https://api.qrserver.com/v1/create-qr-code/?size=55x55&data=https://scholarworks.gnu.ac.kr/handle/sw.gnu/68952)
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.