Prediction of algal blooms via data-driven machine learning models: an evaluation using data from a well-monitored mesotrophic lake

نویسندگان

چکیده

Abstract. With increasing lake monitoring data, data-driven machine learning (ML) models might be able to capture the complex algal bloom dynamics that cannot completely described in process-based (PB) models. We applied two ML models, gradient boost regressor (GBR) and long short-term memory (LSTM) network, predict blooms seasonal changes chlorophyll concentrations (Chl) a mesotrophic lake. Three predictive workflows were tested, one based solely on available measurements others applying two-step approach, first estimating nutrients have limited observations then predicting Chl using observed pre-generated environmental factors. The third workflow was developed hydrodynamic data derived from PB model as additional training features approach. performance of superior Chl. hybrid further improved prediction timing magnitude blooms. A sparsity test shuffling order testing years showed accuracy decreased with sample interval, varied training–testing year combinations.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Enhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining

This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...

متن کامل

Machine Learning Models for Housing Prices Forecasting using Registration Data

This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient B...

متن کامل

Prediction of algal blooms using genetic programming.

In this study, an attempt was made to mathematically model and predict algal blooms in Tolo Harbor (Hong Kong) using genetic programming (GP). Chlorophyll plays a vital role in blooms and was used in this model as a measure of algal bloom biomass, and eight other variables were used as input for its prediction. It has been observed that GP evolves multiple models with almost the same values of ...

متن کامل

Interactive Data Exploration via Machine Learning Models

This article provides an overview of our research on data exploration. Our work aims to facilitate interactive exploration tasks in many big data applications in the scientific, biomedical and healthcare domains. We argue for a shift towards learning-based exploration techniques that automatically steer the user towards interesting data areas based on relevance feedback on database samples, aim...

متن کامل

Prediction of Student Learning Styles using Data Mining Techniques

This paper focuses on the prediction of student learning styles using data mining techniques within their institutions. This prediction was aimed at finding out how different learning styles are achieved within learning environments which are specifically influenced by already existing factors. These learning styles, have been affected by different factors that are mainly engraved and found wit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Geoscientific Model Development

سال: 2023

ISSN: ['1991-9603', '1991-959X']

DOI: https://doi.org/10.5194/gmd-16-35-2023