Predictive Machine Learning for Seasonal Water Quality Insights


Ali Z., Bilal M., Panhwar S., Junaid M., Keerio H. A., Hussain I.

Water, Air, and Soil Pollution, vol.237, no.7, 2026 (SCI-Expanded, Scopus) identifier identifier

  • Publication Type: Article / Article
  • Volume: 237 Issue: 7
  • Publication Date: 2026
  • Doi Number: 10.1007/s11270-026-09127-7
  • Journal Name: Water, Air, and Soil Pollution
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, ABI/INFORM, Artic & Antarctic Regions, BIOSIS, Chemical Abstracts Core, Chimica, Compendex, EMBASE, Environment Index, Geobase, Greenfile
  • Keywords: Hanna Lake, Machine Learning, Physico-chemical parameters, Spin Karez, Water quality prediction
  • Gazi University Affiliated: No

Abstract

Water is essential to human health, economic development, and productivity. Many contaminants have significantly affected the quality of water over time; thus, predicting water quality using modern tools is vital for mitigating water pollution. This research focused on an intensive examination of two distinct lakes. 80 surface water samples were taken randomly from each lake for 8 months continuously and tested in laboratory for physicochemical characteristics to determine the drinkability of the two lakes. However, 8 Spin Karez samples Lake and 18 Hanna Lake samples were deemed unsafe to drink. The samples were classified as drinkable or non-drinkable based on their drinkability values. The first six months of drinkability data were used to train the algorithms, and then the remaining two months of drinkability data were forecasted. Confusion matrix was used to examine the prediction performance of seven distinct supervised machine learning (ML) algorithms – Linear Regression (LR), Decision Tree Classifier (DTC), Random Forest (RF), XGBoosting (XGB), K-Nearest Neighbor (KNN), Support Vector Machines (SVM), and Adaptive Boosting (Adaboost). The models accurately predicted the water quality with DTC outperforming for both lakes. This study contributes to the evolution of prediction tools by integrating ML with environmental monitoring for sustainable water resource management.