Aır Pollutıon Predıctıon Wıth Machıne Learnıng Methods: Manısa And Zonguldak Example


Thesis Type: Postgraduate

Institution Of The Thesis: Gazi University, Fen Bilimleri Enstitüsü, Turkey

Approval Date: 2022

Thesis Language: Turkish

Student: BÜŞRA DUYGU ÇELİK

Supervisor: Nursal Arıcı

Open Archive Collection: AVESIS Open Access Collection

Abstract:

With the increase in energy use, urbanization, and heating activities that occur with the

increasing world population, air pollution has become a major environmental problem,

increasing day by day. The problem of air pollution causes serious damage to the ecosystem

by causing changes in air compounds in the atmosphere, acid rain, depletion of the ozone

layer, and negative effects on all living creatures. Estimating air pollution using a model is

crucial in taking measures for possible pollution and facilitating air quality improvement

processes. The fact that air pollution is dependent on many parameters, the inability to

measure all parameters and the high accuracy obtained at the end of the modelling are the

main reasons for using machine learning techniques in this regard. In this thesis, a data set

consisting of PM10, SO2, CO, NO2, NOX, O3 pollutants and meteorological parameters of

Temperature, Humidity, Precipitation, Wind Speed and Pressure measured between 2016-

2021 in Zonguldak and Manisa provinces are used. The data set was modelled with Support

Vector Machines, Decision Trees, Naive Bayes, Random Forest, K-Nearest Neighbor and

Long Short-Term Memory methods, which are well known in the literature. Model

performances measured and compared using accuracy, Root Mean Square Error (RMSE),

Mean Absolute Error (MAE), and Explanatory Coefficient (𝑅2) metrics. The LSTM, gives

better results than other methods with a 0.87 accuracy value for both stations. The lowest

performance is Naive Bayes with an accuracy value of 0.36.

Key Words : M achine learning, air pollution, air pollution prediction