Comparison of four machine learning methods for occupational accidents based on national data on metal sector in Turkey


YILDIZ ÖZKAN E., ULAŞ H. B.

Safety Science, cilt.174, 2024 (SCI-Expanded) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 174
  • Basım Tarihi: 2024
  • Doi Numarası: 10.1016/j.ssci.2024.106468
  • Dergi Adı: Safety Science
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, PASCAL, Aerospace Database, CAB Abstracts, Communication Abstracts, Compendex, Environment Index, INSPEC, Metadex, Psycinfo, Veterinary Science Database, Civil Engineering Abstracts
  • Anahtar Kelimeler: Gradient Boosting Method, K-nearest Neighbour, Occupational Accident, Random forest, Recursive Partitioning and Regression Trees
  • Gazi Üniversitesi Adresli: Evet

Özet

Occupational accidents are one of the main problems in production system especially in metal sector. The aim of this study is to develop a predictive framework using machine learning (ML) to identify the causes of fatalities and amputations in the metal sector based on occupational accident data collected by the Turkish Ministry of Labor and Social Security (MLSS) from 2013 to 2018. Researchers have used a variety of strategies to investigate factors and create effective prediction frameworks for lowering occupational accidents. We used random forest (RF), k-nearest neighbour (KNN), gradient boosting method (GBM) and recursive partitioning and regression trees (RPART) to predict accident causes and consequence. Accuracy, precision, recall and f-score is used to measure the performance of ML frameworks. For model validation 10-fold cross validation method was used which increased the accuracy of the frameworks considerably. We extracted important factors which affected the causes of accident at metal sector using feature importance. Analysis proved RF as the best performing framework with highest classification results with 0.9172 accuracy, 0.9618 precision, 0.9518 recall and 0.9568 f-score using all features as compared to other techniques classification of occupational accident severity. To implement preventive controls and interventions in a more targeted way, it is recommended to use the predictive RF algorithm in the analysis of occupational accidents. With these studies, preventive measures can be taken by predicting occupational accidents that may occur in the future.