Support Vector Machine Based Spam SMS Detection


Creative Commons License

TEKEREK A.

JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, cilt.22, sa.3, ss.779-784, 2019 (ESCI) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 22 Sayı: 3
  • Basım Tarihi: 2019
  • Doi Numarası: 10.2339/politeknik.429707
  • Dergi Adı: JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI
  • Derginin Tarandığı İndeksler: Emerging Sources Citation Index (ESCI), TR DİZİN (ULAKBİM)
  • Sayfa Sayıları: ss.779-784
  • Anahtar Kelimeler: Spam SMS, data mining, machine learning, support vector machine
  • Gazi Üniversitesi Adresli: Evet

Özet

Short Message Service (SMS) is the most important communication tool in recent decades. With the increased popularity of mobile devices, the usage rate of SMS will increase more and more in years. SMS is a practical method used to reach individuals directly. But this practical and easy method can cause SMS to be misused. The advertising or promotional SMS of the companies are an examples of this misuse. In this study, a spam SMS detection technique is proposed using Data Mining (DM) methods. In the proposed study, data mining algorithms such as Naive Bayes (NB), K-Nearest Neighborhood (KNN), Support Vector Machine (SVM), Random Forest (RF) and Random Tree (RT) is selected. SMSSpamCollection dataset, which is contain 747 spam SMS and 4827 ham SMS, is used. 10 fold cross-validation technique is used to evaluate prediction of Spam SMS in the dataset. Therefore, proposed study achieved 98.33 % success rate and 0,087 false positive rate for SVM algorithm..