Author Identification with Chicken Swarm Optimization Algorithm and Adaboost Approaches

Güllü M., Polat H., Çetin A.

2020 5th International Conference on Computer Science and Engineering (UBMK), Diyarbakır, Türkiye, 9-11 September 2020, pp. 290-294 (Full-Text Paper)

Publication Type: Conference Paper / Full-Text Paper
DOI: 10.1109/ubmk50275.2020.9219459
City of Publication: Diyarbakır
Country of Publication: Türkiye
Pages: pp. 290-294
Keywords: authorship identification, chicken swarm, optimization, AdaBoost, feature selection, boosting
Open Archive Collection: AVESİS Open Access Collection
Affiliated with Gazi University: Yes

Abstract

Author identification holds an important place in Natural Language Processing (NLP). Every written document carries traces of its author. In this study, we aim to identify authors from the traces they leave in their texts. A raw dataset was created from 2024 randomly selected Turkish-language texts by 25 columnists from different newspapers. From this raw dataset, a dataset of character and lexical features was prepared using natural language processing methods. Feature selection was performed on the prepared dataset by combining Chicken Swarm Optimization with ensemble learning algorithms. The results were evaluated before and after feature selection was applied. The highest success rate, 93.99%, was achieved when AdaBoost with the J48 algorithm was applied after feature selection.
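
The abstract mentions character and lexical features without listing them. As a rough illustration only, the sketch below computes a handful of commonly used stylometric features for a single text. The function name `stylometric_features` and the specific features chosen are assumptions made for this example; they are not taken from the paper, whose actual feature set is not given here.

```python
import string
from collections import Counter


def stylometric_features(text: str) -> dict[str, float]:
    """Compute a few illustrative character and lexical features for one text.

    Hypothetical helper: the feature set is an assumption, not the paper's.
    """
    n_chars = len(text)
    # Strip surrounding ASCII punctuation so "metin!" and "metin" match.
    tokens = [w.strip(string.punctuation).lower() for w in text.split()]
    tokens = [t for t in tokens if t]
    counts = Counter(tokens)
    n_tokens = len(tokens)
    return {
        # Character-level features
        "char_count": float(n_chars),
        "punct_ratio": sum(c in string.punctuation for c in text) / n_chars if n_chars else 0.0,
        "upper_ratio": sum(c.isupper() for c in text) / n_chars if n_chars else 0.0,
        # Lexical features
        "word_count": float(n_tokens),
        "avg_word_len": sum(len(t) for t in tokens) / n_tokens if n_tokens else 0.0,
        "type_token_ratio": len(counts) / n_tokens if n_tokens else 0.0,
    }


features = stylometric_features("Bu bir deneme. Bu kisa bir metin!")
```

Each text would yield one such feature vector, and a feature selection step would then choose a subset of these columns before classification. Note that `str.lower()` does not apply Turkish-specific casing for dotted and dotless I, so a real pipeline for Turkish text would likely need locale-aware normalization.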