Effect of Dataset Size on Deep Learning in Voice Recognition


Çayır A. N., Navruz T. S.

2021 3rd International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA), Ankara, Türkiye, 11 - 13 Haziran 2021, ss.1-5

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Doi Numarası: 10.1109/hora52670.2021.9461395
  • Basıldığı Şehir: Ankara
  • Basıldığı Ülke: Türkiye
  • Sayfa Sayıları: ss.1-5
  • Gazi Üniversitesi Adresli: Evet

Özet

Voice recognition systems mostly suffer from environmental effects and accent differences. Therefore, studies on speech recognition have begun to be examined using deep learning which is a method known to be successful in speech recognition and classification. In this study, 12 different voice commands are defined using convolutional neural network, which is a deep learning structure. In this study, the effect of dataset size on test and recognition accuracy was investigated. In addition, a different dataset which was prepared from the records of people whose main language is Turkish to investigate the effect of different accents on both test and recognition accuracy. In the experiments when the test dataset including native-speaker voice records is used, the test accuracy was obtained as 94.64% for large dataset and 64.81% for small dataset. On the other hand when the test dataset including foreigner’s voice records the test accuracy reduced to 63.29% for large and 33.18% for small data set.