Spoken language biomarkers in Turkish-speaking schizophrenia patients: Evidence from linguistic analysis and word embeddings


Creative Commons License

Bozdağ M. Ç., KUMCU A., Şenel L. K., TEMİZKAN H. N., Özil Ö., Arslanyürek İ., ...Daha Fazla

Psychiatry Research, cilt.362, 2026 (SCI-Expanded, SSCI, Scopus)

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 362
  • Basım Tarihi: 2026
  • Doi Numarası: 10.1016/j.psychres.2026.117215
  • Dergi Adı: Psychiatry Research
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Social Sciences Citation Index (SSCI), Scopus, BIOSIS, EMBASE, MEDLINE, Psycinfo
  • Anahtar Kelimeler: Formal thought disorder, Language as biomarker, Semantic models, Speech, Word vectors
  • Açık Arşiv Koleksiyonu: AVESİS Açık Erişim Koleksiyonu
  • Gazi Üniversitesi Adresli: Evet

Özet

Background and hypothesis: Schizophrenia (SZ) disrupts language in ways that may be universal across languages. This study investigated whether linguistic anomalies previously observed in SZ also occur in Turkish, a morphologically rich and agglutinative language. We hypothesised that SZ patients would differ from healthy controls (HCs) across multiple linguistic domains, including features typically sensitive to cross-linguistic variation. Methods: Speech characteristics of 50 native Turkish-speaking SZ patients were compared with 50 HCs matched for age, sex, length of education, and handedness. Speech data were collected in 15-minute interviews. The interview recordings were transcribed and analysed for various lexical, syntactic, and phonological measures using CLAN, and compared for discourse measures using fastText word embedding models. Results: The number of words produced per minute, mean length of utterance, average word frequency, the number of filled pauses, discourse coherence, and question-response similarity were lower in the patient group than in the control group. The content word-function word ratio, sentence prediction loss, type-token ratio, number of silent pauses, and silent pauses-to-total speech ratio were higher in the patient group than in the control group. Specific clinical and sociodemographic variables were identified as predictors of speech abnormalities in patients. Conclusion: The hypothesis was confirmed. Turkish-speaking SZ patients displayed speech patterns similar to those reported in other language groups, including language-sensitive variables. This supports the idea of universal linguistic disruptions in SZ. The findings are particularly valuable given the scarcity of research on Turkish, a low-resource and typologically distinct language.