Gazi Medical Journal, vol.35, no.2, pp.186-191, 2024 (ESCI)
Objective: Amid recent advances in artificial intelligence, ChatGPT by OpenAI has emerged as a versatile tool capable of performing a wide range of tasks; however, its application in medicine is challenged by the complexity of clinical questions and limitations in accuracy. This article compares ChatGPT's performance with that of orthopedic residents at Gazi University on a multiple-choice exam to assess its applicability and reliability in the field of orthopedics.
Methods: In this observational study conducted at Gazi University, 31 orthopedic residents were stratified by experience level and assessed with a 50-question multiple-choice test covering various orthopedic topics. ChatGPT 3.5 was given the same questions, and its responses were evaluated for both correctness and the reasoning behind the answers.
Results: The residents, whose experience ranged from 6 months to 5 years, scored between 23 and 40 out of 50, with a mean score of 30.81; scores varied with seniority. ChatGPT answered 25 of the 50 questions correctly and was consistent across different languages and administration times, but it also showed limitations, giving incorrect responses or stating that the correct answer was not among the choices for some questions.
Conclusion: While ChatGPT can accurately answer some theoretical questions, its effectiveness is limited in interpretive scenarios and in situations involving multiple variables, although its accuracy may improve with future updates.