Exploring the performance of ChatGPT-3.5 in addressing dermatological queries: a research investigation into AI capabilities

Marcin Rojek; Jakub Kufel; Michał Bielówka; Adam Mitręga; Dominika Kaczyńska; Łukasz Czogalik; Dominika Kondoł; Kacper Palkij; Sylwia Mielcarska; Wiktoria Bartnikowska

doi:10.5114/dr.2024.140796

Abstract

1/2024 vol. 111

Original article

Exploring the performance of ChatGPT-3.5 in addressing dermatological queries: a research investigation into AI capabilities

Marcin Rojek ¹

,

Jakub Kufel ^{2, 3}

,

Michał Bielówka ¹

,

Adam Mitręga ¹

,

Dominika Kaczyńska ¹

,

Łukasz Czogalik ¹

,

Dominika Kondoł ⁴

,

Kacper Palkij ⁴

,

Sylwia Mielcarska ⁵

,

Wiktoria Bartnikowska ⁶

Students’ Scientific Association of Computer Analysis and Artificial Intelligence at the Department of Radiology and Nuclear Medicine of the Medical University of Silesia, Katowice, Poland
Department of Radiodiagnostics, Interventional Radiology and Nuclear Medicine, Medical University of Silesia, Katowice, Poland
Department of Radiology and Nuclear Medicine, Medical University of Silesia, Katowice, Poland
Multi-specialty District Hospital S.A. Dr. B. Hager Pyskowicka, Tarnowskie Góry, Poland
Department of Medical and Molecular Biology, Faculty of Medical Sciences in Zabrze, Medical University of Silesia in Katowice, Poland
Faculty of Medical Sciences in Katowice, Medical University of Silesia, Katowice, Poland

Dermatol Rev/Przegl Dermatol 2024, 111, 26-30

DOI: https://doi.org/10.5114/dr.2024.140796

Online publish date: 2024/06/28

View full text

AMA

Rojek M, Kufel J, Bielówka M, et al. Exploring the performance of ChatGPT-3.5 in addressing dermatological queries: a research investigation into AI capabilities. Dermatology Review/Przegląd Dermatologiczny. 2024;111(1):26-30. doi:10.5114/dr.2024.140796.

APA

Rojek, M., Kufel, J., Bielówka, M., Mitręga, A., Kaczyńska, D., & Czogalik, Ł. et al. (2024). Exploring the performance of ChatGPT-3.5 in addressing dermatological queries: a research investigation into AI capabilities. Dermatology Review/Przegląd Dermatologiczny, 111(1), 26-30. https://doi.org/10.5114/dr.2024.140796

Chicago

Rojek, Marcin, Jakub Kufel, Michał Bielówka, Adam Mitręga, Dominika Kaczyńska, Łukasz Czogalik, and Dominika Kondoł et al. 2024. "Exploring the performance of ChatGPT-3.5 in addressing dermatological queries: a research investigation into AI capabilities". Dermatology Review/Przegląd Dermatologiczny 111 (1): 26-30. doi:10.5114/dr.2024.140796.

Harvard

Rojek, M., Kufel, J., Bielówka, M., Mitręga, A., Kaczyńska, D., Czogalik, Ł., Kondoł, D., Palkij, K., Mielcarska, S., and Bartnikowska, W. (2024). Exploring the performance of ChatGPT-3.5 in addressing dermatological queries: a research investigation into AI capabilities. Dermatology Review/Przegląd Dermatologiczny, 111(1), pp.26-30. https://doi.org/10.5114/dr.2024.140796

MLA

Rojek, Marcin et al. "Exploring the performance of ChatGPT-3.5 in addressing dermatological queries: a research investigation into AI capabilities." Dermatology Review/Przegląd Dermatologiczny, vol. 111, no. 1, 2024, pp. 26-30. doi:10.5114/dr.2024.140796.

Vancouver

Rojek M, Kufel J, Bielówka M, Mitręga A, Kaczyńska D, Czogalik Ł et al. Exploring the performance of ChatGPT-3.5 in addressing dermatological queries: a research investigation into AI capabilities. Dermatology Review/Przegląd Dermatologiczny. 2024;111(1):26-30. doi:10.5114/dr.2024.140796.

Introduction:

In the 21^st century’s era of rapid technological advancement, ChatGPT-3.5, an artificial intelligence (AI) language model, is scrutinized for its application in dermatology. Using 119 questions from the National Specialist Examination (PES), we assess ChatGPT-3.5’s performance by comparing it to human skills and addressing ethical implications.

Objective:

Our primary aim is to evaluate ChatGPT-3.5’s proficiency in responding to 119 dermatology questions from the PES. The study emphasizes ethical considerations and compares the model’s knowledge and skills to those of human dermatologists.

Material and methods:

Utilizing the 2023 PES question database, questions were categorized by Bloom’s taxonomy and thematic content. ChatGPT-3.5, version of 3 August 2023, answered 119 questions in five sessions, allowing for a probabilistic evaluation. Statistical analyses, conducted using R Studio, assessed correctness, confidence, and difficulty.

Results:

ChatGPT-3.5 achieved a 49.58% correct response rate, below the 60% passing threshold. No significant differences in difficulty or correlations between difficulty and certainty were observed. Varied performance across question types highlighted strengths and weaknesses. Despite suboptimal results, ChatGPT-3.5’s differential performance offers insights, suggesting future improvements. The study advocates for ongoing research into AI integration in dermatology, envisioning a promising role for AI in assisting dermatologists.

Conclusions:

Ethical considerations are crucial for effective AI introduction, minimizing errors, and enhancing dermatological healthcare quality, fostering optimism for AI’s evolving role in dermatology.

Keywords

medical education, artificial intelligence, dermatology, venereology, ChatGPT-35

Abstract

Exploring the performance of ChatGPT-3.5 in addressing dermatological queries: a research investigation into AI capabilities

Introduction:

Objective:

Material and methods:

Results:

Conclusions:

Keywords

Share

Coverage in

Integrated with

Editorial Policies