eISSN: 2391-6052
ISSN: 2353-3854
Alergologia Polska - Polish Journal of Allergology
Bieżący numer Archiwum Artykuły zaakceptowane O czasopiśmie Suplementy Zeszyty specjalne Rada naukowa Bazy indeksacyjne Prenumerata Kontakt Zasady publikacji prac Opłaty publikacyjne Standardy etyczne i procedury
Panel Redakcyjny
Zgłaszanie i recenzowanie prac online
1/2024
vol. 11
 
Poleć ten artykuł:
Udostępnij:
streszczenie artykułu:
Artykuł oryginalny

Evaluating ChatGPT-3.5 in allergology: performance in the Polish Specialist Examination

Michał Bielówka
1
,
Jakub Kufel
2, 3
,
Marcin Rojek
1, 4
,
Adam Mitręga
1
,
Dominika Kaczyńska
1
,
Łukasz Czogalik
1
,
Michał Janik
1
,
Wiktoria Bartnikowska
5
,
Sylwia Mielcarska
6
,
Dominika Kondoł
7

1.
Students’ Scientific Association of Computer Analysis and Artificial Intelligence at the Department of Radiology and Nuclear Medicine, Medical University of Silesia, Katowice, Poland
2.
Department of Radiodiagnostics, Interventional Radiology and Nuclear Medicine, Medical University of Silesia, Katowice, Poland
3.
Department of Radiology and Nuclear Medicine, Medical University of Silesia, Katowice, Poland
4.
Students’ Scientific Association at the Department of Microbiology and Immunology, Medical University of Silesia, Katowice, Poland
5.
Faculty of Medical Sciences in Katowice, Medical University of Silesia, Katowice, Poland
6.
Department of Medical and Molecular Biology, Faculty of Medical Sciences in Zabrze, Medical University of Silesia, Zabrze, Poland
7.
Dr B. Hager Memorial Multi-specialty District Hospital, Tarnowskie Góry, Poland
Alergologia Polska – Polish Journal of Allergology 2024; 11, 1: 42–47
Data publikacji online: 2024/02/13
Pełna treść artykułu Pobierz cytowanie
 
Metryki PlumX:


Introduction:
The development of Artificial Intelligence (AI) and attempts to use it in medicine are increasingly becoming the subject of more scientific research.

Aim:
The aim of this article is to present the effectiveness of the advanced language model, ChatGPT-3.5 in the context of the pass rate of the Polish National Specialist Examination (PES) in allergology. Additionally, it seeks to comprehend the potential applications of artificial intelligence in the field of medicine, particularly within allergology.

Material and methods:
The study used the latest available PES exam prepared by the Medical Research Centre in Lodz. 118 questions were asked using the openai.com platform, which allows free access to the ChatGPT-3.5 model. All questions were classified according to Bloom’s taxonomy to assess their complexity and difficulty, with additional three categorisations. Each question was asked five times.

Results:
ChatGPT-3.5 did not pass the allergology PES, achieving a score of 52.54%. It was observed that the model performed better in answering memory questions (60%) compared to those requiring comprehension and critical thinking, where the results were slightly lower (45%). Moreover, within the categories of ‘treatment’, ‘immune system’ and ‘symptoms’, the model exceeded the passing threshold. Questions to which ChatGPT provided the correct answer significantly exhibited higher difficulty compared to those to which it provided an incorrect response.

Conclusions:
The results indicate that ChatGPT’s pass rate in the allergology PES is considerably lower than that of resident doctors specializing in this field. The potential applications of AI in medicine require further research to effectively support clinical practice among physicians.




© 2024 Termedia Sp. z o.o.
Developed by Bentus.