|
Current issue
Archive
Manuscripts accepted
About the journal
Editorial board
Reviewers
Abstracting and indexing
Subscription
Contact
Instructions for authors
Ethical standards and procedures
Editorial System
Submit your Manuscript
|
4/2025
vol. 78 Review paper
The role of large language models in dental diagnosis, treatment planning, and prognosis
Julien Issa
1
,
Fatemeh Sohrabniya
2
,
Janet Brinz
3
,
Marta Dyszkiewicz-Konwińska
4
,
Akhilanand Chaurasia
5
J Stoma 2025; 78, 4: 298-303
Online publish date: 2025/11/04
Article file
- JOS-01132-The_role_of_large.pdf
[0.33 MB]
ENW EndNote
BIB JabRef, Mendeley
RIS Papers, Reference Manager, RefWorks, Zotero
AMA
APA
Chicago
Harvard
MLA
Vancouver
IntroductionLarge language models (LLMs) are advanced deep learning algorithms proficient in executing diverse natural language processing (NLP) tasks, utilizing transformer models to train on vast datasets [1, 2]. This training enables them to perform functions, such as recognizing, translating, predicting, and generating text and other content [1, 2]. Moreover, LLMs specialize in tasks beyond language, including analyzing protein structures and coding software [1, 2]. Like the human brain, these models undergo pretraining and fine-tuning to master various challenges, such as text classification, question answering, document summarization, and text generation [1, 2]. They have extensive applications across diverse fields, e.g., healthcare, finance, and entertainment, functioning in roles ranging from translation services to powering chatbots and artificial intelligence (AI) assistants [1, 2]. These models’ large parameter counts, similar to accumulated memories, enhance their problem-solving capabilities in both linguistic and non-linguistic tasks [1, 2]. LLMs are built on deep neural networks that consist of multiple layers, including input, hidden, and output (Figure 1). The input layer initiates data processing by first applying tokenization, which breaks textual data into smaller units, i.e., words or sub-words, and then embedding, which converts these tokens into numerical representations, enabling deeper linguistic analysis [3]. In deep learning, the hidden layers, essential in transformer-based architectures, comprise components, such as feedforward networks and normalization layers, which help stabilize learning and address challenges, e.g., the vanishing gradient problem [3]. These deep neural network layers are stacked to support the model’s language-handling capabilities. The output layer processes the data to make predictions or decisions, usually using the softmax function that helps to determine the most likely next word or token by converting the model’s results into probabilities [3]. LLMs have three primary categories: generic, instruction-tuned, and dialog-tuned models. Generic models excel in predicting the next word based on learned language patterns, and are useful in information retrieval [4]. Instruction-tuned models, trained to respond based on specific instructions, are adept at tasks, including sentiment analysis or code generation [5]. Dialog-tuned models are designed for conversational AI, capable of maintaining engaging dialogues through contextually appropriate responses [6]. The landscape of AI and NLP has evolved with the advent of models, such as OpenAI’s, ChatGPT, and Google’s Gemini, marking significant milestones in human-machine interaction [7, 8]. These LLMs are built on a rich history of AI research, tracing back to early innovations, i.e., the ELIZA chatbot in 1966 [9]. Today, LLMs are not only transforming textual interactions, but are also pioneering applications in fields as diverse as medicine and dentistry [10]. In dentistry, NLP shows promise by using methods, such as manual analysis, semantic similarity, key word extraction, and regular expressions to extract relevant dental data from patients’ electronic health records, e.g., identifying fillings or tooth decay [11]. AI technologies address various aspects of dental education and practice, from personalized learning and diagnosis to treatment planning and research [12, 13]. The availability of vast amounts of unstructured clinical text data, including electronic health records, medical literature, and patient narratives in the medical and dental fields, has made them invaluable resources for LLMs to process and extract insights from [13]. This review aimed to explore the challenges and opportunities that LLMs present to dentistry, focusing on the potential to enhance patient care through diagnosis, treatment planning, and prognosis. By examining the current state of LLMs and their applications, we seek to shed light on the evolving role of AI in shaping the future of dentistry. LLMs across dental specialtiesThe application of machine learning in dentistry, for instance, using statistical techniques to enhance computer systems through experience, leverage these advancements to improve predictive analytics and operational efficiency in dental practices [14]. The core methodology involves algorithms capable of interpreting complex datasets without explicit programming, thus adapting to new patterns in dental health records to optimize decision-making processes [14]. This not only leads to higher diagnostic precision, but also enhances the administrative efficiency of dental practices, ultimately improving patient outcomes and workflow [15]. Recent studies have showcased the value of ChatGPT in medical education [16, 17]. Notably, ChatGPT demonstrated its capacity to achieve a passing score equivalent to that of a third-year medical student [17]. Furthermore, the latest version of ChatGPT, GPT4, has exhibited a notable competency by successfully passing the dental licensing examinations of both the United States and the United Kingdom, alongside achieving commendable performance across a diverse range of dental disciplines [18]. The application of LLMs within the five main dental specialties (pediatric dentistry, endodontics, oral and maxillofacial radiology, oral and maxillofacial surgery, and periodontics) exhibited diverse applications, ranging from enhancing patient education to improving diagnostic capabilities (Figure 2). LLMs are used in pediatric dentistry to enhance caries detection and improve radiological image analysis [19]. They also assist in predicting oral diseases, promoting health, and educating patients [19]. In a study conducted by Rokhshad et al. [20], various chatbots, including ChatGPT and Google Bard, among others, were evaluated on their ability to respond to true/false questions related to pediatric dentistry. The findings revealed that these chatbots demonstrated lower accuracy rates than pediatric and general dentists, though their performance exceeded that of dental students. This suggests that while chatbots are making strides, they currently may not be advisable for clinical applications within pediatric dentistry. Similarly, ChatGPT demonstrated an 85.44% consistency level in addressing clinical questions of varying difficulty, sourced from the position statements of the European Society of Endodontology [21, 22]. These questions capture expert consensus on diverse endodontic topics. While maintaining a notable consistency, ChatGPT’s average accuracy was reported as 57.33%, showcasing discernible fluctuations linked to the complexity of questions [21]. Even though these results highlight ChatGPT’s potential utility in endodontics, the current level of performance does not allow it to supplant the clinical decision-making of dental professionals. As AI continues to learn and improve, it is anticipated to become more effective in the field. Furthermore, in oral and maxillofacial radiology, LLMs assist in determining radiation doses, suggesting appropriate imaging protocols, and creating preprocedural checklists [20]. This technology also aids in report generation, reducing radiologists’ workload and providing educational resources [20]. However, their ability to analyze radiographs and respond to radiographical image-based questions is limited [23]. Mago et al. [24] found that ChatGPT-3 achieved 100% accuracy in identifying radiographic landmarks, but its ability to describe oral and maxillofacial pathologies was limited to their primary or most distinctive radiographic features. In addition, in oral and maxillofacial surgery, LLMs provide quality patient information, but are less reliable for technical training purposes. Surgeons are advised to use it cautiously and as a supplement to their expertise [21]. Balel [22] assessed the proficiency of ChatGPT in answering questions about impacted teeth, dental implants, temporomandibular joint diseases, and orthognathic surgery, categorizing them into patient and technical queries. While ChatGPT’s responses to patient-oriented diagnostic questions were rated highly, indicating its potential as a supportive tool for patient education and diagnosis clarification, its effectiveness in answering complex technical diagnostic questions was less pronounced [22]. This discrepancy underscores the importance of using ChatGPT judiciously in diagnostic contexts, viewing it as a complementary aid to professional medical judgment rather than a standalone diagnostic tool [22]. Moreover, LLMs, such as ChatGPT-4, have shown promise as adjunctive tools in periodontal disease diagnosis. They facilitate the provision of information by identifying pertinent topics and creating dialogues simulating patient inquiries, thereby guiding individuals seeking knowledge about their condition [25]. Alan et al. [26] evaluated the quality of ChatGPT-4’s responses to periodontal disease questions, and found that although the overall DISCERN scores indicated good quality for most topics, the information related to treatment options was often poor. Specifically, although the reliability and overall quality sections were rated as good, the treatment choices section frequently scored below the acceptable threshold, indicating a lower quality of information in this critical area [26]. Additionally, NLP demonstrated promising results in automated dental charting using voice-based reporting [27]. Role of LLMs in treatment planning and patient educationLLMs could have a potential role in patient education in dentistry, significantly improving the way medical and dental knowledge is communicated. By translating complex medical and dental technical language into everyday language, these models help improving patient comprehension and engagement with their treatment plans. This is crucial, as some patients may feel uncomfortable with specialized terminology, and misunderstandings can lead to ill-informed decisions or unnecessary concerns [28]. Moreover, LLMs facilitate patient communication by offering translations and summaries, which is particularly beneficial in enhancing the understanding of conditions and treatments among patients [29]. They also address common concerns about dental health and postprocedural care, providing tailored responses based on individual symptoms [30]. Additionally, these models organize patient data and deliver essential guidance for postoperative care, such as pain management after root canal treatments and instructions for postextraction care, which is invaluable for patients whose primary language is not English [31]. Despite these advantages, it is essential to recognize the limitations of LLMs. Even though they show promise for various tasks, such as record-keeping and decision support, they must be used cautiously due to their potential to generate inaccurate or misleading information. This underscores the importance of verifying the reliability and accuracy of information provided by LLMs in healthcare contexts [31]. Furthermore, with culturally and linguistically diverse populations facing unequal access to dental services, LLMs might play a critical role in bridging gaps in oral health outcomes [32], thus underlining their significance in global health equity. Considering a patient’s unique genetic makeup, AI can suggest treatments that have the potential to be effective [33]. Furthermore, AI can generate clinical insights, combining genomic determinants with information from patient symptoms, clinical history, and lifestyles, facilitating personalized diagnosis and prognostication [33]. Integrating AI and clinical data sources could be valuable in a dental setting. LLMs can automate document comprehension and make treatment plan analysis feasible, which reaps the benefit of large-scale pretraining [26]. They can assist dental practitioners more efficiently in establishing treatment plans tailored to patients’ backgrounds by examining adverse drug reaction patterns linked to various dental procedures and drugs [34]. NLR algorithms can assist in identifying comorbidities by analyzing patient records for common risk factors and symptoms, identification of adverse drug reactions, drug safety surveillance, and patient education [34]. Role of LLMs in prognosisPrognosis plays a crucial role in dentistry across diverse specialties, offering clinicians valuable insights into the anticipated course and outcome of a patient’s oral health condition [35, 36]. However, as far as our current knowledge extends, there is a lack of literature studying the integration of LLMs in the domain of dental prognosis. LLMs, equipped with the capability to process extensive textual information and recognize complicated patterns, are potent tools for enhancing both the experiential and analytical facets of prognosis in dentistry. Their ability to synthesize information from various sources, such as clinical records, research literature, and case studies, can empower clinicians with a wealth of knowledge. Challenges and ethical considerationsThe integration of AI and LLMs into dentistry presents significant challenges and ethical concerns [37]. A primary issue is that LLM outputs, shaped by the quality and characteristics of their training data, are probabilistic and not absolute truths. This raises concerns about potential biases, which could perpetuate stereotypes and inaccuracies in patient care and education. Ensuring diverse and representative datasets is crucial to mitigate these biases and promote equitable outcomes. Additionally, maintaining the confidentiality and privacy of patient data is paramount due to the extensive data requirements for training LLMs. Another ethical consideration is the risk of dependency on automated systems, which could potentially devalue dental practitioners’ professional judgment and skills. Balancing the enhancement of clinical practice with AI while preserving the indispensable human element of patient care is essential. Furthermore, the transparency and explainability of decisions made by LLMs, which often operate as “black boxes,” pose challenges to informed consent and patient autonomy, as the rationale behind diagnoses or treatment recommendations may not be easily understood. Addressing these challenges requires a multidisciplinary approach to develop guidelines and frameworks, ensuring the responsible and ethical utilization of LLMs in dentistry. As this technology advances, ongoing dialogue and research are essential to harness its potential while safeguarding against its risks, making sure that LLMs enhance, rather than undermine, the quality and integrity of dental care. ConclusionsLLMs have the potential to improve patient education, dental diagnosis, treatment planning, and prognosis, but their full capabilities have yet to be realized due to various limitations. While their future integration into clinical practice could enhance efficiency and accuracy, significant challenges and ethical concerns must first be addressed. These models also come with inherent biases and ethical considerations that need careful evaluation. Given the limited research available in this area, further studies are necessary to understand better the role and impact of LLMs in dentistry, particularly in treatment planning and prognosis. Disclosures1. The approval of the Bioethics Committee for the research: Not applicable. 2. Assistance with the article: None. 3. Financial support and sponsorship: Julien Issa is a participant of the STER Internationalization of Doctoral Schools Program from NAWA Polish National Agency for Academic Exchange No. PPI/STE/ 2020/1/00014/DEC/02. 4. Conflicts of interest: None. References1. Manning CD. Human language understanding & reasoning. Daedalus 2022; 151: 127-138. 2.
Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J, Dhariwalet P, et al. Language models are few-shot learners. arXiv:2005.14165. DOI: https://doi.org/10.48550/arXiv.2005.14165. 3.
Bridgelall R. Unraveling the mysteries of AI chatbots. Artif Intell Rev 2024; 57: 89. DOI: 10.1007/s10462-024-10720-7. 4.
Villa L, Carneros-Prado D, Dobrescu CC, Sánchez-Miguel A, Cubero G, Hervás R. Comparative analysis of generic and fine-tuned large language models for conversational agent systems. Robotics 2024; 13: 68. DOI: 10.3390/robotics13050068. 5.
Singhal K, Azizi S, Tu T, Mahdavi SS, Wei J, Chung HW, et al. Large language models encode clinical knowledge. Nature 2023; 620: 172-180. 6.
Handler A, Larsen KR, Hackathon R. Large language models present new questions for decision support. Int J Inf Manage 2024; 79: 102811. DOI: 10.1016/j.ijinfomgt.2024.102811. 7.
OpenAI. ChatGPT: optimizing language models for dialogue. Available at: https://openai.com/blog/chatgpt/, 2022 (Accessed: 05.11.2023). 8.
Pichai S. An important next step on our AI journey. Available at: https://blog.google/technology/ai/bard-google-ai-search-updates/, 2023 (Accessed: 05.11.2023). 9.
Weizenbaum J. ELIZA – a computer program for the study of natural language communication between man and machine. Commun ACM 1996; 9: 36-45. 10.
Yu P, Xu H, Hu X, Deng C. Leveraging generative AI and large language models: a comprehensive roadmap for healthcare integration. Healthcare 2023; 11: 2776. DOI: 10.3390/healthcare-11202776. 11.
Büttner M, Leser U, Schneider L, Schwendicke F. Natural language processing: chances and challenges in dentistry. J Dent 2024; 141: 104796. DOI: 10.1016/j.jdent.2023.104796. 12.
Schwendicke F, Samek W, Krois J. Artificial intelligence in dentistry: chances and challenges. J Dent Res 2020; 99: 769-774. DOI: 10.1177/0022034520915714. 13.
Shan T, Tay FR, Gu L. Application of artificial intelligence in dentistry. J Dent Res 2021; 100: 232-244. DOI: 10.1177/002203452096911. 14.
Nguyen KTP, Medjaher K, Tran DT. A review of artificial intelligence methods for engineering prognostics and health management with implementation guidelines. Artif Intell Rev 2023; 56: 3659-3709. 15.
Arsiwala-Scheppach LT, Chaurasia A, Müller A, Krois J, Schwendicke F. Machine learning in dentistry: a scoping review. J Clin Med 2023; 12: 937. DOI: 10.3390/jcm12030937. 16.
Kung TH, Cheatham M, Medenilla A, Sillos C, De Leon L, Elepaño C, et al. Performance of ChatGPT on USMLE: potential for AI-assisted medical education using large language models. PLOS Digit Health 2023; 2: e0000198. DOI: 10.1371/journal.pdig.0000198. 17.
Gilson A, Safranek CW, Huang T, Socrates V, Chi L, Taylor RA, et al. How does ChatGPT perform on the United States Medical Licensing Examination (USMLE)? The implications of large language models for medical education and knowledge assessment. JMIR Med Educ 2023; 9: e45312. DOI: 10.2196/45312. 18.
Chau RCW, Thu KM, Yu OY, Hsung RT, Lo ECM, Lam WYH. Performance of generative artificial intelligence in dental licensing examinations. Int Dent J 2024; 74: 616-621. 19.
Maltarollo TFH, Shitsuka C, Sivieri-Araújo G. ChatGPT and its use in paediatric dentistry. Eur Arch Paediatr Dent 2023; 24: 675-676. DOI: 10.1007/s40368-023-00838-7. 20.
Rokhshad R, Zhang P, Mohammad-Rahimi H, Pitchika V, Entezari N, Schwendicke F. Accuracy and consistency of chatbots versus clinicians for answering pediatric dentistry questions: a pilot study. J Dent 2024; 144: 104938. DOI: 10.1016/j.jdent.2024.104938. 21.
Suárez A, Díaz-Flores García V, Algar J, Gómez Sánchez M, Llorente de Pedro M, Freire Y. Unveiling the ChatGPT phenomenon: evaluating the consistency and accuracy of endodontic question answers. Int Endod J 2024; 57: 108-113. 22.
Duncan HF, Kirkevang LL, Peters OA, El-Karim I, Krastl G, Del Fabbro M, et al. Treatment of pulpal and apical disease: the European Society of Endodontology (ESE) S3-level clinical practice guideline. Int Endod J 2023; 56 Suppl 3: 238-295. DOI: 10.1111/iej.13974. 23.
Khurana S, Vaddi A. ChatGPT from the perspective of an academic oral and maxillofacial radiologist. Cureus 2023; 15: e40053. DOI: 10.7759/cureus.40053. 24.
Mago J, Sharma M. The potential usefulness of ChatGPT in oral and maxillofacial radiology. Cureus 2023; 15: e42133. DOI: 10.7759/cureus.42133. 25.
Babayiğit O, Tastan Eroglu Z, Ozkan Sen D, Ucan Yarkac F. Potential use of ChatGPT for patient information in periodontology: a descriptive pilot study. Cureus 2023; 15: e48518. DOI: 10.7759/cureus.48518. 26.
Alan R, Alan BM. Utilizing ChatGPT-4 for providing information on periodontal disease to patients: a DISCERN quality analysis. Cureus 2023; 15. DOI: 10.7759/cureus.46213. 27.
Patel JS, Kumar K, Zai A, Shin D, Willis L, Thyvalikakath TP. Developing automated computer algorithms to track periodontal disease change from longitudinal electronic dental records. Diagnostics (Basel) 2023; 13: 1028. DOI: 10.3390/diagnostics13061028. 28.
Hamilton A, Lamey P, Ulhaq A, Besi E. Commonly used terminology in oral surgery and oral medicine: the patient’s perspective. Br Dent J 2021; 230: 823-830. 29.
Clusmann J, Kolbinger FR, Muti HS, Carrero ZI, Eckardt JN, Lalehet NG, et al. The future landscape of large language models in medicine. Commun Med 2023; 3: 141. DOI: 10.1038/s43856-023-00370-1. 30.
Alhaidry HM, Fatani B, Alrayes JO, Almana AM, Alfhaed NK. ChatGPT in dentistry: a comprehensive review. Cureus 2023; 15: e38317. DOI: 10.7759/cureus.38317. 31.
Eggmann F, Weiger R, Zitzmann NU, Blatz MB. Implications of large language models such as ChatGPT for dental medicine. J Esthet Restor Dent 2023; 35: 1098-1102. 32.
Marcus K, Balasubramanian M, Short S, Sohn W. Barriers and facilitators to dental care among culturally and linguistically diverse carers: a mixed-methods systematic review. Community Dent Oral Epidemiol 2023; 51: 327-344. DOI: 10.1111/cdoe.12745. 33.
Johnson KB, Wie WQ, Weeraratne D, Frisse ME, Misulis K, Rhee K, et al. Precision medicine, AI, and the future of personalized health care. Clin Transl Sci 2021; 14: 86-93. 34.
Huang H, Zheng O, Wang D, Yin J, Wang Z, Ding S, et al. ChatGPT for shaping the future of dentistry: the potential of multi-modal large language model. Int J Oral Sci 2023; 15: 29. DOI: 10.1038/s41368-023-00239-y. 35.
Beck JD. Risk revisited. Community Dent Oral Epidemiol 1998; 26: 220-225. 36.
McGuire MK, Nunn ME. Prognosis versus actual outcome. II. The effectiveness of clinical parameters in developing an accurate prognosis. J Periodontol 1996; 67: 658-665. 37.
Dziedzic A, Issa J, Chaurasia A, Tanasiewicz M. Artificial intelligence and health-related data: the patient’s best interest and data ownership dilemma. Proc Inst Mech Eng H 2024; 238: 1023-1028.
This is an Open Access journal, all articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0). License (http://creativecommons.org/licenses/by-nc-sa/4.0/), allowing third parties to copy and redistribute the material in any medium or format and to remix, transform, and build upon the material, provided the original work is properly cited and states its license.
|