Abstract
Introduction: The purpose of this research was to compare how artificial intelligence chatbots (ChatGPT-3.5, ChatGPT-4, Gemini, and DeepSeek) responded to common patient inquiries regarding home dental bleaching, with an emphasis on quality, accuracy, clarity, and practical applicability.
Materials and Methods: Forty patient-oriented questions identified using the AlsoAsked tool, which extracts Google “People Also Ask” data, were categorized into seven thematic domains and submitted individually to each chatbot in separate sessions. Responses were independently scored by two evaluators using the global quality scale (GQS), accuracy of ınformation ındex (AOI), and patient education materials assessment tool for printed materials. Response times were recorded in seconds. Statistical analyses included the Kruskal–Wallis test, Bonferroni-adjusted pairwise comparisons, and Spearman correlation (p<0.05).
Results: ChatGPT-4 and DeepSeek achieved the highest GQS and AOI scores. DeepSeek had the highest actionability score but the longest response time. ChatGPT-3.5 demonstrated moderate performance, while Gemini had the lowest intelligibility and actionability scores.
Discussion and Conclusion: Advanced artificial intelligence chatbots can provide high-quality and accurate information on at-home dental bleaching. However, unsupervised use may pose patient safety risks; thus, their deployment should be limited to validated, monitored, and task-specific applications.
