Artificial Intelligence in Vision and Ophthalmology https://www.aivojournal.com/index.php/AIVO <p>Artificial Intelligence in Vision and Ophthalmology (AIVO) provides a forum for interdisciplinary approaches integrating techniques from artificial intelligence, mathematics, computer science, engineering and experimental and clinical sciences to address open problems in ophthalmology.</p> <p>AIVO uses the Continuous Article Publication (CAP) model. Articles are published as soon as they are ready. </p> <p>Read more about AIVO's <a title="AIVO Focus &amp; Scope" href="https://www.aivojournal.com/index.php/AIVO/about/#focusAndScope" target="_blank" rel="noopener">focus and scope</a>.<br /><a href="https://www.aivojournal.com/index.php/AIVO/issue/archive">See all issues here</a></p> <p style="text-align: center;"> </p> Kugler Publications en-US Artificial Intelligence in Vision and Ophthalmology 3051-2328 <p>Authors who publish with this journal agree to the following terms:</p><ol type="a"><li><p>Authors retain copyright and grant the journal right of first publication, with the work twelve (12) months after publication simultaneously licensed under a <a href="http://creativecommons.org/licenses/by/3.0/" target="_new">Creative Commons Attribution License</a> that allows others to share the work with an acknowledgement of the work’s authorship and initial publication in this journal.</p></li><li>After 12 months from the date of publication, authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.</li></ol> Multimodal large language models for use in diabetic retinopathy screening https://www.aivojournal.com/index.php/AIVO/article/view/157 <p><em><strong>Purpose:</strong></em> To evaluate the performance o f ChatGPT-4o and Gemini 2.5 Pro in detecting more-than-mild diabetic retinopathy (mtmDR) from fundus photography (FP) and diabetic macular edema (DME) from optical coherence tomography (OCT) using publicly available datasets.</p> <p><em><strong>Methods:</strong> </em>A custom GPT (powered by ChatGPT-4o) was created and instructed to follow the LumineticsCore™ (IDx-DR) screening criteria for mtmDR, defined as an ETDRS level ≥ 35 and/or clinically significant diabetic macular edema (CSDME). Gemini 2.5 Pro was evaluated with the same criteria. Performance on FPs was assessed using 2 publicly available datasets: MESSIDOR-2 (<em>n</em> = 106; 66 mtmDR, 40 no mtmDR) and EyePACS (<em>n</em> = 99; 56 mtmDR, 43 non-mtmDR). To assess detection of DME, a separate OCT dataset (<em>n</em> = 48; 24 DME, 24 normal) was used to evaluate identification of intraretinal and/or subretinal fluid. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) for detecting mtmDR on FP and DME on OCT were calculated for each multimodal large language model (LLM).</p> <p><em><strong>Results:</strong></em> On MESSIDOR-2 (n = 106), ChatGPT-4o achieved a sensitivity of 90.77%, specificity of 97.50%, PPV of 98.33%, and NPV of 86.67% for mtmDR detection. Gemini 2.5 Pro achieved a sensitivity of 80.30%, specificity of 97.50%, PPV of 98.15%, and NPV of 75.00%. On EyePACS (<em>n</em> = 99), ChatGPT-4o demonstrated a sensitivity of 94.64%, specificity of 86.05%, PPV of 89.83%, and NPV of 92.50%, while Gemini 2.5 Pro achieved a sensitivity of 89.29%, specificity of 88.37%, PPV of 90.91%, and NPV of 86.36%. For OCT-based DME detection (<em>n</em> = 48), ChatGPT-4o achieved a sensitivity of 95.83%, specificity of 100%, and PPV of 100%, while Gemini 2.5 Pro achieved a sensitivity of 95.83%, specificity of 95.65%, PPV of 95.83%, and NPV of 95.65%.</p> <p><em><strong>Conclusion:</strong> </em>ChatGPT-4o and Gemini 2.5 Pro demonstrated high performance in detecting mtmDR and DME across multiple publicly available datasets. These findings support the potential of multimodal LLMs as cost-effective and accessible tools for diabetic retinopathy screening. Further validation in larger, more diverse real-world datasets is warranted.</p> S. Saeed Mohammadi Sahana Aggarwal Kavina Aggarwal Grant Wiarda Kayla Nguyen Emmanuel A. Sarmiento Quan Nguyen Manjot K. Gill Copyright (c) 2026 S. Saeed Mohammadi, Sahana Aggarwal, Kavina Aggarwal, Grant Wiarda, Kayla Nguyen, Emmanuel A. Sarmiento, Quan Nguyen, Manjot K. Gill https://creativecommons.org/licenses/by-nc/4.0 2026-03-11 2026-03-11 2 1 10.35119/aivo.v2i1.157