All nodules had already been recommended for biopsy by doctors. All nodules either had benign biopsy results with >12 months of stable follow-up of the nodule or had the nodule surgically removed with pathology results. A computer program reviewed the images and decided which nodules were likely benign.
The computer correctly identified most benign nodules and could have reduced unnecessary biopsies from about 70% to about 9%. The computer performed better than both junior and senior doctors. However, 123 cancers (8%) were misdiagnosed as benign, with 56.9% being papillary microcancers (small cancers <1 cm)
WHAT ARE THE IMPLICATIONS OF THIS STUDY?
This study suggests that AI characterization works well as a clinical “goalkeeper,” significantly outperforming radiologists and potentially preventing nearly 90% of unnecessary biopsies in the study group. Fewer unnecessary biopsies mean less pain, less worry, and fewer medical visits, while still keeping patients safe. However, the misclassification of a subset of cancers underscores that AI-benign designations should trigger ongoing ultrasound follow up, known as active surveillance, rather than clinical discharge, particularly in those with intermediate- and high-risk nodules. Overall, AI characterization of ultrasound images can help doctors make better decisions and avoid unnecessary biopsies but cannot replace medical judgment. Further studies to clarify the role of AI in ultrasound imaging are needed.
— Joanna Miragaya, MD