Article Text
Abstract
Introduction It is unclear whether ultrasound risk stratification models for adnexal lesions perform well when used by novice providers. We aim to compare the performance of four commonly used models to detect ovarian cancer, when the operator has only basic experience.
Methods Women with adnexal masses treated in 2019 were identified retrospectively. Patients were included if they underwent surgery within 3 months of diagnosis or had at least 12±2 months of follow-up. A non-expert operator (European Federation of Societies for Ultrasound in Medicine and Biology level I) classified each lesion using ADNEX, two-step strategy (benign descriptors followed by ADNEX), O-RADS 2019, and O-RADS 2022. The primary outcome measure was AUC [95% confidence interval], compared across the four models.
Results A total of 556 women were included in the analyses: 452 benign and 104 malignant. The AUCs of ADNEX, the two-step strategy, O-RADS 2019, and O-RADS 2022 were 0.90[0.87–0.94], 0.91[0.88–0.94], 0.88[0.85–0.91], and 0.88[0.84–0.91], respectively (figure 1). The two-step strategy performed significantly better than the O-RADS algorithms (both p=0.01). With all the algorithms, the observed malignancy rate was 1.91–2.17% among lesions categorized as ‘almost certainly benign’, two-fold higher than the expected <1% (table 1).
Out of the four methods, lesions wrongly classified as ‘almost certainly benign’ were borderline tumors (n=4) and metastases (n=3).
Conclusions In the hands of a novice provider, all algorithms performed well, and were able to distinguish benign from malignant lesions. ADNEX misclassified only one malignant patient as ‘almost certainly benign’, compared to 5–6 patients by the other models.