Article Text
Abstract
Objective To review the literature on machine learning in endometrial cancer, report the most commonly used algorithms, and compare performance with traditional prediction models.
Methods This is a systematic review of the literature from January 1985 to March 2021 on the use of machine learning in endometrial cancer. An extensive search of electronic databases was conducted. Four independent reviewers screened studies initially by title then full text. Quality was assessed using the MINORS (Methodological Index for Non-Randomized Studies) criteria. P values were derived using the Pearson’s Χ2 test in JMP 15.0.
Results Among 4295 articles screened, 30 studies on machine learning in endometrial cancer were included. The most frequent applications were in patient datasets (33.3%, n=10), pre-operative diagnostics (30%, n=9), genomics (23.3%, n=7), and serum biomarkers (13.3%, n=4). The most commonly used models were neural networks (n=10, 33.3%) and support vector machine (n=6, 20%).
The number of publications on machine learning in endometrial cancer increased from 1 in 2010 to 29 in 2021.Eight studies compared machine learning with traditional statistics. Among patient dataset studies, two machine learning models (20%) performed similarly to logistic regression (accuracy: 0.85 vs 0.82, p=0.16). Machine learning algorithms performed similarly to detect endometrial cancer based on MRI (accuracy: 0.87 vs 0.82, p=0.24) while outperforming traditional methods in predicting extra-uterine disease in one serum biomarker study (accuracy: 0.81 vs 0.61). For survival outcomes, one study compared machine learning with Kaplan-Meier and reported no difference in concordance index (83.8% vs 83.1%).
Conclusion Although machine learning is an innovative and emerging technology, performance is similar to that of traditional regression models in endometrial cancer. More studies are needed to assess its role in endometrial cancer.
PROSPERO registration number CRD42021269565.
- Endometrial Neoplasms
Data availability statement
Data are available upon reasonable request.
Statistics from Altmetric.com
Data availability statement
Data are available upon reasonable request.
Footnotes
SP and GR are joint first authors.
Contributors SP: conceptualization, data interpretation, manuscript writing and editing. GR and DV: conceptualization, data curation, data interpretation, manuscript writing and editing. BG: conceptualization, data curation, data interpretation, manuscript writing and editing. MS, AC, and SL: data curation, data interpretation, manuscript writing and editing. DV is responsible for the overall content as guarantor.
Funding The authors have not declared a specific grant for this research from any funding agency in the public, commercial or not-for-profit sectors.
Competing interests None declared.
Provenance and peer review Not commissioned; externally peer reviewed.
Supplemental material This content has been supplied by the author(s). It has not been vetted by BMJ Publishing Group Limited (BMJ) and may not have been peer-reviewed. Any opinions or recommendations discussed are solely those of the author(s) and are not endorsed by BMJ. BMJ disclaims all liability and responsibility arising from any reliance placed on the content. Where the content includes any translated material, BMJ does not warrant the accuracy and reliability of the translations (including but not limited to local regulations, clinical guidelines, terminology, drug names and drug dosages), and is not responsible for any error and/or omissions arising from translation and adaptation or otherwise.