An Artificial Intelligence–based Mammography Screening Protocol for Breast Cancer: Outcome and Radiologist Workload

Research output: Contribution to journalJournal articleResearchpeer-review

Standard

An Artificial Intelligence–based Mammography Screening Protocol for Breast Cancer: Outcome and Radiologist Workload. / Lauritzen, Andreas D.; Rodríguez-Ruiz, Alejandro; Euler-Chelpin, My Catarina von; Lynge, Elsebeth; Vejborg, Ilse; Nielsen, Mads; Karssemeijer, Nico; Lillholm, Martin.

In: Radiology, Vol. 304, No. 1, 2022, p. 41-49.

Research output: Contribution to journalJournal articleResearchpeer-review

Harvard

Lauritzen, AD, Rodríguez-Ruiz, A, Euler-Chelpin, MCV, Lynge, E, Vejborg, I, Nielsen, M, Karssemeijer, N & Lillholm, M 2022, 'An Artificial Intelligence–based Mammography Screening Protocol for Breast Cancer: Outcome and Radiologist Workload', Radiology, vol. 304, no. 1, pp. 41-49. https://doi.org/10.1148/radiol.210948

APA

Lauritzen, A. D., Rodríguez-Ruiz, A., Euler-Chelpin, M. C. V., Lynge, E., Vejborg, I., Nielsen, M., Karssemeijer, N., & Lillholm, M. (2022). An Artificial Intelligence–based Mammography Screening Protocol for Breast Cancer: Outcome and Radiologist Workload. Radiology, 304(1), 41-49. https://doi.org/10.1148/radiol.210948

Vancouver

Lauritzen AD, Rodríguez-Ruiz A, Euler-Chelpin MCV, Lynge E, Vejborg I, Nielsen M et al. An Artificial Intelligence–based Mammography Screening Protocol for Breast Cancer: Outcome and Radiologist Workload. Radiology. 2022;304(1):41-49. https://doi.org/10.1148/radiol.210948

Author

Lauritzen, Andreas D. ; Rodríguez-Ruiz, Alejandro ; Euler-Chelpin, My Catarina von ; Lynge, Elsebeth ; Vejborg, Ilse ; Nielsen, Mads ; Karssemeijer, Nico ; Lillholm, Martin. / An Artificial Intelligence–based Mammography Screening Protocol for Breast Cancer: Outcome and Radiologist Workload. In: Radiology. 2022 ; Vol. 304, No. 1. pp. 41-49.

Bibtex

@article{a52694eff17f47ec87d998b037938539,
title = "An Artificial Intelligence–based Mammography Screening Protocol for Breast Cancer: Outcome and Radiologist Workload",
abstract = "BackgroundDevelopments in artificial intelligence (AI) systems to assist radiologists in reading mammograms could improve breast cancer screening efficiency.PurposeTo investigate whether an AI system could detect normal, moderate-risk, and suspicious mammograms in a screening sample to safely reduce radiologist workload and evaluate across Breast Imaging Reporting and Data System (BI-RADS) densities.Materials and MethodsThis retrospective simulation study analyzed mammographic examination data consecutively collected from January 2014 to December 2015 in the Danish Capital Region breast cancer screening program. All mammograms were scored from 0 to 10, representing the risk of malignancy, using an AI tool. During simulation, normal mammograms (score < 5) would be excluded from radiologist reading and suspicious mammograms (score > recall threshold [RT]) would be recalled. Two radiologists read the remaining mammograms. The RT was fitted using another independent cohort (same institution) by matching to the radiologist sensitivity. This protocol was further applied to each BI-RADS density. Screening outcomes were measured using the sensitivity, specificity, workload, and false-positive rate. The AI-based screening was tested for noninferiority sensitivity compared with radiologist screening using the Farrington-Manning test. Specificities were compared using the McNemar test.ResultsThe study sample comprised 114 421 screenings for breast cancer in 114 421 women, resulting in 791 screen-detected, 327 interval, and 1473 long-term cancers and 2107 false-positive screenings. The mean age of the women was 59 years ± 6 (SD). The AI-based screening sensitivity was 69.7% (779 of 1118; 95% CI: 66.9, 72.4) and was noninferior (P = .02) to the radiologist screening sensitivity of 70.8% (791 of 1118; 95% CI: 68.0, 73.5). The AI-based screening specificity was 98.6% (111 725 of 113 303; 95% CI: 98.5, 98.7), which was higher (P < .001) than the radiologist specificity of 98.1% (111 196 of 113 303; 95% CI: 98.1, 98.2). The radiologist workload was reduced by 62.6% (71 585 of 114 421), and 25.1% (529 of 2107) of false-positive screenings were avoided. Screening results were consistent across BI-RADS densities, although not significantly so for sensitivity.ConclusionArtificial intelligence (AI)–based screening could detect normal, moderate-risk, and suspicious mammograms in a breast cancer screening program, which may reduce the radiologist workload. AI-based screening performed consistently across breast densities.",
author = "Lauritzen, {Andreas D.} and Alejandro Rodr{\'i}guez-Ruiz and Euler-Chelpin, {My Catarina von} and Elsebeth Lynge and Ilse Vejborg and Mads Nielsen and Nico Karssemeijer and Martin Lillholm",
year = "2022",
doi = "10.1148/radiol.210948",
language = "English",
volume = "304",
pages = "41--49",
journal = "Radiology",
issn = "0033-8419",
publisher = "Radiological Society of North America, Inc.",
number = "1",

}

RIS

TY - JOUR

T1 - An Artificial Intelligence–based Mammography Screening Protocol for Breast Cancer: Outcome and Radiologist Workload

AU - Lauritzen, Andreas D.

AU - Rodríguez-Ruiz, Alejandro

AU - Euler-Chelpin, My Catarina von

AU - Lynge, Elsebeth

AU - Vejborg, Ilse

AU - Nielsen, Mads

AU - Karssemeijer, Nico

AU - Lillholm, Martin

PY - 2022

Y1 - 2022

N2 - BackgroundDevelopments in artificial intelligence (AI) systems to assist radiologists in reading mammograms could improve breast cancer screening efficiency.PurposeTo investigate whether an AI system could detect normal, moderate-risk, and suspicious mammograms in a screening sample to safely reduce radiologist workload and evaluate across Breast Imaging Reporting and Data System (BI-RADS) densities.Materials and MethodsThis retrospective simulation study analyzed mammographic examination data consecutively collected from January 2014 to December 2015 in the Danish Capital Region breast cancer screening program. All mammograms were scored from 0 to 10, representing the risk of malignancy, using an AI tool. During simulation, normal mammograms (score < 5) would be excluded from radiologist reading and suspicious mammograms (score > recall threshold [RT]) would be recalled. Two radiologists read the remaining mammograms. The RT was fitted using another independent cohort (same institution) by matching to the radiologist sensitivity. This protocol was further applied to each BI-RADS density. Screening outcomes were measured using the sensitivity, specificity, workload, and false-positive rate. The AI-based screening was tested for noninferiority sensitivity compared with radiologist screening using the Farrington-Manning test. Specificities were compared using the McNemar test.ResultsThe study sample comprised 114 421 screenings for breast cancer in 114 421 women, resulting in 791 screen-detected, 327 interval, and 1473 long-term cancers and 2107 false-positive screenings. The mean age of the women was 59 years ± 6 (SD). The AI-based screening sensitivity was 69.7% (779 of 1118; 95% CI: 66.9, 72.4) and was noninferior (P = .02) to the radiologist screening sensitivity of 70.8% (791 of 1118; 95% CI: 68.0, 73.5). The AI-based screening specificity was 98.6% (111 725 of 113 303; 95% CI: 98.5, 98.7), which was higher (P < .001) than the radiologist specificity of 98.1% (111 196 of 113 303; 95% CI: 98.1, 98.2). The radiologist workload was reduced by 62.6% (71 585 of 114 421), and 25.1% (529 of 2107) of false-positive screenings were avoided. Screening results were consistent across BI-RADS densities, although not significantly so for sensitivity.ConclusionArtificial intelligence (AI)–based screening could detect normal, moderate-risk, and suspicious mammograms in a breast cancer screening program, which may reduce the radiologist workload. AI-based screening performed consistently across breast densities.

AB - BackgroundDevelopments in artificial intelligence (AI) systems to assist radiologists in reading mammograms could improve breast cancer screening efficiency.PurposeTo investigate whether an AI system could detect normal, moderate-risk, and suspicious mammograms in a screening sample to safely reduce radiologist workload and evaluate across Breast Imaging Reporting and Data System (BI-RADS) densities.Materials and MethodsThis retrospective simulation study analyzed mammographic examination data consecutively collected from January 2014 to December 2015 in the Danish Capital Region breast cancer screening program. All mammograms were scored from 0 to 10, representing the risk of malignancy, using an AI tool. During simulation, normal mammograms (score < 5) would be excluded from radiologist reading and suspicious mammograms (score > recall threshold [RT]) would be recalled. Two radiologists read the remaining mammograms. The RT was fitted using another independent cohort (same institution) by matching to the radiologist sensitivity. This protocol was further applied to each BI-RADS density. Screening outcomes were measured using the sensitivity, specificity, workload, and false-positive rate. The AI-based screening was tested for noninferiority sensitivity compared with radiologist screening using the Farrington-Manning test. Specificities were compared using the McNemar test.ResultsThe study sample comprised 114 421 screenings for breast cancer in 114 421 women, resulting in 791 screen-detected, 327 interval, and 1473 long-term cancers and 2107 false-positive screenings. The mean age of the women was 59 years ± 6 (SD). The AI-based screening sensitivity was 69.7% (779 of 1118; 95% CI: 66.9, 72.4) and was noninferior (P = .02) to the radiologist screening sensitivity of 70.8% (791 of 1118; 95% CI: 68.0, 73.5). The AI-based screening specificity was 98.6% (111 725 of 113 303; 95% CI: 98.5, 98.7), which was higher (P < .001) than the radiologist specificity of 98.1% (111 196 of 113 303; 95% CI: 98.1, 98.2). The radiologist workload was reduced by 62.6% (71 585 of 114 421), and 25.1% (529 of 2107) of false-positive screenings were avoided. Screening results were consistent across BI-RADS densities, although not significantly so for sensitivity.ConclusionArtificial intelligence (AI)–based screening could detect normal, moderate-risk, and suspicious mammograms in a breast cancer screening program, which may reduce the radiologist workload. AI-based screening performed consistently across breast densities.

U2 - 10.1148/radiol.210948

DO - 10.1148/radiol.210948

M3 - Journal article

C2 - 35438561

VL - 304

SP - 41

EP - 49

JO - Radiology

JF - Radiology

SN - 0033-8419

IS - 1

ER -

ID: 307748943