Breast cancer histologic grading using digital microscopy: concordance and outcome association

Emad A Rakha; Mohamed Aleskandarani; Michael S Toss; Andrew R Green; Graham Ball; Ian O Ellis; Leslie W Dalton

doi:10.1136/jclinpath-2017-204979

Emad A Rakha¹, Mohamed Aleskandarani¹, Michael S Toss¹, Andrew R Green¹, Graham Ball², Ian O Ellis¹, Leslie W Dalton³

¹ Division of Cancer and Stem Cells, School of Medicine, University of Nottingham, Nottingham City Hospital, Nottingham, UK
² School of Science and Technology, John van Geest Cancer Research Centre, Nottingham Trent University, Nottingham, UK
³ Department of Histopathology, South Austin Hospital, Austin, Texas, USA

Correspondence to Dr Emad A Rakha, Department of Histopathology, Nottingham University Hospital NHS Trust, Nottingham NG5 1PB, UK; emad.rakha@nuh.nhs.uk

Abstract

Aims Virtual microscopy utilising digital whole slide imaging (WSI) is increasingly used in breast pathology. Histologic grade is one of the strongest prognostic factors in breast cancer (BC). This study aims at investigating the agreement between BC grading using traditional light microscopy (LM) and digital WSI with consideration of reproducibility and impact on outcome prediction.

Methods A large (n=1675) well-characterised cohort of BC originally graded by LM was re-graded using WSI. Two separate virtual-based grading sessions (V1 and V2) were performed with a 3-month washout period. Outcome was assessed using BC-specific and distant metastasis-free survival.

Results The concordance between LM grading and WSI was strong (LM/WSI Cramer’s V: V1=0.576 and V2=0.579). The agreement regarding grade components was as follows: tubule formation=0.538, pleomorphism=0.422 and mitosis=0.514. Greatest discordance was observed between adjacent grades, whereas high/low grade discordance was uncommon (1.5%). The intraobserver agreement for the two WSI sessions was substantial for grade (V1/V2 Cramer’s V=0.676; kappa=0.648) and grade components (Cramer’s V: tubule formation (T)=0.628, pleomorphism (P)=0.573 and mitosis (M)=0.580). Grading using both platforms showed strong association with outcome (all p values <0.001). Although mitotic scores assessed using both platforms were strongly associated with outcome, WSI tended to underestimate mitotic counts.

Conclusions Virtual microscopy is a reliable and reproducible method for assessing BC histologic grade. Regardless of the observer or assessment platform, histologic grade is a significant predictor of outcome. Continuing advances in imaging technology could potentially provide improved performance of WSI BC grading and in particular mitotic count assessment.

breast pathology
grade
agreement
virtual microscopy

Introduction

Virtual microscopy (VM) using digital whole slide imaging (WSI) is a technology through which glass slides of pathological specimens are digitally scanned at high resolution for viewing on a computer screen. Applications of WSI in clinical, educational and research settings, including image analysis, are increasing, and in some centres WSI has replaced conventional microscopy as a diagnostic tool used by pathologists.1–6 However, one of the main concerns related to VM adoption in breast pathology, in addition to diagnosis, is the assessment of prognostic and predictive variables including histologic grade.6 There is a perception that the quality of the images displayed by WSI may interfere with reliable histologic grading. In addition, the interpretive ability of the reporting pathologist assigning a ‘virtual grade’ to each cancer remains largely unknown.

Therefore, to improve WSI performance, enhancement of the WSI platform and training of histopathologists in the digital environment are recommended. However, testing the performance and reproducibility of WSI in case reporting is critically needed. This could be achieved via head-to-head comparison of WSI with traditional light microscopy (LM) to provide sufficient evidence prior to clinical adoption.

Grading of breast cancer (BC) using the Nottingham combined histologic grade is one of the strongest prognostic factors in early-stage disease.7–9 Grade is one of the main components of several management decision tools10–13 and it has recently been included in the American Joint Committee on Cancer tumour, node, metastasis (TNM) staging system as a stage modifier.14 15 However, concordance of BC grading among pathologists using glass slides is only moderate, with kappa values of 0.48¹⁶ to 0.53¹⁷; the highest concordance is observed in grade 3 (kappa 0.60) and grade 1 (kappa 0.51) tumours, whereas the lowest is observed in grade 2 (kappa 0.33) tumours.16 The impact of introducing WSI into routine practice on the concordance of grade and its performance as a prognostic factor remains to be defined.

Therefore, this study aims at comparing the histologic grading of BC as assigned by an expert pathologist using WSI with the grade assessed in routine practice using LM. In addition to assessment of concordance, impact of different grading platforms on patient outcome was evaluated using the large well-characterised Nottingham BC cohort.

Patients and methods

This study was performed on a large series (n=1675) of patients with early-stage invasive primary operable BC who presented to Nottingham City Hospital from 1999 to 2006. This is a well-characterised cohort of BC with long-term clinical follow-up (median 135 months) and detailed clinicopathological profiles. Data collected included primary tumour histologic grade and grade components, tumour size and histotype, lymph node stage, nodal status, lymphovascular invasion, Nottingham Prognostic Index, molecular subtypes and outcome. Outcome data comprised breast cancer-specific survival (BCSS), defined as the time (in months) from the date of the primary surgical treatment to the time of death from BC, and distant metastasis-free survival (DMFS), defined as the time (in months) from surgery until the first event of distant metastasis. Patient and tumour demographics are summarised in table 1.

Table 1

Characteristics of the breast cancer cohort

This tumour cohort was originally graded using the Nottingham grading system during routine pathology reporting utilising all available tumour glass slides (average four slides per case) and LM.8 For the purpose of this study, data for the final grade and the individual grade components (tubule formation, nuclear pleomorphism and mitotic count scores) were retrieved from the patients’ records. One to three tumour blocks per case were retrieved, and freshly prepared H&E slides were reviewed. A representative slide per case was selected by a specialised breast pathologist (EAR) without further glass slide grading. Glass slides were scanned into high-resolution (0.19 µm/pixel) digital images at ×20 magnification using a 3DHistech Pannoramic 250 Flash II scanner (3DHISTECH, Budapest, Hungary). The digital WSIs were generated, stored and viewed using the 3DHistech Pannoramic Viewer (3DHISTECH; http://www.3dhistech.com/downloads) on a high-resolution screen. The digital slide was graded using the College of American Pathologists’ criteria,18 which are essentially the same as the original Nottingham criteria.7 Digital images were initially examined at low magnification, at which tubule formation was assessed. Low-to-intermediate magnification was also used to identify potential ‘hotspots’ for mitotic counting. For mitotic counting, the distance measurement tool of the software was used to delimit the area examined, which was important for determining the number of mitotic figures in a given area.

To assess intraobserver agreement of BC grading using WSI, the whole cohort was graded a second time by the same observer (LWD, an experienced breast pathologist with a special interest in BC grading) using the same criteria after a 3-month washout period, with no special training during that time. In both WSI grading sessions (V1 and V2), grade components were assigned blinded to the LM grade as well as to other clinicopathological parameters.

Statistical analysis

Statistical analysis was performed using functions obtained from the open-source R statistical platform.19 Because WSI and LM differ procedurally, Cramer’s V statistic was adopted to judge the strength of concordance.20 The coefficient ranges from 0 (no association) to 1 (perfect association). The kappa statistic is technically a measure of concordance between two observers who examine the same parameter following the same approach.21 For these analyses, the function for Cramer’s V (assocstats) was taken from the R library vcd, while the function for the kappa statistic was obtained from the R inter-rater reliability library. Survival analysis was performed using SPSS V.23 for Windows using the log-rank test and Kaplan-Meier plots. Survival analysis (BCSS and DMFS) was performed on WSI grade as well as the WSI component scores. Likewise, survival analysis was performed for the glass-slide LM grade, including separate analysis of the component scores. Multivariate analysis was performed using Cox proportional hazards analysis with inclusion of parameters significantly associated with outcome in univariate analysis. Statistical significance in survival stratification was calculated by the log-rank method and univariate Cox regression analysis. A p value of <0.05 (two tailed) was considered significant.
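As an illustration of the two concordance statistics named above, the following is a minimal Python sketch of their standard textbook definitions. It is not the R code used in the study; the function names and the example counts are hypothetical and chosen only for demonstration.

```python
from math import sqrt


def cramers_v(table):
    """Cramer's V for an r x c contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals))
    return sqrt(chi2 / (n * (k - 1)))


def cohens_kappa(table):
    """Unweighted Cohen's kappa for a square agreement table."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    # Observed agreement is the share of cases on the diagonal.
    p_observed = sum(table[i][i] for i in range(len(table))) / n
    # Chance agreement follows from the marginal totals.
    p_expected = sum(r * c for r, c in zip(row_totals, col_totals)) / n ** 2
    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical counts (NOT the study's data):
# rows = LM grade 1-3, columns = WSI grade 1-3.
example = [
    [40, 15, 1],
    [12, 60, 18],
    [2, 14, 38],
]
print(round(cramers_v(example), 3), round(cohens_kappa(example), 3))
```

Both functions assume that every row and column contains at least one case; an empty margin would cause a division by zero in this sketch.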

Results

In this study, a large (n=1675) retrospective cohort of early invasive primary operable BC was graded on high-resolution digital images acquired through WSI of representative slides. For this WSI grading, as for the original LM, the three-tier histologic grade of BC was used.7 8 18 Tables 2 and 3 show cross comparison of WSI grade with LM grade as well as the cross comparison of the three components of the Nottingham grade. Table 4 shows the cross comparison of the sum of grade components (3–9 scale) for LM grade scores and WSI grade scores.

Table 2

Cross comparison of Nottingham grade (A) and grade component scores (B–D) between virtual microscopy* and traditional light microscopy

Table 3

Concordance between light microscopy grade and its component scores with virtual microscopy grade and its component scores assessed using Cramer’s V and kappa statistic

Table 4

Cross comparison of the sum of grade components between virtual microscopy and light microscopy

The agreement between WSI grading and glass slide/LM grading was moderate for both WSI grading sessions when the kappa statistic was used (V1/LM kappa=0.51 and V2/LM kappa=0.50). However, when Cramer’s V statistic was used, the value for WSI with LM was 0.58 in both sessions, which is considered substantial concordance. If grade is reduced to a binary level of high (ie, grade 3) versus not high (ie, grades 1 and 2), the Cramer’s V was 0.66. The unweighted kappa statistic for WSI grade with LM grade was 0.51. The kappa statistics for component scores were as follows: mitoses=0.47; tubules=0.49; and pleomorphism=0.3.

Importantly, exact grade agreement between WSI and LM grading was reached in 68% of cancers. Overall discordance between WSI and LM grade was 32.3%, largely between adjacent grades: low versus intermediate, or intermediate versus high. The number of cancers with low–intermediate discordance (255 cancers; 15.1%) was almost evenly matched by the number with intermediate–high discordance (265 cancers; 15.7%). The binary high versus low/intermediate discordance of grade was 17%. Only 26 (1.5%) grade assignments showed a high versus low-grade discrepancy, and the proportional reduction from high grade in LM to low grade in WSI was highly significant (p<0.00001).
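The breakdown described above (exact agreement, adjacent-grade discordance, high versus low discordance, and discordance after collapsing grade to a binary scale) can be derived from a 3×3 cross-table. The Python sketch below shows one way to do this; the function name and the counts are hypothetical and are not the study's data.

```python
def grade_agreement_summary(table):
    """Summarise a 3x3 cross-table (rows = LM grade 1-3, columns = WSI grade 1-3).

    Returns proportions of all cases in each agreement category.
    """
    n = sum(sum(row) for row in table)
    # Same grade on both platforms.
    exact = sum(table[i][i] for i in range(3))
    # Grades differing by one step (1 vs 2, or 2 vs 3).
    adjacent = sum(
        table[i][j] for i in range(3) for j in range(3) if abs(i - j) == 1
    )
    # Grade 1 on one platform and grade 3 on the other.
    extreme = table[0][2] + table[2][0]
    # Binary collapse: grade 3 (index 2) versus grades 1-2.
    binary_discordant = table[0][2] + table[1][2] + table[2][0] + table[2][1]
    return {
        "exact": exact / n,
        "adjacent": adjacent / n,
        "extreme": extreme / n,
        "binary_discordant": binary_discordant / n,
    }


# Hypothetical counts for illustration only.
example_table = [
    [40, 15, 1],
    [12, 60, 18],
    [2, 14, 38],
]
for name, share in grade_agreement_summary(example_table).items():
    print(f"{name}: {share:.1%}")
```

In this scheme the exact, adjacent and extreme categories partition all cases, whereas the binary discordance counts only disagreements that cross the grade 2/grade 3 boundary.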

The intraobserver agreement for the two virtual sessions (V1/V2) was higher than the values of agreement between WSI and LM but remained in the moderate concordance category (table 5).

Table 5

Concordance of virtual microscopy grade and its component scores between first and second session of virtual scoring (intraobserver agreement of grade using virtual microscopy)

Survival analysis

Survival analysis was performed on both the grade assigned by WSI and the original LM grade (table 6). WSI grading in both grading sessions showed statistically significant differences for BCSS and DMFS, as did the LM grading (p=1×10⁻¹³). Individual WSI grade components showed statistically significant differences for BCSS and DMFS. WSI tubule formation showed a stronger association with BCSS than that of LM (HR=2.8, 95% CI 1.9 to 4.0 and HR=1.9, 95% CI 1.5 to 2.4, for WSI and LM, respectively). Similar results were observed for DMFS (HR=2.6, 95% CI 1.9 to 3.6 and HR=1.7, 95% CI 1.4 to 2.1). Figures 1 and 2 show survival curves of the final WSI and LM-based histologic grade as well as grade components and BCSS.
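For readers unfamiliar with the Kaplan-Meier method underlying the survival curves, the following Python sketch implements the product-limit estimator for a single group. It is an illustration only: the study's analysis was run in SPSS, and the function name and follow-up values here are hypothetical.

```python
def kaplan_meier(times, events):
    """Return [(time, survival_probability)] at each distinct event time.

    times  : follow-up in months
    events : 1 if the event (e.g. death from BC) occurred, 0 if censored
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Gather every patient whose follow-up ends at time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            # Product-limit step: multiply by the fraction surviving time t.
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        # Both events and censored cases leave the risk set after time t.
        at_risk -= removed
    return curve


# Hypothetical follow-up data for one grade group.
months = [5, 10, 10, 20]
status = [1, 0, 1, 1]
for t, s in kaplan_meier(months, status):
    print(t, round(s, 3))
```

Patients censored at the same time as an event are counted as still at risk for that event, which is the usual convention. Comparing curves between grade groups would additionally require a log-rank test, which is not shown here.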

Figure 1

Association between histologic grade as assessed using digital slide imaging and traditional light microscope and breast cancer-specific survival (BCSS).

Figure 2

Association between histologic grade components as assessed using digital slide image and traditional light microscope and breast cancer-specific survival (BCSS); tubule formation: (A) & (B), pleomorphism: (C) & (D) and mitotic scores: (E) & (F).

Table 6

Association between outcome, in terms of BCSS and DMFS, and histologic grade, as assessed by VM and LM

To assess the prognostic independence of BC grade assigned using LM and WSI V1 and V2, multivariate analyses were performed including other established prognostic variables in the models. LM grading as well as WSI V1 and V2 were significantly associated with BCSS (p values for the three grading methods were <0.001) and DMFS (p value <0.001), independent of other variables (table 7).

Table 7

Multivariate Cox proportional hazard analysis for predictors of BCSS and DMFS for histologic grade, as assessed by light microscopy and virtual microscopy sessions 1 and 2

Discussion

Currently, there is an increasing interest in using WSI for diagnostic and research purposes. However, it is crucial to ensure that diagnostic performance using virtual slides is at least equivalent to that of conventional LM. To validate the diagnostic concordance of WSI and LM, the College of American Pathologists has issued 12 rigorously developed guideline statements intended to give pathology laboratories a practical guide to validating WSI systems for diagnostic work.22 These include, but are not limited to, the number of cases required for double reporting (at least 60 cases per application) and the washout period (at least 2 weeks). In the current study, more than 1600 BCs were regraded using WSI by an expert pathologist and the results were compared with the original grade generated in routine practice. The quantifiable three-tier Nottingham grade, which combines scores for the degree of tubule formation, nuclear pleomorphism and mitotic frequency, is an ideal parameter for comparing WSI with LM. To assess the intraobserver concordance and the impact of WSI training, the whole cohort was graded again after a long washout interval of 3 months. The endpoints of this study were the concordance statistics as well as the patients’ clinical outcome. To the best of our knowledge, this is the largest study performing a head-to-head comparison of BC grading using WSI and LM that includes patients’ survival as a study endpoint.

WSI grading showed moderate concordance with LM grading, comparable to the concordance reported among different pathologists who graded BC using conventional microscopy.16 17 Exact grade agreement between WSI and LM grading was reached in 68% of cases. This magnitude of concordance is in line with a prior reproducibility study.23 Because WSI differs procedurally from LM, some emphasis was given to Cramer’s V as a measure of concordance. Multiple authorities consider a Cramer’s V greater than 0.5 to be the breakpoint for acceptable concordance.24 25 In the current study, WSI grade as compared with LM grade had a Cramer’s V of 0.58 at the ternary level and 0.66 at a binary level. These figures indicate high levels of reproducibility and demonstrate the reliability of WSI as a platform for grading BC, taking into account the inherent discordance in grade assignment between different observers using a single platform. Detailed analysis of discordance at the level of individual cases awaits further study, including evaluation by recently introduced technologies.

In this study, the true merit of WSI as compared with LM was further studied with regard to whether both offered a comparable level of survival stratification, using the large number of cases with long-term follow-up data. For both WSI and LM, grade and its individual components showed significant association with patient outcome. Interestingly, tubule formation as assessed by WSI showed a stronger association with outcome than LM assessment. Of note, our study demonstrated that morphology is easy enough to be amenable to survival analysis while technically difficult molecular assays are not.26 27

The intraobserver agreement for the two WSI sessions was moderate and both sessions showed similar association with outcome. These results support the view that the level of concordance is to a large extent related to observer performance and the subjective nature of grade rather than the platform used. This should be considered together with the limitations of the current study, which include: (1) grade was assessed by different observers, (2) original grade was assessed using an average of four tumour tissue slides per case whereas WSI grade was assessed on a single slide and (3) the WSI scan magnification used was ×20 rather than ×40, which is considered ideal for assessment of mitotic counts. In fact, among the three Nottingham grade components, the most challenging to evaluate by WSI was mitotic count. There was difficulty in discerning mitotic figures from apoptotic cells. Although this was largely attributed to resolution, the inability of WSI to provide different focal planes may have been an additional hurdle. Therefore, assessment of mitotic counts using ×40 magnification may help resolve this issue. However, the large number of cases in this study and the repeated grading by the same observer using WSI have potentially overcome these limitations. The tendency towards lower mitotic scores in WSI compared with LM is likely related to the use of a single slide per case and the lower magnification used in WSI.

In BC grading there will be, without doubt, some discordance among grade assignments by WSI as well as between WSI and LM grading. Comparisons among biomarkers tested for diagnostic and research purposes share this possibility of discordance.25 26 28 However, at the level of an individual patient, especially in the diagnostic setting, discordance is usually met with caution and concordance is sought. Therefore, sustained effort is critically needed to improve concordance, or at least to improve understanding of the meaning of discordance. In the current study, grading was validated as a ternary scheme and as a binary scheme to assess concordance of both grading platforms. Previous studies addressing binary biomarkers have compared their results with grade by collapsing grade into a binary scheme. For illustrative purposes, we did the same, and showed strong concordance of WSI with LM whether low and intermediate grade were combined, or intermediate and high grade; concordance of grade was Cramer’s V=0.55 if low was combined with intermediate.

As mentioned above, two factors are thought to be responsible for any underperformance of WSI in the assessment of histologic grade: the WSI technology itself and the reader. This study demonstrates that grading using WSI is reproducible and provides significant survival information comparable to glass slides. The concordance rate between glass slide grading and WSI was comparable to those reported using glass slides as the only tool, and the intraobserver concordance using WSI was even higher than that reported by multiple readers using glass slides.29 30 In addition to providing evidence for the reproducibility and reliability of WSI in grading BC, this study prompts the question of what minimum number of randomly selected cases would be needed to show whether a histopathologist can predict survival using WSI grade. If that number is low enough, then WSI may be a method to test competence at the level of survival prediction and not just concordance. The use of WSI technology also opens up opportunities for computer-assisted classification of histologic grade, with inherent improved standardisation and reproducibility of evaluation and potential for refinement of methodology.

Take home messages

Regardless of the observer or assessment platform, histologic grade is a significant predictor of outcome.
Virtual microscopy is a reliable and reproducible method for assessing breast cancer histologic grade.
Higher magnification (×40) is recommended to produce adequate resolution for an accurate grading.
Continuing advances in imaging technology could potentially provide improved performance of whole slide imaging breast cancer grading and in particular mitotic count assessment.

Acknowledgments

We thank the Nottingham Health Science Biobank and Breast Cancer Now Tissue Bank for the provision of tissue samples.

References

1. Kayser K. Introduction of virtual microscopy in routine surgical pathology-a hypothesis and personal view from Europe. Diagn Pathol 2012;7:48. doi:10.1186/1746-1596-7-48
2. Allen TC. Digital pathology and federalism. Arch Pathol Lab Med 2014;138:162–5. doi:10.5858/arpa.2013-0258-ED
3. Hedvat CV. Digital microscopy: past, present, and future. Arch Pathol Lab Med 2010;134:1666–70. doi:10.1043/2009-0579-RAR1.1
4. Rocha R, Vassallo J, Soares F, et al. Digital slides: present status of a tool for consultation, teaching, and quality control in pathology. Pathol Res Pract 2009;205:735–41. doi:10.1016/j.prp.2009.05.004
5. Brachtel E, Yagi Y. Digital imaging in pathology-current applications and challenges. J Biophotonics 2012;5:327–35. doi:10.1002/jbio.201100103
6. Al-Janabi S, Huisman A, Van Diest PJ. Digital pathology: current status and future perspectives. Histopathology 2012;61:1–9. doi:10.1111/j.1365-2559.2011.03814.x
7. Elston CW, Ellis IO. Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: experience from a large study with long-term follow-up. Histopathology 1991;19:403–10. doi:10.1111/j.1365-2559.1991.tb00229.x
8. Rakha EA, El-Sayed ME, Lee AH, et al. Prognostic significance of Nottingham histologic grade in invasive breast carcinoma. J Clin Oncol 2008;26:3153–8. doi:10.1200/JCO.2007.15.5986
9. Rakha EA, Reis-Filho JS, Baehner F, et al. Breast cancer prognostic classification in the molecular era: the role of histological grade. Breast Cancer Res 2010;12:207. doi:10.1186/bcr2607
10. Galea MH, Blamey RW, Elston CE, et al. The Nottingham Prognostic Index in primary breast cancer. Breast Cancer Res Treat 1992;22:207–19. doi:10.1007/BF01840834
11. Wishart GC, Bajdik CD, Dicks E, et al. PREDICT Plus: development and validation of a prognostic model for early breast cancer that includes HER2. Br J Cancer 2012;107:800–7. doi:10.1038/bjc.2012.338
12. Carlson RW, Brown E, Burstein HJ, et al. NCCN task force report: Adjuvant therapy for breast cancer. J Natl Compr Canc Netw 2006;4(Suppl 1):S1–26.
13. Curigliano G, Burstein HJ, Winer EP, et al. De-escalating and escalating treatments for early-stage breast cancer: the St. Gallen International Expert Consensus Conference on the Primary Therapy of Early Breast Cancer 2017. Ann Oncol 2017;28:1700–12. doi:10.1093/annonc/mdx308
14. Giuliano AE, Connolly JL, Edge SB, et al. Breast Cancer-Major changes in the American Joint Committee on Cancer eighth edition cancer staging manual. CA Cancer J Clin 2017;67:290–303. doi:10.3322/caac.21393
15. American Joint Committee on Cancer (AJCC). AJCC cancer staging manual. 8th ed. New York: Springer, 2017.
16. Rakha EA, Bennett RL, Coleman D, et al. Review of the national external quality assessment (EQA) scheme for breast pathology in the UK. J Clin Pathol 2017;70:51–7. doi:10.1136/jclinpath-2016-203800
17. Sloane JP, Amendoeira I, Apostolikas N, et al. Consistency achieved by 23 European pathologists from 12 countries in diagnosing breast disease and reporting prognostic features of carcinomas. Virchows Arch 1999;434:3–10.
18. Lester SC, Bose S, Chen YY, et al. Protocol for the examination of specimens from patients with invasive carcinoma of the breast. Arch Pathol Lab Med 2009;133:1515–38. doi:10.1043/1543-2165-133.10.1515
19. R: A language and environment for statistical computing [computer program]. R Foundation for Statistical Computing, 2013.
20. McHugh ML. The chi-square test of independence. Biochem Med 2013;23:143–9. doi:10.11613/BM.2013.018
21. Kundel HL, Polansky M. Measurement of observer agreement. Radiology 2003;228:303–8. doi:10.1148/radiol.2282011860
22. Pantanowitz L, Sinard JH, Henricks WH, et al. Validating whole slide imaging for diagnostic purposes in pathology: guideline from the College of American Pathologists Pathology and Laboratory Quality Center. Arch Pathol Lab Med 2013;137:1710–22. doi:10.5858/arpa.2013-0093-CP
23. Shaw EC, Hanby AM, Wheeler K, et al. Observer agreement comparing the use of virtual slides with glass slides in the pathology review component of the POSH breast cancer cohort study. J Clin Pathol 2012;65:403–8. doi:10.1136/jclinpath-2011-200369
24. Haibe-Kains B, Desmedt C, Loi S, et al. A three-gene model to robustly identify breast cancer molecular subtypes. J Natl Cancer Inst 2012;104:311–25. doi:10.1093/jnci/djr545
25. Fan C, Oh DS, Wessels L, et al. Concordance among gene-expression-based predictors for breast cancer. N Engl J Med 2006;355:560–9. doi:10.1056/NEJMoa052933
26. Bartlett JM, Bayani J, Marshall A, et al. Comparing Breast Cancer Multiparameter Tests in the OPTIMA Prelim Trial: No Test Is More Equal Than the Others. J Natl Cancer Inst 2016;108:djw050. doi:10.1093/jnci/djw050
27. Varga Z, Diebold J, Dommann-Scherrer C, et al. How reliable is Ki-67 immunohistochemistry in grade 2 breast carcinomas? A QA study of the Swiss Working Group of Breast- and Gynecopathologists. PLoS One 2012;7:e37379. doi:10.1371/journal.pone.0037379
28. Zhong F, Bi R, Yu B, et al. A Comparison of Visual Assessment and Automated Digital Image Analysis of Ki67 Labeling Index in Breast Cancer. PLoS One 2016;11:e0150505. doi:10.1371/journal.pone.0150505
29. Schuh F, Biazús JV, Resetkova E, et al. Histopathological grading of breast ductal carcinoma in situ: validation of a web-based survey through intra-observer reproducibility analysis. Diagn Pathol 2015;10:93. doi:10.1186/s13000-015-0320-2
30. Dalton LW, Gerds TA. The Advantage of Discordance: An Example Using the Highly Subjective Nuclear Grading of Breast Cancer. Am J Surg Pathol 2017;41:1105–11. doi:10.1097/PAS.0000000000000886

Footnotes

EAR and MA contributed equally.
Handling editor Dhirendra Govender.
Funding This research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors.
Competing interests None declared.
Ethics approval This study was approved by Nottingham Research Ethics Committee 2 under the title ‘Development of a molecular genetic classification of breast cancer’, and in compliance with current ethical and legal guidelines of the UK.
Provenance and peer review Not commissioned; externally peer reviewed.
Data sharing statement Data are available upon request and at the discretion of the authors.

[1] ↵

Kayser K
. Introduction of virtual microscopy in routine surgical pathology-a hypothesis and personal view from Europe. Diagn Pathol 2012;7:48.doi:10.1186/1746-1596-7-48
OpenUrl CrossRef PubMed

[3] Kayser K

[4] ↵

Allen TC
. Digital pathology and federalism. Arch Pathol Lab Med 2014;138:162–5.doi:10.5858/arpa.2013-0258-ED
OpenUrl CrossRef PubMed

[6] Allen TC

[7] ↵

Hedvat CV
. Digital microscopy: past, present, and future. Arch Pathol Lab Med 2010;134:1666–70.doi:10.1043/2009-0579-RAR1.1
OpenUrl PubMed

[9] Hedvat CV

[10] ↵

Rocha R ,
Vassallo J ,
Soares F , et al
. Digital slides: present status of a tool for consultation, teaching, and quality control in pathology. Pathol Res Pract 2009;205:735–41.doi:10.1016/j.prp.2009.05.004
OpenUrl CrossRef PubMed

[12] Rocha R ,

[13] Vassallo J ,

[14] Soares F , et al

[15] ↵

Brachtel E ,
Yagi Y
. Digital imaging in pathology-current applications and challenges. J Biophotonics 2012;5:327–35.doi:10.1002/jbio.201100103
OpenUrl CrossRef PubMed Web of Science

[17] Brachtel E ,

[18] Yagi Y

[19] ↵

Al-Janabi S ,
Huisman A ,
Van Diest PJ
. Digital pathology: current status and future perspectives. Histopathology 2012;61:1–9.doi:10.1111/j.1365-2559.2011.03814.x
OpenUrl PubMed

[21] Al-Janabi S ,

[22] Huisman A ,

[23] Van Diest PJ

[24] ↵

Elston CW ,
Ellis IO
. Pathological prognostic factors in breast cancer. I. The value of histological grade in breast cancer: experience from a large study with long-term follow-up. Histopathology 1991;19:403–10.doi:10.1111/j.1365-2559.1991.tb00229.x
OpenUrl CrossRef PubMed Web of Science

[26] Elston CW ,

[27] Ellis IO

[28] ↵

Rakha EA ,
El-Sayed ME ,
Lee AH , et al
. Prognostic significance of Nottingham histologic grade in invasive breast carcinoma. J Clin Oncol 2008;26:3153–8.doi:10.1200/JCO.2007.15.5986
OpenUrl Abstract/FREE Full Text

[30] Rakha EA ,

[31] El-Sayed ME ,

[32] Lee AH , et al

[33] ↵

Rakha EA ,
Reis-Filho JS ,
Baehner F , et al
. Breast cancer prognostic classification in the molecular era: the role of histological grade. Breast Cancer Res 2010;12:207.doi:10.1186/bcr2607
OpenUrl PubMed Web of Science

[35] Rakha EA ,

[36] Reis-Filho JS ,

[37] Baehner F , et al

[38] ↵

Galea MH ,
Blamey RW ,
Elston CE , et al
. The Nottingham Prognostic Index in primary breast cancer. Breast Cancer Res Treat 1992;22:207–19.doi:10.1007/BF01840834
OpenUrl CrossRef PubMed Web of Science

[40] Galea MH ,

[41] Blamey RW ,

[42] Elston CE , et al

Wishart GC, Bajdik CD, Dicks E, et al. PREDICT Plus: development and validation of a prognostic model for early breast cancer that includes HER2. Br J Cancer 2012;107:800–7. doi:10.1038/bjc.2012.338

Carlson RW, Brown E, Burstein HJ, et al. NCCN task force report: Adjuvant therapy for breast cancer. J Natl Compr Canc Netw 2006;4(Suppl 1):S1–26.

Curigliano G, Burstein HJ, Winer EP, et al. De-escalating and escalating treatments for early-stage breast cancer: the St. Gallen International Expert Consensus Conference on the Primary Therapy of Early Breast Cancer 2017. Ann Oncol 2017;28:1700–12. doi:10.1093/annonc/mdx308

Giuliano AE, Connolly JL, Edge SB, et al. Breast Cancer-Major changes in the American Joint Committee on Cancer eighth edition cancer staging manual. CA Cancer J Clin 2017;67:290–303. doi:10.3322/caac.21393

American Joint Committee on Cancer (AJCC). AJCC cancer staging manual. 8th ed. New York: Springer, 2017.

Rakha EA, Bennett RL, Coleman D, et al. Review of the national external quality assessment (EQA) scheme for breast pathology in the UK. J Clin Pathol 2017;70:51–7. doi:10.1136/jclinpath-2016-203800

Sloane JP, Amendoeira I, Apostolikas N, et al. Consistency achieved by 23 European pathologists from 12 countries in diagnosing breast disease and reporting prognostic features of carcinomas. Virchows Archiv-an International Journal of Pathology 1999;434:3–10.

Lester SC, Bose S, Chen YY, et al. Protocol for the examination of specimens from patients with invasive carcinoma of the breast. Arch Pathol Lab Med 2009;133:1515–38. doi:10.1043/1543-2165-133.10.1515

R: A language and environment for statistical computing [computer program]: R Foundation for Statistical Computing, 2013.

McHugh ML. The chi-square test of independence. Biochem Med 2013;23:143–9. doi:10.11613/BM.2013.018

Kundel HL, Polansky M. Measurement of observer agreement. Radiology 2003;228:303–8. doi:10.1148/radiol.2282011860

Pantanowitz L, Sinard JH, Henricks WH, et al. Validating whole slide imaging for diagnostic purposes in pathology: guideline from the College of American Pathologists Pathology and Laboratory Quality Center. Arch Pathol Lab Med 2013;137:1710–22. doi:10.5858/arpa.2013-0093-CP

Shaw EC, Hanby AM, Wheeler K, et al. Observer agreement comparing the use of virtual slides with glass slides in the pathology review component of the POSH breast cancer cohort study. J Clin Pathol 2012;65:403–8. doi:10.1136/jclinpath-2011-200369

Haibe-Kains B, Desmedt C, Loi S, et al. A three-gene model to robustly identify breast cancer molecular subtypes. J Natl Cancer Inst 2012;104:311–25. doi:10.1093/jnci/djr545

Fan C, Oh DS, Wessels L, et al. Concordance among gene-expression-based predictors for breast cancer. N Engl J Med 2006;355:560–9. doi:10.1056/NEJMoa052933

Bartlett JM, Bayani J, Marshall A, et al. Comparing Breast Cancer Multiparameter Tests in the OPTIMA Prelim Trial: No Test Is More Equal Than the Others. J Natl Cancer Inst 2016;108:djw050. doi:10.1093/jnci/djw050

Varga Z, Diebold J, Dommann-Scherrer C, et al. How reliable is Ki-67 immunohistochemistry in grade 2 breast carcinomas? A QA study of the Swiss Working Group of Breast- and Gynecopathologists. PLoS One 2012;7:e37379. doi:10.1371/journal.pone.0037379

Zhong F, Bi R, Yu B, et al. A Comparison of Visual Assessment and Automated Digital Image Analysis of Ki67 Labeling Index in Breast Cancer. PLoS One 2016;11:e0150505. doi:10.1371/journal.pone.0150505

Schuh F, Biazús JV, Resetkova E, et al. Histopathological grading of breast ductal carcinoma in situ: validation of a web-based survey through intra-observer reproducibility analysis. Diagn Pathol 2015;10:93. doi:10.1186/s13000-015-0320-2

Dalton LW, Gerds TA. The Advantage of Discordance: An Example Using the Highly Subjective Nuclear Grading of Breast Cancer. Am J Surg Pathol 2017;41:1105–11. doi:10.1097/PAS.0000000000000886
