- Research article
- Open Access
- Open Peer Review
Does risk for ovarian malignancy algorithm excel human epididymis protein 4 and ca125 in predicting epithelial ovarian cancer: A meta-analysis
BMC Cancervolume 12, Article number: 258 (2012)
Risk for Ovarian Malignancy Algorithm (ROMA) and Human epididymis protein 4 (HE4) appear to be promising predictors for epithelial ovarian cancer (EOC), however, conflicting results exist in the diagnostic performance comparison among ROMA, HE4 and CA125.
Remote databases (MEDLINE/PUBMED, EMBASE, Web of Science, Google Scholar, the Cochrane Library and ClinicalTrials.gov) and full texts bibliography were searched for relevant abstracts. All studies included were closely assessed with the QUADAS-2 (Quality Assessment of Diagnostic Accuracy Studies-2). EOC predictive value of ROMA was systematically evaluated, and comparison among the predictive performances of ROMA, HE4 and CA125 were conducted within the same population. Sensitivity, specificity, DOR (diagnostic odds ratio), LR ± (positive and negative likelihood ratio) and AUC (area under receiver operating characteristic-curve) were summarized with a bivariate model. Subgroup analysis and sensitivity analysis were used to explore the heterogeneity.
Data of 7792 tests were retrieved from 11 studies. The overall estimates of ROMA for EOC predicting were: sensitivity (0.89, 95% CI 0.84-0.93), specificity (0.83, 95% CI 0.77-0.88), and AUC (0.93, 95% CI 0.90-0.95). Comparison of EOC predictive value between HE4 and CA125 found, specificity: HE4 (0.93, 95% CI 0.87-0.96) > CA125 (0.84, 95% CI 0.76-0.90); AUC: CA125 (0.88, 95% CI 0.85-0.91) > HE4 (0.82, 95% CI 0.78-0.85). Comparison of OC predictive value between HE4 and CA125 found, AUC: CA125 (0.89, 95% CI 0.85-0.91) > HE4 (0.79, 95% CI 0.76-0.83). Comparison among the three tests for EOC prediction found, sensitivity: ROMA (0.86, 95%CI 0.81-0.91) > HE4 (0.80, 95% CI 0.73-0.85); specificity: HE4 (0.94, 95% CI 0.90-0.96) > ROMA (0.84, 95% CI 0.79-0.88) > CA125 (0.78, 95%CI 0.73-0.83).
ROMA is helpful for distinguishing epithelial ovarian cancer from benign pelvic mass. HE4 is not better than CA125 either for EOC or OC prediction. ROMA is promising predictors of epithelial ovarian cancer to replace CA125, but its utilization requires further exploration.
Ovarian cancer is the leading cause of death from gynecologic cancers in the United States and the fifth-top cause of cancer death in women (Link 1). Non-specific clinical manifestation mainly hinders the early diagnosis of ovarian cancer. Cancer antigen 125 (CA125) was the only FDA-approved biomarker for ovarian cancer before the year 2008. CA125 is indicated for use as an aid in the detection of residual ovarian carcinoma in patients who have undergone first-line therapy and would be considered for diagnostic second-look procedures. Although the CA125 serum level elevated in 80% of epithelial ovarian cancer (EOC) patients with advanced stage , it increased in only 50% of patients with stage I EOC . In addition, CA125 serum levels elevate in various benign gynecological diseases (including endometriosis) , non-gynecologic malignancies . Therefore, considerable efforts are underway to identify new serum biomarkers, alone or combining with CA125 to improve EOC detection [6, 7].
With high-throughput technologies employed, a large number of new biomarkers have been discovered [8–10]. Human epididymis protein 4 (HE4) is among the most promising ones . High levels of HE4 are found in the serum of patients with EOC, especially in serous and endometroid cancers . Unlike CA125, HE4 doesn’t overexpress in endometriosis and other benign gynecological diseases . And HE4, as an aid in monitoring recurrence or progressive disease in patients with epithelial ovarian cancer, has been the first biomarker for EOC after CA125 to be approved by the U.S. Food and Drug Administration (FDA) at the year of 2008. However, conflicts arise on the sensitivity of HE4 and CA125 [5, 13–16].
Moore and colleagues  have explored a multianalytes assay named the Risk of Ovarian Malignancy Algorithm (ROMA™), which combines the results of HE4 EIA (enzyme immunoassay), ARCHITECT CA 125 II™ and menopausal status into a numerical score to predict malignancy when an ovarian mass was found clinically. Although ROMA™ has received clearance from the FDA of U.S. in September of the year 2011, the diagnostic accuracy of ROMA compared to CA125 and HE4 alone is still controversial [13, 16–18]. Here we try to clarify conflicting results existing in the diagnostic accuracy of ROMA, and in the performance comparison among ROMA, HE4 and CA125.
Data sources and search strategy
We followed the Meta-analysis Of Observational Studies in Epidemiology (MOOSE) and the Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy (Link 2). MEDLINE (through PubMed interface), EMBASE, Web of Science, Google Scholar, the Cochrane Library and ClinicalTrials.gov (ended on 22th December, 2011) were searched. Reference lists of articles identified were manually searched. Publication languages were not limited. The terminology for search was based on the standardized National Library of Medicine MeSH terms and free texts. The search strategies of all the databases were based on those of PubMed (Additional file 1: Table S1).
Two authors (RXT and WPL) independently screened the search results based on the titles and abstracts. The full text of selected articles were reviewed independently by another two authors (KC and LLY) to determine the inclusion. Disagreements were resolved by referring to a third author (MC).
Studies that investigated both serum HE4 and CA125 as diagnostic tests or calculated the ROMA algorithm were included if (1) they were cross-sectional studies; and (2) performed in the same population presenting pelvic mass; (3) all serum specimens were collected preoperatively; (4) all subjects with histological diagnostic information; (5) with sufficient data for reconstructing fourfold table.
Studies recruiting participants without presenting pelvis mass, with obviously error data or ROC curve analysis containing healthy person and case–control studies were excluded. Case–control studies were excluded, for these studies had a tendency of overestimating or underestimating the diagnostic performance of a test .
The data extracted from each study included: author; year; country; design; recruitment; age; menopausal status; test methods (e.g. chemilumenesence immunoassay); number of patients; sensitivity; specificity and cut-off value. Four fold tables were reconstructed. Two reviewers (FKL and RXT) independently extracted the data for each study and referred to a third opinion (MC) when disagreements appeared. Important data that were not provided in the original studies were referred to their authors through Emails.
Index tests and reference standard
Since the Risk of Ovarian Malignancy Algorithm (ROMA™) is a qualitative serum test that combines the results of HE4 EIA (enzyme immunometric assays), ARCHITECT CA 125 II™ and menopausal status into a numerical score. Index tests for HE4 and CA125 in this meta-analysis questions were specified as EIAs and chemilumenesence immunoassays respectively. ROMA algorithm is the following :
Reference standard was based on outcomes of histopathological diagnosis. In all studies, ovarian cancer surgical stages were referred to criteria from FIGO (International Federation of Gynecology and Obsterics)  (Link 3). Early stage were defined as FIGO stages I & II, while advanced stage were FIGO stages III & IV.
Methodological quality assessment
The methodological quality of each study was evaluated with QUADAS-2 (Quality Assessment of Diagnostic Accuracy Studies 2)  quality items. Overall scores were not helpful for interpreting study quality  and were avoided in studies evaluation by QUADAS-2 tool. Doubts were resolved by discussion. In the items of QUADAS-2, the blindness of index tests and reference test has been list, but not the blindness between index tests. So one item that focus on validity of this comparative question has been added in Risk of Bias part of Domain 2 (Index Test) in QUADAS-2  as follows. “Were the results of index tests interpreted without knowledge of each other?” The answers (Yes, No or Unclear) of this question were considered to help assessing the Risk of Bias of including studies. According to the suggestion in Concerns Regarding Applicability part of Domain 2 (Index Test) in QUADAS-2 , variations in test technology, executing, or interpretation might affect estimates of the diagnostic accuracy of a test. If index test methods varied from those specified in the review question, concerns about applicability might exist.
Index tests for HE4 and CA125 in this meta-analysis questions were specified as EIAs and chemilumenesence immunoassays respectively. For tests of HE4, the chemilumenesence immunoassays were more sensitive than the specified EIAs, thus bias might be introduced into pooling of studies. And similarly, for CA125, EIA and RIA (radioimmunoassay) assays were less sensitive and steady than chemilumenesence immunoassays, so studies using either EIA or RIA will be considered as High Concern Regarding Applicability. The ROMA test employed the results from tests of CA125 and HE4 within the same study. So ROMA was considered as High Concern Regarding Applicability when either HE4 or CA125 test was evaluated as High Concern Regarding Applicability.
Data analysis plan
The statistical analysis is based on the following steps: (1) qualitatively describing the findings; (2) searching for heterogeneity and threshold effect; (3) figuring out the sources of heterogeneity by subgroup analysis; (4) choosing appropriate model and pooling estimates statistically. Univariate  and bivariate model  were two choices for diagnostic meta-analysis. When a positive correlation existed between true positive rate (TPR) and false positive rate (FPR), the bivariate analysis model was more appropriate .
Heterogeneity of studies were shown with forest graphs and explored with I2 estimates . The main advantage of I2 was inherent independence with the number of the studies included in the meta-analysis. I2 estimates below 25% were regarded as low risk of heterogeneity, between 25% and 50% as moderate heterogeneity, and 50% or higher as high heterogeneity. If there was a low level heterogeneity, univariate meta-analysis model was used (Meta-DiSc software version 1.4 ). If there was a moderate to high heterogeneity, Spearman correlation coefficients was explored. Positive Spearman correlation coefficients between Logit(TPR) and Logit(FPR) denoted the presence of threshold effects (Meta-DiSc software version 1.4). Then a bivariate model as well as HSROC (Hierarchical Summary Receiver Operator Characteristics) were estimated and plotted; if negative, summary estimates were pooled without HSROC [24, 29]; and if zero, summary estimates were pooled the way same as low level heterogeneity.
Influence analysis reestimated the meta-analysis by omitting each study in turn (STATA version 10.0) to confirm the stability of our analysis model. Publication bias was investigated by Deek’s funnel plot as well as asymmetry test . Subgroups were analyzed hierarchically by menopausal status, FIGO stages and concern of methods of index tests. In some studies, patients with low malignant potential tumors (LMP) or borderline tumors (BL) were classified into EOC group. And these studies were specifically analyzed as subgroup EOC (LMP/BL). Subgroups with less than four studies were analyzed with univariate model, because the bivariate model required 4 studies at least . Summary estimates and 95% CIs (confidence intervals) for sensitivity, specificity, DOR, LR ± and AUC were calculated (STATA version 10.0 [31, 32]). HSROC (Hierarchical summary receiver operating characteristic curves) plots were shown when appropriate. Comparisons between estimates of different tests were performed with z-test.
Characteristics of the included studies were summarized (Table1). 7792 tests from 2878 patients presenting pelvic mass at risk of ovarian cancer were retrieved. Of the 11 studies, 6 studies [15, 17, 18, 34, 36, 37] enrolling 1547 patients investigated the performance of ROMA for EOC prediction. Five studies [16, 33–36] with 883 patients compared the performance of HE4 and CA125 for OC prediction. Four studies [13, 15, 18, 36] with 715 patients compared the performance of HE4 and CA125 for EOC prediction. And 3 studies [15, 18, 36] (482 patients) compared the performance among ROMA, HE4 and CA125 for EOC prediction. In all studies, the spectrum of patients was considered representative. All enrolled participants present pelvis mass of suspected ovarian origin, have never received any treatment before and plan to have a surgical intervention. The prevalence of proven ovarian cancer across all studies ranged from 7.86% to 63.1% (overall prevalence was 18.5% for EOC). The study of Holcomb and colleagues  had the lowest prevalence (7.86%) for only investigating the results of premenopausal women.
Methods of index tests
All of 11 including studies measured serum HE4 and CA125. For HE4 measurement, 8 studies [13, 16–18, 33, 35–37] used EIA (enzyme immunoassay), the other 3 studies [14, 15, 34] employed CMIA (chemiluminescent microparticle immunoassay). For the measure of CA125, 5 studies [14, 15, 17, 34, 37] employed CMIA, 3 studies [16, 35, 36] with EIA, 3 studies [13, 18, 33] used RIA (radioimmunoassay), CLEIA (chemilumenscence enzyme immunoassay) and ECLIA (electrochemilumenscence immunoassay) respectively. CMIA, CLEIA and ECLIA belonged to chemilumenesence immunoassays, which were higher sensitive than EIA or RIA. According to Methodological quality assessment (the 4th part of Methods section), HE4 tests with CMIA, CA125 tests with EIA and RIA were regarded as high Concern Regarding Applicability. The ROMA tests were considered as high Concern Regarding Applicability when either HE4 or CA125 test was evaluated as high Concern Regarding Applicability (Figure2).
Methodological quality of all included studies
Quality of included studies was assessed by the QUADAS-2 tool (Figure2 & Table2). Within 9 [13, 14, 16–18, 34–37] of 11 studies, the results interpretation of index tests (HE4/CA125) were blind with reference standard test (ROMA). The other 2 studies [15, 33] were unclear. In 5 of the 11 studies [14, 16, 34–36] the results of index tests (HE4 and CA125) were interpreted without knowledge of each other. In the other 6 studies [13, 15, 17, 18, 33, 37] the blindness was unclear. So when assessing the studies with the item “Could the conduct or interpretation of the index test have introduced bias?” in domain 2 of QUADAS-2, the results showed that 5 studies [14, 16, 34–36] were low risk of bias, 1 study  was high risk of bias and 5 studies [15, 17, 18, 33, 37] were unclear their risk of bias. Four [16–18, 34] of the total 11 studies were considered as low risk of bias for the Patient Selection (Domain 1 of QUADAS-2) for their consecutive enrollment of patients; 2 studies [14, 15] were regarded as high risk of bias and in the other 5 studies [13, 33, 35–37] the risk was unclear.
Performance of ROMA for predicting EOC
Forest plots of sensitivity and specificity of ROMA for EOC prediction were shown in Figure3.
Mean estimates and their 95%CIs were: sensitivity 0.89 (0.84- 0.93), specificity 0.83 (0.77- 0.88) and AUC 0.93 (0.90- 0.95) (Table3). High level of heterogeneity lay in both sensitivity (I2 = 71.6%) and specificity (I2 = 80.7%).
Threshold effect existed (Spearman correlation coefficient: 0.657, p = 0.156).Thus bivariate model was used to pool estimates. HSROC plots showed the summary estimates of sensitivity and specificity as well as the confidence and prediction regions (Figure4).
Subgroups analysis observed variability in pooled estimates (Table3). We have compared these estimates between subgroups to investigate the performance of ROMA. Across all subgroups, performance (AUCs) of ROMA for EOC detection ranged from 0.88 to 0.97. The ROMA performed better in EOC whole population (AUC: 0.93, 95%CI 0.90- 0.95) than in either premenopausal subgroup (EOC-preM) (AUC: 0.88, 95% CI 0.85- 0.91) or postmenopausal subgroup (EOC-postM) (AUC: 0.89, 95% CI 0.86- 0.92). And the ROMA had better performence in EOC-advanced stage group (AUC: 0.88, 95% CI 0.85- 0.91) than in both EOC whole population and EOC-early stage group (AUC: 0.88, 95% CI 0.83- 0.93). What’s more, the ROMA performed better in EOC population than in OC population (AUC: 0.89, 95% CI 0.87- 0.92).
ROMA had lower sensitivity in premenopausal subgroup (EOC-preM) (0.82, 95%CI 0.67- 0.91) than postmenopausal subgroup (EOC-postM) (0.93, 95%CI 0.89- 0.96). EOC group (0.83, 95% CI 0.77- 0.88) had higher specificity than both EOC-early stage (0.76, 95% CI 0.73- 0.79) and EOC-advanced stage (0.76, 95% CI 0.73- 0.79) groups. ROMA had higher sensitivity in EOC-advanced stage group (0.98, 95%CI: 0.94-1.00) than in EOC whole population (0.90, 95% CI 0.84- 0.93) and EOC-early stage group (0.81, 95% CI 0.71- 0.89). In addition, we found in subgroup method with Concern Regarding Applicability, ROMA had higher specificity in high Concern Regarding Applicability group (EOC-methods High concern) (0.87, 95% CI 0.83- 0.90) than both high Concern Regarding Applicability group (EOC-methods Low concern) (0.75, 95% CI 0.72- 0.78) and EOC whole population. Finally, No differences were found in other summary estimates (except AUC between EOC and OC groups) within EOC, EOC (LMP/BL) and OC groups (Table3).
The appearance of the Deeks’ funnel plot for ROMA on EOC detection was symmetrical (Additional file 2: Figure S1), and the funnel plot asymmetry test showed little sign of publication bias (regression coefficients was −3.73; p = 0.617). When single study was omitted, the summary estimates (sensitivity, specificity and DOR) were close to those obtained with all eligible studies (Figure5 & Additional file 3: Table S2).
Performance comparison between HE4 and CA125
Four studies [13, 15, 18, 36] compared the performance of HE4 and CA125 for predicting EOC (Figures 6 & 7). All the 2 groups (EOC-HE4, EOC-CA125) were analyzed in bivariate model (Figure8). CA125 had a higher AUC than HE4, while a lower specificity than HE4. No significant differences were found within other paired estimates (Table4). Five studies [16, 33–36] compared the performance of HE4 and CA125 for predicting OC (Figures 9 & 10). All the 2 groups (OC-HE4, OC-CA125) were also analyzed via bivariate model (Figure11). CA125 had a higher AUC than HE4, while no significant differences were found within other paired estimates (Table4).
Studies included also investigated the diagnostic value of HE4 and CA125 in early stage of EOC, as well as distinguishing EOC from benign pelvic mass in premenopausal and postmenopausal women. Because all these settings contained less than 3 studies, we didn’t pool them as subgroups but summarized their sensitivity specificity with forest plots (Additional file 4: Figure S2).
Performance comparison among ROMA, HE4 and CA125 for EOC prediction
Three studies evaluated the performance of HE4, CA125 and ROMA for EOC detection (Figure12). All three groups (EOC- ROMA, EOC- HE4 and EOC- CA125) were pooled with univariate model (Figure13 & Table5).
Among the three tests, HE4 had the highest specificity (0.94, 95% CI: 0.90-0.96), but a lower sensitivity (0.80, 95% CI: 0.73-0.85) than ROMA (0.86, 95% CI: 0.81-0.91). The ROMA had a higher specificity (0.84, 95% CI: 0.79-0.88) than CA125 (0.78, 95% CI: 0.73-0.83). Meanwhile no differences were found between CA125 and HE4, as well as between CA125 and ROMA in their summary sensitivity. The DOR, LR ± and AUC values were similar among the three tests (Table5).
Summary of main results
Our results found that, first, ROMA could help distinguish EOC from benign pelvic mass with a high diagnostic accuracy (AUC: 0.93). The ROMA has high sensitivity to predict advanced stage EOC than early stage EOC and in postmenopausal women than in premenopausal women. Second, although HE4 has higher specificity than CA125 for EOC monitoring, CA125 has better diagnosis accuracy (higher AUC) than HE4 for EOC or OC prediction. This is based on the results of 4 studies that compare HE4 and CA125 within the same population. Third, based on the results of comparison of HE4, CA125 and ROMA in the same population, the overall performance (AUC) of the three tests for EOC prediction are similar. ROMA is less specific but more sensitive than HE4, while both ROMA and HE4 are more specific than CA125 for EOC monitoring.
All studies included were subjected to close scrutiny with the QUADAS-2 tool, resulting in high quality across the items. Heterogeneity often existed in diagnostic meta-analysis , and mainly resulted from characteristics of the study population, variations in the study design, different statistical methods, and different covariates . Within-study quality were highly concerned in this meta-analysis. Both high level of heterogeneity in sensitivity and specificity were found for ROMA test. The existence of threshold effect might partially explain the heterogeneity. Analysis of subgroups (EOC-methods high concern and EOC-methods low concern) found the EOC-methods High concern group had higher specificity than both EOC-methods Low concern group and EOC group.
In the current paper, only three studies evaluated the diagnostic value of ROMA at early stage of EOC. The early stage ovarian cancer usually presented non-specific clinical manifestation, and the FIGO staging by surgery often resulted in low prevalence of early stage EOC. So future clinical investigations will be promising and expectant to be prospective studies recruiting enough patients with early stage EOC.
We analyzed the predictive value of ROMA for patients with EOC, EOC(LMP/BL) and ovarian cancer. No differences were found in all summary estimates (except AUC between EOC and OC groups) of EOC, EOC (LMP/BL) and OC groups. Although EOC accounted for 90% of ovarian cancer, we didn’t think ROMA could be expanded to predict ovarian cancer, for both HE4 and CA125 were biomarkers of epithelial ovarian cancer[2, 11].
Cut-off values were variable for HE4 (70-150pM) and ROMA (preM: 7.4-13.1%; postM: 10.9-27.7%), but consistent for CA125 (35U/mL) across studies. Among the studies included, only one study used specific cut-off values for premenopausal (70pM) and postmenopausal women (140pM). Studies found that HE4 levels in healthy subjects were associated with age [40, 41]. So it would be essential to define a specific normal range and cut-off value for premenopausal and postmenopausal women respectively. For other two predictors ROMA and CA125, it would also be indispensable for each center to define their normal ranges and cut-off values.
Strengths and weaknesses
Except employing a comprehensive search strategy, strict inclusion criteria and sound analysis protocol, strengths of this paper also contain that only studies investigating both the two tests (HE4 and CA125) or all three tests (HE4, CA125 and ROMA) in a same population have been included in tests comparisons. The latter makes sure that the comparison takes place between studies under the same or similar population background, thus reduces the heterogeneity between studies .
The main limitations are: (1) unable to gain the unpublished paper. (2) Study number might be small. We believe that reliability of the meta-analysis are majorly dependent on the quality of studies included. (3) The diagnostic value of ROMA, HE4 and CA125 in early stage EOC have not been convincingly analyzed.
ROMA can help distinguish EOC from benign pelvic mass. ROMA is less specific but more sensitive than HE4. Both ROMA and HE4 are more specific than CA125 for EOC prediction. CA125 has better diagnosis accuracy than HE4 for EOC and OC prediction. ROMA is promising predictor to replace CA125, but its utilization requires further exploration.
1 What are the key statistics about ovarian cancer? American Cancer Society Web site. [http://www.cancer.org/Cancer/OvarianCancer/DetailedGuide/ovarian-cancer-key-statistics]
2 Diagnostic Test Accuracy Working Group. Handbook for DTA Reviews. Cochrane Collaboration Web site. [http://srdta.cochrane.org/handbook-dta-reviews].
3 International Federation of Gynecology and Obstetrics FIGO website [http://www.figo.org/]
Risk for ovarian malignancy algorithm
Human epididymis protein 4
Epithelial ovarian cancer
Cancer antigen 125
Quality assessment of diagnostic accuracy studies-2
Diagnostic odds ratio
Positive and negative likelihood ratio
Area under receiver operating characteristic-curve
Food and drug administration
Meta-analysis of observational studies in epidemiology
International federation of gynecology and obsterics
True positive rate
False positive rate
Low malignant potential tumors
Hierarchical summary receiver operating characteristic curves
Chemiluminescent microparticle immunoassay
Chemilumenscence enzyme immunoassay
Brown PO, Palmer C: The preclinical natural history of serous ovarian cancer: defining the target for early detection. PLoS Med. 2009, 6: e1000114-10.1371/journal.pmed.1000114.
Davis HM, Zurawski VR, Bast RC, Klug TL: Characterization of the CA 125 antigen associated with human epithelial ovarian carcinomas. Cancer Res. 1986, 46: 6143-6148.
Bast RC, Badgwell D, Lu Z, Marquez R, Rosen D, Liu J, Baggerly KA, Atkinson EN, Skates S, Zhang Z, et al: New tumor markers: CA125 and beyond. Int J Gynecol Cancer. 2005, 15: 274-281. 10.1111/j.1525-1438.2005.00441.x.
Niloff JM, Klug TL, Schaetzl E, Zurawski VR, Knapp RC, Bast RC: Elevation of serum CA125 in carcinomas of the fallopian tube, endometrium, and endocervix. Am J Obstet Gynecol. 1984, 148: 1057-1058.
Park Y, Lee JH, Hong DJ, Lee EY, Kim HS: Diagnostic performances of HE4 and CA125 for the detection of ovarian cancer from patients with various gynecologic and non-gynecologic diseases. Clin Biochem. 2011, 44: 884-888. 10.1016/j.clinbiochem.2011.04.011.
Yurkovetsky Z, Skates S, Lomakin A, Nolen B, Pulsipher T, Modugno F, Marks J, Godwin A, Gorelik E, Jacobs I, et al: Development of a multimarker assay for early detection of ovarian cancer. J Clin Oncol. 2010, 28: 2159-2166. 10.1200/JCO.2008.19.2484.
Cree I: Improved blood tests for cancer screening: general or specific?. BMC Cancer. 2011, 11: 499-10.1186/1471-2407-11-499.
Kulasingam V, Pavlou MP, Diamandis EP: Integrating high-throughput technologies in the quest for effective biomarkers for ovarian cancer. Nat Rev Cancer. 2010, 10: 371-378. 10.1038/nrc2831.
Lu R, Sun M, Feng J, Gao X, Guo L: Myofibrillogenesis regulator 1 (MR-1) is a novel biomarker and potential therapeutic target for human ovarian cancer. BMC Cancer. 2011, 11: 270-10.1186/1471-2407-11-270.
Kothandaraman N, Bajic VB, Brendan PN, Huak CY, Keow PB, Razvi K, Salto-Tellez M, Choolani M: E2F5 status significantly improves malignancy diagnosis of epithelial ovarian cancer. BMC Cancer. 2010, 10: 64-10.1186/1471-2407-10-64.
Helistrom I, Raycraft J, Hayden-Ledbetter M, Ledbetter JA, Schummer M, McIntosh M, Drescher C, Urban N, Hellstrom KE: The HE4 (WFDC2) protein is a biomarker for ovarian carcinoma. Cancer Res. 2003, 63: 3695-3700.
Drapkin R, von Horsten HH, Lin Y, Mok SC, Crum CP, Welch WR, Hecht JL: Human epididymis protein 4 (HE4) is a secreted glycoprotein that is overexpressed by serous and endometrioid ovarian carcinomas. Cancer Res. 2005, 65: 2162-2169. 10.1158/0008-5472.CAN-04-3924.
Moore RG, Brown AK, Miller MC, Skates S, Allard WJ, Verch T, Steinhoff M, Messerlian G, DiSilvestro P, Granai CO, Bast RC: The use of multiple novel tumor biomarkers for the detection of ovarian carcinoma in patients with a pelvic mass. Gynecol Oncol. 2008, 108: 402-408. 10.1016/j.ygyno.2007.10.017.
Holcomb K, Vucetic Z, Miller MC, Knapp RC: Human epididymis protein 4 offers superior specificity in the differentiation of benign and malignant adnexal masses in premenopausal women. Am J Obstet Gynecol. 2011, 205 (358): e351-e356.
Bandiera E, Romani C, Specchia C, Zanotti L, Galli C, Ruggeri G, Tognon G, Bignotti E, Tassi RA, Odicino F, et al: Serum human epididymis protein 4 and risk for ovarian malignancy algorithm as new diagnostic and prognostic tools for epithelial ovarian cancer management. Cancer Epidemiol Biomarkers Prev. 2011, 20: 2496-2506. 10.1158/1055-9965.EPI-11-0635.
Van Gorp T, Cadron I, Despierre E, Daemen A, Leunen K, Amant F, Timmerman D, De Moor B, Vergote I: HE4 and CA125 as a diagnostic test in ovarian cancer: prospective validation of the risk of ovarian malignancy algorithm. Br J Cancer. 2011, 104: 863-870. 10.1038/sj.bjc.6606092.
Moore RG, McMeekin DS, Brown AK, DiSilvestro P, Miller MC, Allard WJ, Gajewski W, Kurman R, Bast RC, Skates SJ: A novel multiple marker bioassay utilizing HE4 and CA125 for the prediction of ovarian cancer in patients with a pelvic mass. Gynecol Oncol. 2009, 112: 40-46. 10.1016/j.ygyno.2008.08.031.
Montagnana M, Danese E, Ruzzenente O, Bresciani V, Nuzzo T, Gelati M, Salvagno GL, Franchi M, Lippi G, Guidi GC: The ROMA (Risk of Ovarian Malignancy Algorithm) for estimating the risk of epithelial ovarian cancer in women presenting with pelvic mass: is it really useful?. Clin Chem Lab Med. 2011, 49: 521-525.
Stroup DF, Berlin JA, Morton SC, Olkin I, Williamson GD, Rennie D, Moher D, Becker BJ, Sipe TA, Thacker SB: Meta-analysis of observational studies in epidemiology: a proposal for reporting. Meta-analysis Of Observational Studies in Epidemiology (MOOSE) group. JAMA. 2000, 283: 2008-2012. 10.1001/jama.283.15.2008.
Lijmer JG, Mol BW, Heisterkamp S, Bonsel GJ, Prins MH, van der Meulen JHP, Bossuyt PMM: Empirical evidence of design-related bias in studies of diagnostic tests. JAMA. 1999, 282: 1061-1066. 10.1001/jama.282.11.1061.
Benedet JL, Pecorelli S: Why cancer staging? FIGO 26th Annual Report on the Results of Treatment in Gynecological Cancer. Int J Gynaecol Obstet. 2006, 95 (Suppl 1): S3-
Whiting PF, Rutjes AWS, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, Leeflang MMG, Sterne JAC, Bossuyt PMM: Group tQ-: QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011, 155: 529-536.
Whiting P, Harbord R, Kleijnen J: No role for quality scores in systematic reviews of diagnostic accuracy studies. BMC Med Res Methodol. 2005, 5: 19-10.1186/1471-2288-5-19.
Chappell FM, Raab GM, Wardlaw JM: When are summary ROC curves appropriate for diagnostic meta-analyses?. Stat Med. 2009, 28: 2653-2668. 10.1002/sim.3631.
Rutter CM, Gatsonis CA: A hierarchical regression approach to meta-analysis of diagnostic test accuracy evaluations. Stat Med. 2001, 20: 2865-2884. 10.1002/sim.942.
Reitsma JB, Glas AS, Rutjes AW, Scholten RJ, Bossuyt PM, Zwinderman AH: Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviews. J Clin Epidemiol. 2005, 58: 982-990. 10.1016/j.jclinepi.2005.02.022.
Higgins JP, Thompson SG, Deeks JJ, Altman DG: Measuring inconsistency in meta-analyses. BMJ. 2003, 327: 557-560. 10.1136/bmj.327.7414.557.
Zamora J, Abraira V, Muriel A, Khan K, Coomarasamy A: Meta-DiSc: a software for meta-analysis of test accuracy data. BMC Med Res Methodol. 2006, 6: 31-10.1186/1471-2288-6-31.
Leeflang MMG, Deeks JJ, Gatsonis C, Bossuyt PMM: Group obotCDTAW: Systematic reviews of diagnostic test accuracy. Ann Intern Med. 2008, 149: 889-897.
Deeks JJ, Macaskill P, Irwig L: The performance of tests of publication bias and other sample size effects in systematic reviews of diagnostic test accuracy was assessed. J Clin Epidemiol. 2005, 58: 882-893. 10.1016/j.jclinepi.2005.01.016.
Dwamena BA: midas: Computational and Graphical Routines for Meta-analytical Integration of Diagnostic Accuracy Studies in Stata. 2007, Division of Nuclear Medicine, Department of Radiology, University of Michigan Medical School, Ann Arbor
Roger M, Harbord PW: metandi: Meta-analysis of diagnostic accuracy using hierarchical logistic regression. Stata J. 2009, 9: 19-
Abdel-Azeez HA, Labib HA, Sharaf SM, Refaie AN: HE4 and mesothelin: novel biomarkers of ovarian carcinoma in patients with pelvic masses. Asian Pac J Cancer Prev. 2010, 11: 111-116.
Kim YM, Whang DH, Park J, Kim SH, Lee SW, Park HA, Ha M, Choi KH: Evaluation of the accuracy of serum human epididymis protein 4 in combination with CA125 for detecting ovarian cancer: a prospective case–control study in a Korean population. Clin Chem Lab Med. 2011, 49: 527-534.
Chang X, Ye X, Dong L, Cheng H, Cheng Y, Zhu L, Liao Q, Zhao Y, Tian L, Fu T, et al: Human epididymis protein 4 (HE4) as a serum tumor biomarker in patients with ovarian carcinoma. Int J Gynecol Cancer. 2011, 21: 852-858. 10.1097/IGC.0b013e31821a3726.
Jacob F, Meier M, Caduff R, Goldstein D, Pochechueva T, Hacker N, Fink D, Heinzelmann-Schwarz V: No benefit from combining HE4 and CA125 as ovarian tumor markers in a clinical setting. Gynecol Oncol. 2011, 121: 487-491. 10.1016/j.ygyno.2011.02.022.
Moore RG, Miller MC, Disilvestro P, Landrum LM, Gajewski W, Ball JJ, Skates SJ: Evaluation of the diagnostic accuracy of the risk of ovarian malignancy algorithm in women with a pelvic mass. Obstet Gynecol. 2011, 118: 280-288. 10.1097/AOG.0b013e318224fce2.
Harbord RM, Deeks JJ, Egger M, Whiting P, Sterne JA: A unification of models for meta-analysis of diagnostic accuracy studies. Biostatistics. 2007, 8: 239-251.
Dinnes J, Deeks J, Kirby J, Roderick P: A methodological review of how heterogeneity has been examined in systematic reviews of diagnostic test accuracy. Health Technol Assess. 2005, 9 (iii): 1-113.
Bolstad N, Oijordsbakken M, Nustad K, Bjerner J: Human epididymis protein 4 reference limits and natural variation in a Nordic reference population. Tumour Biol. 2012, 33: 141-148. 10.1007/s13277-011-0256-4.
Galgano MT, Hampton GM, Frierson HF: Comprehensive analysis of HE4 expression in normal and malignant human tissues. Mod Pathol. 2006, 19: 847-853.
Zhou Y, Yin X, Ying J, Zhang B: Golgi protein 73 versus alpha-fetoprotein as a biomarker for hepatocellular carcinoma: a diagnostic meta- analysis. BMC Cancer. 2012, 12: 17-10.1186/1471-2407-12-17.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2407/12/258/prepub
The authors wish to appreciate Dr. Richard G. Moore; Dr. Toon Van Gorp; Dr. Montagnana Martina; Dr. Yong Man Kim; Dr. Dong Hee Whang; Dr. Viola Heinzelmann; Dr. Elisabetta Bandiera; Dr. Hyon-Suk Kim and Dr. Yongjung Park, for their kindly providing extra information or confirming the data in their original papers. Thanks Miss Hai-rong Zhao, librarian of Library of Third Military Medical University, for helping establish the searching strategies. Also thanks Dr. Toon Van Gorp, Dr. Mariska M. G. Leeflang and Dr. Anna Lokshin for their critical review and constructive comments about this manuscript.
Lili Yu declared a grant (NO. 81070505) from National Natural Science Foundation of China. For other authors none were declared.
FKL conceived the study, searched databases, extracted the data, performed the statistical analysis and drafted the manuscript. RXT selected the trials, extracted the data and helped draft the manuscript. KC and WPL participated in the selection of trials and the statistical analysis. FW searched databases, evaluated the studies included and revised the manuscript critically. SLD evaluated the studies included, participated in the coordination and revised the manuscript critically. LLY participated in the design, selected the trials and revised the manuscript critically. MC conceived of the study together with FKL, participated in the design and helped draft the manuscript. All authors read and approved the final manuscript.
Fake Li, Ruxiu Tie contributed equally to this work.
Electronic supplementary material
Additional file 3 : Table S2. Influence analysis of individual studies for diagnostic performance of ROMA. Estimates were pooled by bivariate model. Excluding any individual study only a small change were resulted in the sensitivity (sen), specificity (spe) or diagnostic odds ratio (DOR) compared with all eligible studies. All differences were not significant (p > 0.05). (DOC 40 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.