Formation of translational risk score based on correlation coefficients as an alternative to Cox regression models for predicting outcome in patients with NSCLC
- Wolfgang Kössler†1,
- Anette Fiebeler†2,
- Arnulf Willms3,
- Tina ElAidi4,
- Bernd Klosterhalfen5 and
- Uwe Klinge6
© Kössler et al; licensee BioMed Central Ltd. 2011
Received: 26 April 2011
Accepted: 27 July 2011
Published: 27 July 2011
Personalised cancer therapy, such as that used for bronchial carcinoma (BC), requires treatment to be adjusted to the patient's status. Individual risk for progression is estimated from clinical and molecular-biological data using translational score systems. Additional molecular information can improve outcome prediction, depending on the marker used and the algorithm applied. Two models, one based on regression and the other on correlations, were used to investigate the effect of combining various items of prognostic information into a comprehensive score. The correlation-based approach offers a more plausible selection of variables for modelling and is considered an improvement on classical regression analysis.
Clinical data concerning 63 BC patients were used to investigate the expression pattern of five tumour-associated proteins. Significant impact on survival was determined using log-rank tests. Significant variables were integrated into a Cox regression model and a new variable called integrative score of individual risk (ISIR), based on Spearman's correlations, was obtained.
High tumour stage (TNM) was predictive for poor survival, while CD68 and Gas6 protein expression correlated with a favourable outcome. Cox regression model analysis predicted outcome more accurately than using each variable in isolation, and correctly classified 84% of patients as having a clear risk status. Calculation of the integrated score for an individual risk (ISIR), considering tumour size (T), lymph node status (N), metastasis (M), Gas6 and CD68 identified 82% of patients as having a clear risk status.
Combining protein expression analysis of CD68 and Gas6 with T, N and M, using Cox regression or ISIR, improves prediction. Given the increasing number of molecular markers, subsequent studies will be required to validate translational algorithms for their ability to select variables with high prognostic power; the use of correlations offers improved prediction.
Bronchial cancer, a common malignant tumour in the western world, presents as non-small cell lung cancer (NSCLC) in more than 85% of cases. It is the leading cause of mortality among malignant disorders, and its incidence is increasing. The underlying pathology is complex, and numerous proteins whose expression is altered compared with healthy surrounding lung tissue have been described as prognostic markers. The expression pattern of epidermal growth factor receptor (EGFR) can determine outcome and is used to influence individual therapy [4, 5]. However, only a subset of patients benefit from this specifically targeted therapy because they have a specific mutation. Therefore, marker constellations that predict the risk for recurrence and can aid individually targeted treatment would be advantageous for the majority of patients. Despite progress in microscopic and molecular analyses, the TNM grading scale, which considers the tumour, nodes and metastases, is still the preferred classification scheme for malignancies. However, growing knowledge of the many factors thought to improve or worsen prognosis leaves the medical community facing a major challenge: to define the prognostic impact of a patient's individual constellation.
An increasing number of biomarkers that reflect the distinct aggressiveness of tumours have been identified. Therefore, they are assumed to predict a patient's risk of tumour progression. For example, the Carmeliet group recently published results that underline the promoting role of a small protein, growth arrest specific protein 6 (Gas6), for tumour metastasis in mice. Previously, McCormack et al. demonstrated that Gas6 expression was positively correlated with favourable prognostic variables in human breast cancer. An accumulation of tumour associated macrophages (TAM) in the stroma of a tumour may serve as an immunological indicator of the defence capability of a host. However, its consequence for survival may be divergent, promoting a good or bad prognosis.
Considering the complex interactions within tumours, it is unlikely that one single marker will be sufficient to predict outcome. Therefore, prediction of prognosis will rely on a combination of numerous clinical data concerning the individual patient, particularly information relating to biomarkers. However, translational integration of this large amount of information into one risk assessment is a major challenge. A multiple regression model derived from available data is the current method used to estimate prognosis for a patient. However, the selection of variables is significantly influenced by the choice of the underlying model. As a possible alternative or supplement, this study employed correlations with survival to select variables, and weighted the individual status of each, resulting in an integrated score for an individual risk (ISIR). The resulting ISIR score should predict the outcome, reflecting the individual balance between significant aggressive and protective factors.
To evaluate ISIR, the course of non-small cell lung cancer (NSCLC) was investigated in 63 consecutive patients. In addition to TNM, the expression of several proteins involved in tumourigenesis, particularly Gas6, and the number of infiltrating macrophages (CD68) were analysed. In addition, the proteins Notch3, MMP2 and COX2 were examined to confirm their roles during chronic inflammation and foreign body responses. Each variable was analyzed individually for its prognostic value and subjected to multiple Cox regression analysis. The potential of the newly developed ISIR to predict outcome was evaluated by calculating receiver operating characteristic (ROC) curves and the area under the curve (AUC). The validity of the model was evaluated using leave-one-out cross-validation.
Materials and methods
The course of 63 patients with NSCLC who were subjected to an operation between 2000 and 2002 was investigated. The local ethical committee approved the study and written, informed consent was obtained from participants. Clinical data included tumour grading according to TNM, level of resection R, histology, gender and age.
Tumour sections were evaluated for histology and protein expression by three independent experts. To characterise the tumour-host interaction, the following antibodies were used: CD68 mouse monoclonal antibody (Dako), Gas6 polyclonal anti-goat antibody (Santa Cruz), Notch3 polyclonal anti-goat antibody (Santa Cruz), Cox2 polyclonal rabbit antibody (DCS Innovative Diagnostic Systems), MMP2 polyclonal rabbit antibody (Biomol). As secondary antibody we used biotinylated goat anti-rabbit for Cox2 and MMP2, goat anti-mouse for CD68, and rabbit anti-goat for Notch3 and GAS 6 (all obtained from Dako).
For semi-quantitative analysis, a grading scale was used: 1 indicated very weak staining (<5% cells), 2 indicated weak (5-30%), 3 specified good (30-80%), and 4 indicated a strong (>80%) staining signal. For each marker, a minimum of five view fields were analyzed.
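The grading scale can be expressed as a small lookup. The following is a minimal sketch only: the function name `staining_grade` is hypothetical, and the treatment of the boundary values (exactly 5%, 30% and 80%) is an assumption, because the text does not state to which grade a boundary value belongs.

```python
def staining_grade(percent_stained):
    """Map the percentage of stained cells to the semi-quantitative grade 1-4.

    Assumption (not stated in the text): the upper bound of each range is
    inclusive, so 30% is graded 2 and 80% is graded 3.
    """
    if not 0 <= percent_stained <= 100:
        raise ValueError("percentage must lie between 0 and 100")
    if percent_stained < 5:
        return 1  # very weak staining
    if percent_stained <= 30:
        return 2  # weak staining
    if percent_stained <= 80:
        return 3  # good staining
    return 4      # strong staining
```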
Simple descriptive statistics were computed for squamous cell carcinoma (SCC) and adenocarcinoma (AC), separately. Tests concerning significant differences between the two groups were carried out using a χ² test for homogeneity and Fisher's exact test. For age and survival, nonparametric confidence intervals were calculated.
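For a 2 x 2 table, such as SCC versus AC against a binary characteristic, Fisher's exact test can be computed directly from the hypergeometric distribution. The sketch below is illustrative only: the function name is hypothetical, it uses one common two-sided convention (summing the probabilities of all tables that are no more likely than the observed one), and it is not the SPSS or SAS implementation used in the study.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2 x 2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total = row1 + row2

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / comb(total, col1)

    p_observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Small relative tolerance guards against floating-point ties.
    return sum(prob(x) for x in range(lowest, highest + 1)
               if prob(x) <= p_observed * (1 + 1e-9))
```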
Each marker was considered in isolation and Kaplan-Meier curves for the various realizations were generated. Furthermore, log-rank tests were performed to compare survival times. Spearman correlation coefficients between survival and the various variables were computed; a p-value < 0.05 was considered significant. All variables with significant negative or positive correlations to survival time were selected for calculation of the ISIR.
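The selection step can be sketched as follows. This is a minimal illustration under stated assumptions: the data are invented, the function names are hypothetical, and the p-value is obtained from a permutation test, which is a stand-in for whatever test SPSS or SAS applied in the study.

```python
import random

def average_ranks(values):
    """Return 1-based ranks; tied values receive the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1  # mean 1-based position of the tie group
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks

def pearson(x, y):
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant variable carries no rank information
    return sxy / (sxx * syy) ** 0.5

def spearman(x, y):
    """Spearman's rho: the Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))

def permutation_p_value(x, y, n_perm=2000, seed=0):
    """Two-sided permutation p-value for Spearman's rho (stand-in test)."""
    rng = random.Random(seed)
    observed = abs(spearman(x, y))
    shuffled = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(spearman(x, shuffled)) >= observed - 1e-12:
            hits += 1
    return (hits + 1) / (n_perm + 1)

# Invented example data: tumour size T and survival time in months.
tumour_size = [1, 2, 2, 3, 3, 4, 4, 4]
survival_months = [80, 70, 65, 40, 30, 12, 10, 8]
rho = spearman(tumour_size, survival_months)
selected = permutation_p_value(tumour_size, survival_months) < 0.05
```

In this invented example the correlation is negative, so T would enter the score as an aggressive variable; a variable with a significant positive correlation would enter as a protective one.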
Inserting the realizations of the variables for any patient resulted in an individual ISIR score, with large values for ISIR indicating high risk.
For the evaluation of ISIR a classification table of prognosis was computed and, as reported by Chen et al., three survival groups were defined: ≤ 12, between 12 and 60, and ≥ 60 months. Furthermore, three ISIR classes were defined, where ISIR ≤ 0.25 denotes low risk, ≥ 0.5 high risk, and ISIR between 0.25 and 0.5 intermediate risk. The Spearman correlation of ISIR to survival was calculated, and scatter plots of the two variables were generated. Classification tables were computed with estimates of the sensitivities and specificities. In integrating all features of interest into ISIR, the fact that the different variables have different scales (0 to 3 for N, 1 and 2 for M and H, 1-4 for the others) had to be taken into consideration. Therefore, each variable was divided by the number of its possible realizations (i.e. by two for M and H, by four for the others).
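The cross-tabulation of ISIR classes against survival groups can be sketched as below. The cut-offs are those stated in this paragraph; the function names and the example patients are invented for illustration and are not study data.

```python
from collections import Counter

def survival_group(months):
    """Survival groups as defined in the text: <= 12, 12 to 60, >= 60 months."""
    if months <= 12:
        return "<=12"
    if months >= 60:
        return ">=60"
    return "12-60"

def isir_class(isir, low_cut=0.25, high_cut=0.5):
    """ISIR classes using the cut-offs stated in this paragraph."""
    if isir <= low_cut:
        return "low"
    if isir >= high_cut:
        return "high"
    return "intermediate"

def classification_table(patients):
    """Count patients per (ISIR class, survival group) cell.

    `patients` is an iterable of (isir, survival_months) pairs.
    """
    return Counter((isir_class(score), survival_group(months))
                   for score, months in patients)

# Invented example patients: (ISIR, survival in months).
example = [(0.9, 6), (0.7, 10), (0.3, 30), (0.1, 72), (0.2, 65), (0.6, 61)]
table = classification_table(example)
```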
To emphasize the power of ISIR, it was compared with the well-established Cox method. Cox regression is based on the so-called proportional hazards model (the Cox model) λ(t, X) = λ0(t) exp(Xβ), where λ(t, X) is the hazard rate at time point t given the vector X of covariates. The baseline hazard λ0(t) and the vector β of regression coefficients are estimated. Automatic backward variable selection is very commonly used; variables are removed from the model when p > 0.05.
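One consequence of the proportional hazards form is worth making explicit. For two patients with covariate vectors X_a and X_b (symbols introduced here for illustration only), the baseline hazard cancels, so the hazard ratio depends only on the difference between their linear predictors:

```latex
\frac{\lambda(t, X_a)}{\lambda(t, X_b)}
  = \frac{\lambda_0(t)\,\exp(X_a \beta)}{\lambda_0(t)\,\exp(X_b \beta)}
  = \exp\bigl((X_a - X_b)\,\beta\bigr)
```

This is why the linear predictor Xβ alone can be used to rank patients by risk without knowledge of the baseline hazard.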
The statistical analysis was carried out using the Statistical Package for Social Sciences Software (SPSS, vers. 17.0) and with the Statistical Analysis System (SAS, vers. 9.2).
Descriptive statistics for the patients (table; only the row and column labels survive: squamous cell carcinoma; tumour size T; nodal status N; staining grade I: < 5%; survival status at census; survival time in months, given as medians with nonparametric 95% confidence intervals).
Spearman correlation of survival and AUC for various variables (ability to differentiate between survival of ≤ 12 months and ≥ 60 months).
Expression patterns of Gas6 and CD68
Integrated Score for an Individual Risk (ISIR)
Assessing risk as a balance of collaborating aggressive and protective variables, the ISIR was calculated as a ratio of weighted sums of significant aggressive (in view of patient survival; from our data T, N, M) and protective (CD68, Gas6) variables. The status of censoring was ignored, but for the present data long survival times were evident for all censored observations. Therefore, the effect of censoring was minimal.
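A ratio of this kind can be sketched as follows. This is an assumption-laden illustration, not the authors' exact formula: the weights are hypothetical placeholders for the absolute Spearman correlations with survival (the actual coefficients are not given in this text), and any further normalisation the authors may have applied is not reproduced. The scaling by the number of possible realizations follows the methods description.

```python
# Number of possible realizations per variable (two for M, four for the others).
LEVELS = {"T": 4, "N": 4, "M": 2, "CD68": 4, "Gas6": 4}

# HYPOTHETICAL weights standing in for |Spearman correlation with survival|.
WEIGHTS = {"T": 0.4, "N": 0.3, "M": 0.3, "CD68": 0.4, "Gas6": 0.3}

AGGRESSIVE = ("T", "N", "M")   # negatively correlated with survival
PROTECTIVE = ("CD68", "Gas6")  # positively correlated with survival

def weighted_sum(patient, names):
    """Sum of weight * scaled value over the named variables."""
    return sum(WEIGHTS[name] * patient[name] / LEVELS[name] for name in names)

def isir(patient):
    """Ratio of aggressive to protective weighted sums; large means high risk.

    The protective grades start at 1, so the denominator is always positive.
    """
    return weighted_sum(patient, AGGRESSIVE) / weighted_sum(patient, PROTECTIVE)

# Invented patients for illustration.
favourable = {"T": 1, "N": 0, "M": 1, "CD68": 4, "Gas6": 4}
unfavourable = {"T": 4, "N": 3, "M": 2, "CD68": 1, "Gas6": 1}
```

With these placeholder weights the unfavourable constellation receives a much larger score than the favourable one, which is the intended direction of the score.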
Survival of patients assessed with ISIR (table; only the labels survive: survival classes t ≤ 12 and 12 < t < 60 months; risk classes low risk, ISIR < 0.4; 0.4 ≤ ISIR ≤ 0.8; high risk, ISIR > 0.8).
Sensitivities and specificities of the ISIR and Cox methods (table; only the labels survive: column "prognosis not defined" for 12 < t < 60 months; rows ISIR > 0.5 (n = 42), ISIR ≤ 0.5 (n = 19), Cox > -5.5 (n = 38), Cox ≤ -5.5 (n = 24)).
The regression parameter β = (β1, ..., βk) in the proportional hazards model (Cox model) was estimated using the method of maximum likelihood, with the procedure PHREG from the SAS software. Backward selection was used, and variables remained in the model if the corresponding p-value was less than 0.05. The remaining variables were (together with their estimated regression coefficients): T (0.88), CD68 (-1.60), Gas6 (-0.78), histology (0.68) and Notch3 (-0.80). Perhaps somewhat surprisingly, M and N were not significant in the Cox model. Large values of the Cox score Xβ indicate short survival.
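With the reported coefficients, a Cox score for an individual patient can be sketched as a plain linear predictor. Several points are assumptions: that the variables enter on their raw coded scales without centring, that histology is coded 1 or 2 (which histology is which is not stated here), and that the cut-off of -5.5 reported in the text applies to this uncentred score. The example patients are invented.

```python
# Regression coefficients as reported in the text.
COEFFICIENTS = {
    "T": 0.88,
    "CD68": -1.60,
    "Gas6": -0.78,
    "histology": 0.68,
    "Notch3": -0.80,
}

CUT_OFF = -5.5  # cut-off reported in the text; scores above it indicate high risk

def cox_score(patient):
    """Linear predictor X * beta (assumed uncentred, raw coded values)."""
    return sum(COEFFICIENTS[name] * patient[name] for name in COEFFICIENTS)

def is_high_risk(patient, cut_off=CUT_OFF):
    return cox_score(patient) > cut_off

# Invented patients for illustration.
favourable = {"T": 1, "CD68": 3, "Gas6": 3, "histology": 1, "Notch3": 3}
unfavourable = {"T": 4, "CD68": 1, "Gas6": 2, "histology": 2, "Notch3": 1}
```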
Patient survival according to Cox classification (table; only the labels survive: survival classes t ≤ 12 and 12 < t < 60 months; risk classes low risk, Cox < -6; -6 ≤ Cox ≤ -4.5; high risk, Cox > -4.5).
The cut-off value for Cox was -5.5 (cf. Table 4). Using this cut-off, 32 of the 38 cross-validated patients in the survival classes ≤ 12 and ≥ 60 months (14/17 and 18/21, respectively) were classified correctly.
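The arithmetic behind these proportions is shown below. Assigning 14/17 to the short-survival class and 18/21 to the long-survival class follows the order in which the figures are listed and is an assumption; treating short survival as the "positive" class is likewise a labelling choice made here.

```python
# Counts as reported in the text; the assignment to classes is assumed.
correct_short, total_short = 14, 17  # survival <= 12 months
correct_long, total_long = 18, 21    # survival >= 60 months

sensitivity = correct_short / total_short  # short survivors correctly flagged
specificity = correct_long / total_long    # long survivors correctly flagged
accuracy = (correct_short + correct_long) / (total_short + total_long)
```

The overall proportion, 32/38, rounds to the 84% quoted for the Cox model.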
Response to therapy and the corresponding outcome of patients with bronchial carcinoma varies considerably, underlining the requirement for a personalised approach. For the most part, the individual risk profile is estimated from clinical information such as tumour stage. However, rapid advances in biomarker research suggest that tumour aggressiveness and immunological competence of the host must be considered. An increasing number of biomarkers are available for the differentiation of subgroups; the impact of each, whether positive or negative, is predominantly defined by comparisons between patients with a similar TNM status. Considering that several factors influence prognosis and the huge variety of individual constellations, an algorithm to form integrative risk scores is required.
This study confirmed that survival after resection of a non-small cell lung cancer is significantly reduced when the TNM stage is higher; in contrast, marked expression of CD68 and Gas6 as biological markers of the tumour's inflammatory reaction was associated with a favourable outcome. Furthermore, compared with individual markers, integrative models comprising clinical and molecular information provided a higher predictive power to estimate patient prognosis, regardless of whether correlation or regression analysis was used.
In an attempt to characterize the immunological defence of the host, the expression of various proteins involved in numerous physiological pathways related to inflammation and remodelling was analysed. Whether increased expression reflects a favourable outcome is open to debate. For example, expression of Gas6 appears to be beneficial for breast cancer patients but indicates poor prognosis for gastric cancer [8, 14, 15]. For tumour-associated macrophages (TAM) several functions have been described [16, 17]. The observations presented herein are in line with those of Ohri et al. and Kawai et al.; each group observed an improved prognosis related to CD68 expression in NSCLC [18, 19]. The expression of Notch was significantly related to longer survival in the Cox model. This agrees with the observation of Dang et al., who described over-expression of Notch in NSCLC. However, it is in contrast to the findings of Konishi et al., who reported that MRK-003 inhibited Notch3 signalling, reduced tumour cell proliferation and induced apoptosis in human lung cancer, indicating that reduced Notch expression may be advantageous to the patient. In summary, indicators of tumour and host biology such as Gas6, CD68 and Notch are helpful for improving the prediction of prognosis in NSCLC, but MMP2 and Cox2 were of no clinical value in the present study. No single factor could provide sufficient predictive power. However, CD68 and Gas6 expression may provide valuable information for an overall assessment of patient risk.
The increase in information thought to be relevant to a patient's prognosis makes it very difficult to estimate the individual's outcome without condensing all the factors into an integrative risk score. However, research is required into how the best variables for modelling should be selected, and how they should be weighted for optimum prediction of the patient's individual outcome.
Currently, Cox regression is the gold standard for prognostic modelling in cancer [10, 22]. However, the selection of potentially influential variables largely depends on the type of optimization and is often unrelated to clinical experience. Cox regression usually results in an abstract algorithm, which is optimised for prediction in a defined collective and can hardly be repeated with distinct cohorts. Whereas the predictive power of any single variable including tumour size was limited, integration of molecular information into a unifying Cox score identified 84% of patients (32 of 38) with a clear prognosis, good or bad. Backward variable selection in a Cox model verified tumour size and histology, and the three molecular markers CD68, Gas6, and Notch3, as relevant factors. TNM had a significant impact on survival using univariate tests, but there was no significant effect of N and M in the Cox model, which is in accordance with the observation of Tsui et al. for renal cell carcinoma. Using a multiple analysis with a Cox proportional hazards model, these authors discovered that tumour stage demonstrated no independent impact on renal cell carcinoma prognosis. In a Cox model to predict survival of patients with gastric cancer, no independently significant relevance of UICC stage was apparent.
The ISIR is a simple and easily extendable score. The use of correlation coefficients for selecting and weighting the variables is based on the assumption that any close functional linkage to survival is reflected by significant correlations, negative in the case of shortening survival and positive when indicating longer survival. In fact, a scoring system that uses correlations predicted outcome about as well as modelling based on Cox regression. ISIR identified 82% of patients with a clearly bad or good prognosis using significant correlations with survival time, with T, N and M being aggressive factors and CD68 and Gas6 being protective factors. By including information relating to molecular markers and clinical stage, the prediction of five-year survival was significantly better than that obtained with each single marker, reaching an area under the curve (AUC) of 0.90, which reflects an acceptable predictive power [11, 26, 27]. Extended gene profiling using microarrays may not achieve better outcome prediction; e.g. in breast cancer, microarrays achieved AUC values in the range of 0.6 - 0.8.
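For readers unfamiliar with the AUC, it can be computed as the proportion of (short survivor, long survivor) pairs in which the short survivor has the higher risk score, with ties counted as one half. The sketch below illustrates this definition; the function name and the scores in the usage note are invented, and this is not the SPSS or SAS routine used in the study.

```python
def auc(scores_short_survival, scores_long_survival):
    """Area under the ROC curve via pairwise comparison of risk scores.

    A pair counts 1 if the short survivor has the higher score,
    0.5 if the scores tie, and 0 otherwise.
    """
    wins = 0.0
    for short_score in scores_short_survival:
        for long_score in scores_long_survival:
            if short_score > long_score:
                wins += 1.0
            elif short_score == long_score:
                wins += 0.5
    return wins / (len(scores_short_survival) * len(scores_long_survival))
```

For example, `auc([0.9, 0.7], [0.2, 0.1])` gives perfect separation, whereas two groups with identical score distributions give a value of 0.5.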
The ISIR score considers the number of variables and the number of possible expression levels. Furthermore, standardisation should help to define general cut-offs that can be transferred to other collectives. However, in the present ISIR, possible close interdependencies among the variables were not considered. Therefore, the impact of a compound may be overestimated in the case of closely linked variables with similar functions. It should be noted that ISIR (and Cox) were evaluated using cross-validation. Therefore, the reported estimates of specificity and sensitivity can be regarded as unbiased.
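The principle of leave-one-out cross-validation can be sketched as follows: each patient is classified by a rule fitted to all other patients. The rule used here (a cut-off at the midpoint between the mean scores of the two survival groups) is a deliberately simple stand-in, not the procedure used in the study, and the data are invented.

```python
def fit_cutoff(training):
    """Stand-in rule: midpoint between the mean scores of the two groups.

    `training` is a list of (risk_score, is_short_survivor) pairs and must
    contain at least one patient from each group.
    """
    short = [score for score, is_short in training if is_short]
    long_ = [score for score, is_short in training if not is_short]
    return (sum(short) / len(short) + sum(long_) / len(long_)) / 2

def leave_one_out_accuracy(data):
    """Classify each patient with a cut-off fitted without that patient."""
    correct = 0
    for i, (score, is_short) in enumerate(data):
        training = data[:i] + data[i + 1:]
        predicted_short = score > fit_cutoff(training)
        correct += predicted_short == is_short
    return correct / len(data)

# Invented example: (risk score, survived <= 12 months).
example = [(0.9, True), (0.8, True), (0.7, True),
           (0.2, False), (0.3, False), (0.1, False)]
accuracy = leave_one_out_accuracy(example)
```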
The status of genes and proteins must be considered as parts of complex networks rather than of simple linear pathways. Correspondingly, the absolute value of any single marker cannot serve as a reliable estimate of a risk constellation without considering additional interfering and protective influences [26, 30]. As a consequence, the expression of biomarkers and clinical information requires integration into comprehensive translational assessments of the patient's risk constellation. The ISIR algorithm and the Cox model use all available information including non-clinical information from genes and proteins, therapeutic interventions and genetic polymorphism or co-morbidities. Therefore, this study presented the ISIR as a novel method for data analysis and applied it to predict disease outcome in a small cohort of patients with bronchial carcinoma. Estimations of the immunological balance of Gas6 and CD68 may supplement other established tumour markers, but their impact on survival will require confirmation in prospective studies.
We gratefully thank E. Krott for her assistance in performing the tissue stainings.
- Cetin K, Ettinger DS, Hei YJ, O'Malley CD: Survival by histologic subtype in stage IV nonsmall cell lung cancer based on data from the Surveillance, Epidemiology and End Results Program. Clin Epidemiol. 2011, 3: 139-148.
- Pirozynski M: 100 years of lung cancer. Respir Med. 2006, 100: 2073-2084. 10.1016/j.rmed.2006.09.002.
- D'Amico TA, Massey M, Herndon JE, Moore MB, Harpole DH: A biologic risk model for stage I lung cancer: immunohistochemical analysis of 408 patients with the use of ten molecular markers. J Thorac Cardiovasc Surg. 1999, 117: 736-743. 10.1016/S0022-5223(99)70294-1.
- Cerny T, Barnes DM, Hasleton P, Barber PV, Healy K, Gullick W, Thatcher N: Expression of epidermal growth factor receptor (EGF-R) in human lung tumours. Br J Cancer. 1986, 54: 265-269. 10.1038/bjc.1986.172.
- Wheatley-Price P, Shepherd FA: Epidermal growth factor receptor inhibitors in the treatment of lung cancer: reality and hopes. Curr Opin Oncol. 2008, 20: 162-175. 10.1097/CCO.0b013e3282f335a3.
- Goldstraw P, Ball D, Jett JR, Le Chevalier T, Lim E, Nicholson AG, Shepherd FA: Non-small-cell lung cancer. Lancet. 2011
- Loges S, Schmidt T, Tjwa M, van Geyte K, Lievens D, Lutgens E, Vanhoutte D, Borgel D, Plaisance S, Hoylaerts M, Luttun A, Dewerchin M, Jonckx B, Carmeliet P: Malignant cells fuel tumor growth by educating infiltrating leukocytes to produce the mitogen Gas6. Blood. 2010, 115: 2264-2273. 10.1182/blood-2009-06-228684.
- Mc Cormack O, Chung WY, Fitzpatrick P, Cooke F, Flynn B, Harrison M, Fox E, Gallagher E, Goldrick AM, Dervan PA, Mc Cann A, Kerin MJ: Growth arrest-specific gene 6 expression in human breast cancer. Br J Cancer. 2008, 98: 1141-1146. 10.1038/sj.bjc.6604260.
- Bingle L, Brown NJ, Lewis CE: The role of tumour-associated macrophages in tumour progression: implications for new anticancer therapies. J Pathol. 2002, 196: 254-265. 10.1002/path.1027.
- Nativ O, Sabo E, Madeb R, Halachmi S, Madjar S, Moskovitz B: Prognostic score for patients with localized renal cell carcinoma treated by nephrectomy. Isr Med Assoc J. 2001, 3: 24-27.
- Harrell FE, Lee KL, Mark DB: Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med. 1996, 15: 361-387. 10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4.
- Klink CD, Binnebosel M, Kaemmer D, Schachtrupp A, Fiebeler A, Anurov M, Schumpelick V, Klinge U: Comet-tail-like inflammatory infiltrate to polymer filaments develops in tension-free conditions. Eur Surg Res. 2011, 46: 73-81. 10.1159/000322250.
- Chen HY, Yu SL, Chen CH, Chang GC, Chen CY, Yuan A, Cheng CL, Wang CH, Terng HJ, Kao SF, Chan WK, Li HN, Liu CC, Singh S, Chen WJ, Chen JJ, Yang PC: A five-gene signature and clinical outcome in non-small-cell lung cancer. N Engl J Med. 2007, 356: 11-20. 10.1056/NEJMoa060096.
- Hafizi S, Dahlback B: Gas6 and protein S. Vitamin K-dependent ligands for the Axl receptor tyrosine kinase subfamily. FEBS J. 2006, 273: 5231-5244. 10.1111/j.1742-4658.2006.05529.x.
- Sawabu T, Seno H, Kawashima T, Fukuda A, Uenoyama Y, Kawada M, Kanda N, Sekikawa A, Fukui H, Yanagita M, Yoshibayashi H, Satoh S, Sakai Y, Nakano T, Chiba T: Growth arrest-specific gene 6 and Axl signaling enhances gastric cancer cell survival via Akt pathway. Mol Carcinog. 2007, 46: 155-164. 10.1002/mc.20211.
- Coussens LM, Werb Z: Inflammation and cancer. Nature. 2002, 420: 860-867. 10.1038/nature01322.
- Mantovani A: Cancer: Inflaming metastasis. Nature. 2009, 457: 36-37.
- Kawai O, Ishii G, Kubota K, Murata Y, Naito Y, Mizuno T, Aokage K, Saijo N, Nishiwaki Y, Gemma A, Kudoh S, Ochiai A: Predominant infiltration of macrophages and CD8(+) T Cells in cancer nests is a significant predictor of survival in stage IV nonsmall cell lung cancer. Cancer. 2008, 113: 1387-1395. 10.1002/cncr.23712.
- Ohri CM, Shikotra A, Green RH, Waller DA, Bradding P: Macrophages within NSCLC tumour islets are predominantly of a cytotoxic M1 phenotype associated with extended survival. Eur Respir J. 2009, 33: 118-126. 10.1183/09031936.00065708.
- Dang TP, Gazdar AF, Virmani AK, Sepetavec T, Hande KR, Minna JD, Roberts JR, Carbone DP: Chromosome 19 translocation, overexpression of Notch3, and human lung cancer. J Natl Cancer Inst. 2000, 92: 1355-1357. 10.1093/jnci/92.16.1355.
- Konishi J, Kawaguchi KS, Vo H, Haruki N, Gonzalez A, Carbone DP, Dang TP: Gamma-secretase inhibitor prevents Notch3 activation and reduces proliferation in human lung cancers. Cancer Res. 2007, 67: 8051-8057. 10.1158/0008-5472.CAN-07-1022.
- van Ramshorst GH, Nieuwenhuizen J, Hop WC, Arends P, Boom J, Jeekel J, Lange JF: Abdominal wound dehiscence in adults: development and validation of a risk model. World J Surg. 2009, 34: 20-27.
- Mallett S, Royston P, Dutton S, Waters R, Altman DG: Reporting methods in studies developing prognostic models in cancer: a review. BMC Med. 2010, 8: 20. 10.1186/1741-7015-8-20.
- Tsui KH, Shvarts O, Smith RB, Figlin R, de Kernion JB, Belldegrun A: Renal cell carcinoma: prognostic significance of incidentally detected tumors. J Urol. 2000, 163: 426-430. 10.1016/S0022-5347(05)67892-5.
- Klinge U, Ackermann D, Lynen-Jansen P, Mertens PR: The risk to develop a recurrence of a gastric cancer-is it independent of time?. Langenbecks Arch Surg. 2008, 393: 149-155. 10.1007/s00423-007-0272-4.
- Veltri RW, Miller MC, An G: Standardization, analytical validation, and quality control of intermediate endpoint biomarkers. Urology. 2001, 57: 164-170. 10.1016/S0090-4295(00)00965-1.
- Wenske S, Korets R, Cronin AM, Vickers AJ, Fleisher M, Scher HI, Pettersson K, Guillonneau B, Scardino PT, Eastham JA, Lilja H: Evaluation of molecular forms of prostate-specific antigen and human kallikrein 2 in predicting biochemical failure after radical prostatectomy. Int J Cancer. 2009, 124: 659-663. 10.1002/ijc.23983.
- Yasrebi H, Sperisen P, Praz V, Bucher P: Can survival prediction be improved by merging gene expression data sets?. PLoS One. 2009, 4: e7431. 10.1371/journal.pone.0007431.
- Baudot A, Gomez-Lopez G, Valencia A: Translational disease interpretation with molecular networks. Genome Biol. 2009, 10: 221. 10.1186/gb-2009-10-6-221.
- Behrends C, Sowa ME, Gygi SP, Harper JW: Network organization of the human autophagy system. Nature. 2010, 466: 68-76. 10.1038/nature09204.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.