TEXT SIZE

search for



CrossRef (0)
Estimation of the time-dependent AUC for cure rate model with covariate dependent censoring
Communications for Statistical Applications and Methods 2024;31:365-375
Published online July 31, 2024
© 2024 Korean Statistical Society.

Yang-Jin Kim1,a

aDepartment of Statistics, Sookmyung Women’s University, Korea
Correspondence to: 1 Department of Statistics, SookmyungWomen’s University, chengpa-ro 47 gil, Yongsan-Gu, Seoul 04310, Korea. E-mail: yjin@sookmyung.ac.kr
This research is supported by Korean research foundation (NRF-2020R1A2C1A01100755).
Received October 23, 2023; Revised February 15, 2024; Accepted February 20, 2024.
 Abstract
Diverse methods to evaluate the prediction model of a time to event have been proposed in the context of right censored data where all subjects are subject to be susceptible. A time-dependent AUC (area under curve) measures the predictive ability of a marker based on case group and control one which are varying over time. When a substantial portion of subjects are event-free, a population consists of a susceptible group and a cured one. An uncertain curability of censored subjects makes it difficult to define both case group and control one. In this paper, our goal is to propose a time-dependent AUC for a cure rate model when a censoring distribution is related with covariates. A class of inverse probability of censoring weighted (IPCW) AUC estimators is proposed to adjust the possible sampling bias. We evaluate the finite sample performance of the suggested methods with diverse simulation schemes and the application to the melanoma dataset is presented to compare with other methods.
Keywords : cure rate model, discrimination, IPCW, mixture model, prediction accuracy, time-dependent ROC
1. Introduction

In prognostic studies, it happens a substantial portion of patients can be event-free, which is denoted as a cured group or a risk-free group. For evaluating the effect of covariates both on the cure rate and on the failure time of susceptible (uncured) patients, several models have been proposed. Among them, the mixture model is expressed of a logistic model for the cure rate and a proportional hazard regression model for susceptible patients (Kuk and Chen, 1992; Maller and Zhou, 1996). In the context of a cure rate model, two issues related with a predictive accuracy have been considered. The first one is to predict who is cured and the second is to predict the survival probabilities of uncured subjects based on the markers. Both issues can be dealt by extending the classical discriminative accuracy measures such as the ROC curve and C-index.

The ROC curve has been the most frequently applied measure by providing both a graph and an AUC value. There are two probabilities to construct the curve; a sensitivity is defined as the probability of having a higher marker value among a case group (true positive rate; TPR) and a specificity is defined as the probability of having a lower marker value among a control one (true negative rate; TNR), respectively. These probabilities have been changed according to the threshold value of a marker and are displayed as the ROC curve where plots sensitivity against one minus specificity over all possible thresholds. The predictive performance of a marker can be evaluated with the AUC (area under curve) where a higher AUC value indicates a better performance.

For survival data, the time to event as the response variable has been observed during follow-up and changed over time which results in a time-dependent AUC denoted by AUC(t). Suppose that M denotes the continuous marker to evaluate the predictive accuracy where the marker can be a risk factor, a combination of several risk factors or a risk score derived from working prediction models. Without the loss of generality, a higher value of M is assumed to indicate a higher chance of experiencing the event and of bringing an early event time. Heagerty and Zhang (2005) proposed several types of time dependent sensitivity, specificity and the corresponding time-dependent ROC curves. Among them, we consider a cumulative sensitivity S eC(c, t) and a dynamic specificity S pD(c, t) defined as follows,

SeC(t,c)=Pr (M>c∣Tt)SpD(t,c)=Pr (Mc∣T>t),

the corresponding ROC curve is given by ROCC/D(t) = S eC[(1 – S pD(p, t))−1, t], p ∈ [0, 1] and the resulting AUCC/D(t) is defined as

AUCC/D(t)=Pr (Mi>Mj∣Tit,Tj>t), β€Šβ€Š β€Šβ€Š β€Šβ€Šij,

which is interpreted as the probability that for two randomly chosen subjects, one experiencing the event prior to t has the greater marker value compared to the other one free from the event at t. This definition is more relevant in clinical studies to discriminate between subjects experiencing the event and those event-free prior to the specific time (Pepe, 2003; Kamarudin et al., 2017).

For a cure rate model, most estimators of the AUC have been focused on the cure probability of a prediction model. Asano et al. (2014) proposed two estimators of AUC by incorporating the full imputation and the mean score imputation for unknown cure status as the extension of Alonzo and Pepe (2005)’s method and Asano and Hirakawa (2017) also suggested the C-index with different weights for three groups (cure, uncured and censored subjects). Recently, several time-dependent AUC estimators have been proposed for evaluating a survival probability of susceptible group. Beyene et al. (2019) applied a nonparametric estimator of AUC proposed by Li et al. (2018) to cure model and Wang and Wang (2020) considered to implement a smoothing technique into the conditional survival functions of two main cure rate models such as mixture model and bounded cumulative hazard model (Yakovlev et al., 1993).

In this study, our interest is to suggest the time-dependent AUC estimator based on the inverse probability censoring weight (IPCW) technique, when a censoring is related with a covariate. The rest of this article is organized as follows. In Section 2, we introduce the notations and propose three types of time-dependent AUCs. In Section 3, the finite sample performance of suggested methods is evaluated through simulation studies. Application of the suggested methods to a melanoma dataset is presented in Section 4 and several discussions are given in Section 5.

2. Time-dependent AUC using IPCW

In the context of survival data, the time to event is not always observed due to the censoring related with diverse observation schemes. Furthermore, in the presence of nonsusceptible (cured) patients, the time to event is denoted as T = UT*+(1–U)∞, where U is an indicator that equals 1 if the subject is susceptible (not cured) and 0 if cured (insusceptible) and T* denotes an event time of a susceptible subject. Given a covariate vector Z, let π(Z) = Pr(U = 1|Z) be a susceptible probability modeled with Z. Let C denote a random censoring time with a survival function G(c) = Pr(Cc) and the censoring time is assumed to be independent of T* conditional on the covariate vector Z. Then observable data is denoted as (TΜƒ, δ, Z), where TΜƒ = min(T,C) and δ = I(T < C). When δ = 1, the individual has experienced an event, thus U = 1. When δ = 0, however, the information of U is missing. Therefore, a population survival function S (t|Z) is expressed as

S(t∣Z)=π(Z)S˜(t∣Z,U=1)+1-π(Z),

where SΜƒ (t|Z,U = 1) denotes the conditional survival function of a susceptible group. As t → ∞, SΜƒ (∞|Z,U = 1) = 0, but S (∞|Z) = 1 – π(Z). Therefore, ignoring a cure fraction would result in the biased inference of the survival function.

For modelling the susceptible rate of a subject i, a logistic regression model is implemented for estimating the effect of the covariate vector Xi = (1, Zi)

πi=Pr (Ui=1∣Xi)=exp(Xiγ)1+exp(Xiγ).

Under a PH model assumption, the conditional hazard model of a susceptible group is written as

λ(t∣Zi,Ui=1)=λ0(t)exp (Ziβ),

then S˜i(t∣Zi,Ui=1)=exp(-Λ(t∣Zi,Ui=1))=exp(-Λ0(t)eZiβ). For estimating θ = (γ, β, λ0), the EM algorithm is implemented to recover the unknown event status in a mixture model (Lam et al., 2008; Sy and Taylor, 2000; Kim and Jhun, 2008).

For evaluating the prediction model of a cure rate data, a risk score Mi=Ziβ^ is utilized as a marker. To reflect the susceptibility of a censored subject on cumulative sensitivity and dynamic specificity in (1.1) and (1.2), several methods have been proposed.

Beyene et al. (2019) considered a missing status of a censored subject T̃i and implemented the probability of experiencing an event until t > T̃i when a subject i is censored at T̃i which is expressed as

Bi(t)=1-P(T>t∣T>T˜i), β€Šβ€Š β€Šβ€Š β€Šβ€Št>T˜i,

is estimated

B^i(t)=δiI(T˜it)+(1-δi)(1-S^(t∣Zi)S^(T˜i∣Zi))I(T˜it),

where the survival function S^(t∣Z)=π^(Z)S˜^(t∣Z,U=1)+(1-π^(Z)) is estimated from the cure rate model (2.1). Then a time-dependent cumulative sensitivity, dynamic specificity and corresponding AUCB(t) are estimated as follows,

SeB(c,t)=i=1nI(Mi>c)B^i(t)i=1nB^i(t), β€Šβ€Š β€Šβ€Š β€Šβ€ŠSpB(c,t)=i=1nI(Mic)(1-B^i(t))i=1n(1-B^i(t)),AUCB(t)=i=1nj=1nB^i(t)(1-B^j(t))Miji=1nj=1nB^i(t)(1-B^j(t)), β€Šβ€Š β€Šβ€Š β€Šβ€ŠMij=I(Mi>Mj).

For a same problem, Wang and Wang (2020) directly implemented the estimated survival function as follows,

SeW(c,t)=i=1nI(Mi>c)(1-S^i(t∣Zi))i=1n(1-S^i(t∣Zi)), β€Šβ€Š β€Šβ€Š β€Šβ€ŠSpW(c,t)=i=1nI(Mic)S^i(t∣Zi)i=1nS^i(t∣Zi),AUCW(t)=i=1nj=1n(1-S^i(t∣Zi))S^j(t∣Zj)Miji=1nj=1n(1-S^i(t∣Zi))S^j(t∣Zj),

where they applied a smoothing technique to obtain AUCW(t).

In general survival data, a right censoring causes a biased sampling when a censoring distribution is related with a certain subpopulation which is sometimes modelled with a vector of covariates. Inverse probability of censoring weighting (IPCW) technique has been originally proposed to adjust for dependent censoring (Robins, 1993; Robins and Finkelstein, 2000). Under a competing risk data, it has been adopted to reflect the effect of the subpopulation with competing event (Fine and Gray, 1999) and also applied to the discriminative measures such as C-index (Uno et al., 2011) and AUC(t) (Blanche et al., 2013).

In this paper, we propose a class of IPCW estimators of time-dependent AUC(t) when a censoring distribution is related with covariates. Set Wi(TΜƒi) = 1/Ĝ(TΜƒi), where Ĝ denotes the estimated censoring survival function obtained from either a Kaplan-Meier estimator or regression models given a covariate vector Zi.

The first estimator is to incorporate IPCW into Beyene’s method (2.4) and (2.5) as follows,

SeBW(c,t)=i=1nI(Mi>c)B^i(t)W^i(Ti)i=1nB^i(t)W^i(T˜i),SpBW(c,t)=i=1nI(Mic)(1-B^i(t))W^i(t)i=1n(1-B^i(t))W^i(t),AUCBW(t)=i=1nj=1nB^i(t)(1-B^j(t))MijW^i(T˜i)W^j(t)i=1nj=1nB^i(t)(1-B^j(t))W^i(T˜i)W^j(t).

Blanche et al. (2013) proposed the IPCW estimators of ROC(t) under competing risk data and explained the role of weights on both case and two types of control. As the second estimator, we extend their idea to a cure rate model. For the cumulative sensitivity (1.1), the IPCW is incorporated into the case group who has experienced the event until t.

SeCW(c,t)=i=1nI(Mi>c)I(T˜it,δi=1)Wi(T˜i)i=1nI(T˜it,δi=1)Wi(T˜i).

For the control group of a dynamic specificity in (1.2), two versions are presented. The first version of control group, the event-free subjects at t have weighted with both a susceptible proportion πi and Wi(t) in order to reflect the chance of experiencing the event and the time to event is certain to be greater than t. The second version expands the control by including the subjects censored before t which are weighted with the conditional survival probability. Therefore, the two versions of dynamic specificity S pCW,1(c, t) and S pCW,2(c, t) are estimated by

SpCW,1(c,t)=i=1nI(Mic)I(T˜i>t)W^i(t)π^ii=1nI(T˜i>t)W^i(t)π^i,SpCW,2(c,t)=i=1nI(Mic)(I(T˜i>t)+(S^(t)/S^(T˜i))I(T˜i<t,δi=0))W^i(t)i=1n(I(T˜i>t)+(S^(t)/S^(T˜i))I(T˜i<t,δi=0))W^i(t).

Then the corresponding time-dependent AUC(t)s are estimated by

AUCCW,1(t)=i=1nj=1nMijI(T˜it,δi=1)I(T˜j>t)W^i(Ti)W^j(t)π^ji=1nj=1nI(T˜it,δi=1)I(T˜j>t)W^i(Ti)W^j(t)π^j,AUCCW,2(t)=i=1nj=1nMijI(T˜it,δi=1)(I(T˜i>t)+(S^(t)/S^(T˜i))I(T˜i<t,δi=0))W^i(T˜i)W^j(t)i=1nj=1nI(T˜it,δi=1)(I(T˜j>t)+(S(t)/S^(T˜j))I(T˜j<t,δi=0))W^i(T˜i)W^j(t).

For the variance estimation, the bootstrap samples are generated and the confidence intervals are obtained from their standard deviations.

3. Simulation

In this section, the performance of the suggested estimators is evaluated with three situations; (i) light censoring; 35% (cure-rate: 15%), (ii) medium censoring; 55% (cure-rate: 30%) and (iii) heavy censoring; 70%(cure-rate: 50%), respectively. The difference of these censoring rates is inclined to the amount of cure rates. For reflecting the effect of a covariate on cure rate, a failure time and a censoring distribution, a covriate Z is generated from a normal distribution N(0, 1). A cure status U = {0, 1} is generated based on Pr(U = 1) = (exp(γ0 + γ1Z))/(1 + exp(γ0 + γ1Z)), where γ0 is selected to get a suitable cure rate and γ1 = −1.0. For a subject with U = 1, generate a failure time T* from a hazard function λ(t|U = 1) = λ0(t)exp(βZ), where a baseline hazard function is assumed to follow a Weibull distribution and β = 0.5 is assigned to represent the effect of covariate on failure time. Let the marker define as Mi=Ziβ^ using the regression coefficient estimated at (2.3).

To compare the performance of several IPCW estimators of AUC(t), a censoring time is generated with two scenarios. (i) Covariate independent censoring: λc = θc where θc = 0.5 and (ii) to reflect the effect of covariate on the censoring, λc = 0.5exp(1.0Z). Then the observable time is composed of (TΜƒ, δ, X), where TΜƒ = min(T*,C) and δ = I(T* < C). For a cured subject with U = 0, set TΜƒ = C and δ = 0.

300 datasets are generated with two sample sizes n = 200 and n = 400. Table 1 and Table 2 show the biases and standard deviations of six estimators (AUCUno(t), AUCB(t) in (2.5), AUCW(t) in (2.6), AUCBW(t) in (2.7), AUCCW,1(t) in (2.8) and AUCCW,2(t) in (2.9)) at two percentile points (t(0.15), t(0.30)). Here AUCUno(t)(Uno et al., 2016) is also presented to show the effect of ignoring the cure rate but reflecting the IPCW and obtained from the R package SurvAUC.

Table 1 shows the biases(standard deviations) of suggested methods when a censoring distribution is independent of covariate. All estimates have similar results and seem to be unbiased. However, AUCCW,1(t) shows large biases all cases. It seems to be related with the definition of a control group. Implementing the weights to the subjects with T̃i > T seems to result in decreasing the size of the control group. Meanwhile, by augmenting the control group by including censored subjects, AUCCW,2(t) have smaller biases. For the standard deviation, the AUCW(t) based on smoothing technique has the smallest variation. Table 2 presents the simulation results at a covariate dependent censoring scenario. For non-IPCW estimator, AUCUno(t) ignoring the cure rate has the largest biases increasing with a censoring rates. The biases at t(0.3) tend to be larger than ones at t(0.15). Among the suggested IPCW-based methods, AUCCW,2(t)) has smallest biases at all situations and AUCCW,1(t) has smaller biases at low censoring rate but shows increasing biases. The IPCW-based AUCs at t(0.30) have smaller biases compared with those values at t(0.15) which have different result with non-IPCW ones. Also, according to standard deviation, the inclusion of weights brings the increment of variation of IPCW estimators.

Table 3 shows the coverage probabilities (CP) and the standard errors obtained using 50 bootstrap samples at n = 200 with a censoring rate 60% and a cure-rate 40%. In order to distinguish between IPCW estimators based on Ĝ(t) and Ĝ(t|Z), (AUCBW*(t),AUCCW,1*(t),AUCCW,2*(t)) represent the results obtained under the former case. Similar to the results of Table 1 and Table 2, under independent censoring scheme, AUCCW,1(t) has much smaller CP because of large biases. Under covariate dependent censoring, AUCUno(t), AUCB(t) and AUCW(t) show undesirable results while AUCCW,1(t) and AUCCW,2(t) have coverage probabilities close to a nominal one.

4. Data analysis

We analyzed a malignant melanoma dataset which is available in the R package MASS. The dataset consists of 205 patients whose tumors were completely removed together with the skin within a distance of about 2.5cm around it at the operation. The study started in the period 1962–1977 and all patients have been followed for checking disease progression and survival until 1977. Among 205 patients, only 57 patients died of melanoma, 14 one died from other causes and the remaining were alive. In this study, the death from other causes is regarded as a censoring (censoring rate 72%). The time scale is days since operation and four covariates Z such as sex (male = 1), age at operation and characteristics of the tumor such as tumor thickness (median = 1.94mm) and ulcer (1 = presence; 0 = absence). As the prediction model, we applied a PH cure model and R package smcure is used to estimate the parameters. According to Table 4, unlike Wang and Wang’s result applying the additive model λ(t|z) = λ0(t) + βz, ulcer is significant (p-value = 0.039) at the susceptibility and log (thickness) is significant in the hazard model (p-value = 0.002). At the regression model on censoring distribution, two covarites (log (thickness) and age) are significant under the PH model. That is, older patients with lower value of log (thickness) are likely to get censored.

Table 5 shows the nine AUC(t) values estimated at 1, 4, 8 years and 95% confidence intervals based on the standard errors obtained from 100 bootstrap samples. Here, the mark Mi=Ziβ^ is defined as the risk score calculated from the estimated latency distribution. The suggested IPCW estimators are presented with two versions according to the covariate-dependency on censoring distribution and G(t) and G(t|Z) give (AUCBW*,AUCCW,1*,AUCCW,2*) and (AUCBW, AUCCW,1,AUCCW,2), respectively. Among unweighted AUC estimators, AUCW has the smallest values at all times. AUCUno has higher values at all cases which is the same result as in the simulation. For AUCB and its weight versions AUCBW* and AUCBW, they have similar values at 1 year and 2 year but the weight versions have smaller values at 8 year. For IPCW estimators, comparing the results based on G(t) and G(t|Z), the AUC values at t = 1 and t = 4 year have almost same values but the covariate dependent AUC values at 8 year have smaller values and larger standard errors. This result is explained with a high censoring rate and uncommon weights Wi. AUCCW,2 and AUCBW show similar result but AUCCW,1 has the smallest values at two censoring situations. According to simulation and data analysis, AUCCW,1 is unsuitable to apply as the predictive measure. Figure 1 presents the ROC curves of six estimators at t = 1, 4 and 8 year with only covariate dependent versions of IPCW.

5. Concluding remarks

In this paper, we applied the IPCW approach to estimate time-dependent AUC for cure rate models when a censoring distribution is related with covariates. Simulation results show that the proposed procedures work well for covariate dependent censoring and a large censoring rate. However, Uno’s method AUCUno(t) for right censored data still works at covariate independent censoring but has largest biases at covariate dependent censoring. AUCW(t) based on the smoothing technique has the smallest variation at all cases but shows large biases as censoring rate and sample size increases. Among the IPCW-version estimators, AUCCW,1(t) shows a undesirable result at covariate independent censoring but has small bias only at the case with covariate dependent light censoring rate. The difference between AUCCW,1(t) and AUCCW,2(t) depends on the definition of the control group. At the former case AUCCW,1(t), the only subjects with TΜƒ > t is included with weights which makes the influence of true susceptible group decrease thus brings the underestimated result. At the latter case AUCCW,2(t), the censored subjects with Ti < t is added to augment the control group with a weight Pr(T > t|T > TΜƒ) = S (t)/S (TΜƒ). While it makes unbiased results at most scenarios, the implementation of the estimated survival function causes the increment of variation.

At melanoma data analysis, nine AUC(t) values are similar at the 1 and 4 year and the suggested ones have smaller values than non-IPCW AUC as time increases. In particular, covariate-dependent versions have large variations which bring the wider confidence intervals.

As another discriminative measure, a concordance index or C-index is defined as the proportion of concordant pairs where a patient with an early event time is likely to have a higher marker. Asano and Hirakawa (2017) proposed the C-index reflecting the patients’ cure status estimated the cure rate model. A time-dependent C-index C(t) (Gerds et al., 2013) will be considered to evaluate the prediction model of cure rate data.

As another interesting topic, dynamic prediction models have been studied with joint model and landmark approach when a cure rate model includes longitudinal covariates (Rizopulos et al., 2017). A two-dimensional AUC(s, t) can utilize to evaluate a time-dependent marker M(s) to predict the survival probability at time t where s < t.

Figures
Fig. 1. ROC(t) curves (AUCs) of six estimators at t = 1, 4, and 8 year.
TABLES

Table 1

Bias(sd) of AUC(t) at C ~ exp(θc), θc = 0.5

(Cure, Cen)ntAUCUnoAUCBAUCWAUCBWAUCCW,1AUCCW,2
(0.15, 0.35)200t(0,15)0.00010.00010.00010.00010.0200.0001
(0.053)(0.052)(0.028)(0.052)(0.057)(0.053)

t(0,30)0.0010.0010.0030.0020.0350.001
(0.041)(0.040)(0.029)(0.040)(0.046)(0.041)

400t(0,15)0.00010.00010.00020.00010.0310.0001
(0.037)(0.036)(0.022)(0.036)(0.040)(0.037)

t(0,30)0.0020.0010.0010.0010.0320.002
(0.028)(0.028)(0.022)(0.027)(0.032)(0.028)

(0.30, 0.55)200t(0,15)0.0010.0010.00310.0010.05810.0001
(0.050)(0.052)(0.037)(0.052)(0.059)(0.053)

t(0,30)0.0010.0010.0030.0020.0710.000
(0.040)(0.043)(0.038)(0.043)(0.047)(0.045)

400t(0,15)0.0010.00010.0010.00010.0540.001
(0.038)(0.037)(0.022)(0.037)(0.044)(0.038)

t(0,30)0.0020.00010.00010.00010.06170.001
(0.032)(0.032)(0.021)(0.032)(0.040)(0.029)

(0.50, 0.70)200t(0,15)0.0080.0110.0170.0110.1120.010
(0.049)(0.057)(0.048)(0.058)(0.062)(0.058)

t(0,30)0.0050.0030.0000.0000.0970.002
(0.038)(0.055)(0.052)(0.055)(0.049)(0.056)

400t(0,15)0.0120.00110.00080.00060.1010.0010
(0.038)(0.032)(0.024)(0.031)(0.041)(0.033)

t(0,30)0.0030.0030.0050.0050.1040.003
(0.029)(0.027)(0.024)(0.027)(0.034)(0.028)

cure: cure rate; cp: censoring rate;


Table 2

Bias(sd) of AUC(t) at c ~ exp(0.5exp(1.0Z))

(Cure, cen)ntAUCUnoAUCBAUCWAUCBWAUCCW,1AUCCW,2
(0.15, 0.35)200t(0,15)0.0200.0120.0100.0120.0130.013
(0.049)(0.047)(0.031)(0.048)(0.055)(0.050)

t(0,30)0.0370.0200.0140.0160.0040.001
(0.042)(0.041)(0.032)(0.043)(0.049)(0.045)

400t(0,15)0.0210.0320.0120.0120.0140.001
(0.038)(0.037)(0.022)(0.039)(0.042)(0.039)

t(0,30)0.0360.0200.0190.0160.0050.005
(0.029)(0.029)(0.023)(0.04)(0.041)(0.040)

(0.30, 0.55)200t(0,15)0.0340.0190.0130.0170.0280.001
(0.052)(0.050)(0.037)(0.052)(0.059)(0.051)

t(0,30)0.0560.0280.0250.0200.0080.008
(0.042)(0.042)(0.037)(0.050)(0.056)(0.053)

400t(0,15)0.0330.0180.0160.0150.0320.003
(0.037)(0.035)(0.024)(0.038)(0.043)(0.039)

t(0,30)0.0570.0310.0300.0190.0130.000
(0.031)(0.029)(0.024)(0.053)(0.061)(0.060)

(0.50, 0.70)200t(0,15)0.0390.0170.0120.0150.0560.005
(0.053)(0.062)(0.054)(0.065)(0.070)(0.067)

t(0,30)0.0700.0300.0270.0120.0300.002
(0.041)(0.058)(0.057)(0.075)(0.082)(0.077)

400t(0,15)0.0360.0180.0130.0180.0610.003
(0.034)(0.035)(0.028)(0.036)(0.044)(0.036)

t(0,30)0.0670.0320.0300.0120.0380.002
(0.029)(0.028)(0.020)(0.057)(0.070)(0.061)

Cure: cure rate; cen: censoring rate;


Table 3

Coverage probability of AUC(t) at n = 200 and (cure, cen) = (40%, 60%)

Covariate independent censoring
t(0,15)t(0,30)
MethodEstSDSECPEstSDSECP
AUCUno0.7060.0510.0500.9290.7220.0420.0400.948
AUCB0.7070.0540.0510.9350.7230.0480.0430.967
AUCW0.7020.0400.0310.9620.7220.0410.0310.967
AUCBW*0.7060.0540.0470.9350.7210.0470.0360.967
AUCCW,1*0.6430.0590.0600.8010.6590.0490.0470.775
AUCCW,2*0.7010.0560.0520.9170.7190.0490.0410.961

Covariate dependent censoring
t(0,15)t(0,30)
MethodEstSDSECPEstSDSECP

AUCUno0.7200.0550.0510.8900.750.0440.0420.680
AUCB0.7080.0560.0570.9100.7270.0560.0510.830
AUCW0.7050.0490.0450.8600.7230.0520.0470.830
AUCBW0.7070.0560.0600.9000.7180.0620.0640.900
AUCCW,10.6600.0570.0650.9400.6920.0590.0680.970
AUCCW,20.6890.0560.0610.9300.6930.0580.0640.930

Table 4

Summary of regression models of Melanoma dataset.

Cure modelCensoring distribution

CovSusceptible rateLatency distributionCox PH

Est(se)p-valueEst(se)p-valueEst(se)p-value
Intercept−2.33(0.929)0.012
Sex0.288(0.582)0.6210.569(.532)0.284−0.041(0.178)0.818
Log (thick)0.042(0.095)0.6590.874(0.287)0.002−0.206(0.094)0.028
Ulcer1.323(0.641)0.0390.086(0.536)0.8720.196(0.197)0.321
Age0.019(0.017)0.253−0.008(0.013)0.5570.026(0.006)<0.0001

Table 5

Estimation of AUC values and 95% CI at t = (1, 4, 8) years of malignant melanoma patients

Methodt = 1(Ŝ (t) = 0.97)t = 4(Ŝ (t) = 0.82)t = 8(Ŝ (t) = 0.68)
Est(se)95% CIEst(se)95% CIEst(se)95% CI

AUCUno0.904(0.814,0.994)0.824(0.746,0.902)0.772(0.628,0.816)
AUCB0.868(0.768,0.968)0.812(0.706,0.918)0.737(0.643,0.831)
AUCW0.789(0.685,0.893)0.780(0.610,0.806)0.731(0.619,0.848)

covariate-independent censoring:G(t)

AUCBW*0.887(0.785,989)0.812(0.710,0.914)0.725(0.615,0.835)
AUCCW,1*0.839(0.729,0.949)0.755(0.669,0.841)0.641(0.527,0.755)
AUCCW,2*0.889(0.787,0.991)0.810(0.706,0.914)0.738(0.634,0.842)

covariate-dependent censoring: G(t|Z)

AUCBW0.887(0.773,1.000)0.812(0.675,0.949)0.672(0.428,0.915)
AUCCW,10.838(0.715,0.971)0.755(0.669,0.841)0.583(0.338,0.828)
AUCCW,20.888(0.774,1.000)0.809(0.674,0.944)0.701(0.485,0.917)

References
  1. Alonzo TA and Pepe MS (2005). Assessing accuracy of a continuous screening test in the presence of verification bias. Journal of the Royal Statistical Society Series C: Applied Statistics, 54, 173-190.
    CrossRef
  2. Asano J, Hirakawa A, and Hamada C (2014). Assessing the prediction accuracy of cure in the Cox proportional hazards cure model: An application to breast cancer data. Pharmaceutical Statistics, 13, 357-363.
    Pubmed CrossRef
  3. Asano J and Hirakawa A (2017). Assessing the prediction accuracy of a cure model for censored survival data with long-term survivors: Application to breast cancer data. Journal of Biopharmaceutical Statistics, 27, 918-932.
    Pubmed CrossRef
  4. Beyene KM, Ghouch AE, and Oulhaj A (2019). On the validity of time-dependent AUC estimation in the presence of cure fraction. Biometrical Journal, 61, 1430-1447.
    Pubmed CrossRef
  5. Blache P, Dartigues J, and Jacqmin-Gadda H (2013). Estimating and comparing time-dependent areas under receiver operating characteristic curves for censored event times with competing risks. Statistics in Medicine, 32, 5381-5397.
    CrossRef
  6. Fine JP and Gray RJ (1999). A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association, 446, 496-509.
    CrossRef
  7. Gerds TA, Kattan MW, Schumacher M, and Yu C (2013). Estimating a time-dependent concordance index for survival prediction models with covariate dependent censoring. Statistics in Medicine, 32, 2173-2184.
    CrossRef
  8. Heagerty PJ and Zhang Y (2005). Survival model predictive accuracy and ROC curves. Biometrics, 61, 92-105.
    Pubmed CrossRef
  9. Kamarudin AN, Cox T, and Kolamunnage-Dona R (2017). Time-dependent ROC curve analysis in medical research: Current methods and applications. BMC Medical Research Methodology, 17, 53.
    Pubmed KoreaMed CrossRef
  10. Kim YJ and Jhun M (2008). Cure rate model for interval censored data. Statistics in Medicine, 27, 3-14.
    CrossRef
  11. Kuk AYC and Chen C (1992). A mixture model combining logistic regression with proportional hazards regression. Biometrika, 79, 531-541.
    CrossRef
  12. Li L, Greene T, and Hu B (2018). A simple method to estimate the time-dependent receiver operating characteristic curve and the area under the curve with right censored data. Statistical Methods in Medical Research, 27, 2264-2278.
    CrossRef
  13. Maller RA and Zhou X (1996). Survival Analysis with Long-term Survivors, Wiley, New York.
  14. Pepe MS (2003). The Statistical Evaluation of Medical Tests for Classification and Prediction, Oxford University Press, USA.
    CrossRef
  15. Rizopoulos D, Molenberghs G, Emmanuel MEH, and Lesaffre E (2017). Dynamic predictions with time-dependent covariates in survival analysis using joint modeling and landmarking. Biometrical Journal, 59, 1261-1276.
    Pubmed CrossRef
  16. Robins JM (1993). Information recovery and bias adjustment in proportional hazards regression analysis of randomized trials using surrogate markers. In Proceedings of the Biopharmaceutical Section, American Statistical Association, Alexandria, Virginia, 24-33.
  17. Robins JM and Finkelstein DM (2000). Correcting for noncompliance and dependent censoring in an AIDS clinical trial with inverse probability of censoring weighted (IPCW) log-rank tests. Biometrics, 56, 779-788.
    Pubmed CrossRef
  18. Sy JP and Taylor JMG (2000). Estimation in a Cox proportional hazards cure model. Biometrics, 56, 227-236.
    Pubmed CrossRef
  19. Uno H, Cai T, Pencina MJ, D’Agostino RB, and Wei LJ (2011). On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Statistics in Medicine, 30, 1105-1117.
    Pubmed KoreaMed CrossRef
  20. Wang Z and Wang X (2020). Evaluating the time-dependent predictive accuracy for event-to-time outcome with a cure fraction. Pharmaceutical Statistics, 19, 955-974.
    Pubmed CrossRef
  21. Yakovlev AY, Asselain B, Bardou VJ, Fourquet A, Hoang T, Rochefediere A, and Tsodikov AD (1993). A simple stochastic model of tumor recurrence and its application to data on pre-menopausal breast cancer. Biometrie et Analyse de Donnees Spatio-temporelles, 12, 66-82.