
The accelerated failure time model or accelerated life model relates the logarithm of the failure time linearly to the covariates. The parameters in the model provides a direct interpretation. In this paper, we review some newly developed practically useful estimation and inference methods for the model in the analysis of right censored data.
The survival data are common in many fields, such as economics, business, industrial engineering and biomedical studies. The nonparametric and semiparametric modeling has been studied extensively because it offers valid estimation and inference with less stringent model assumptions. The Cox proportional hazards model is the most used semiparametric regression models for the analysis of survival data (Cox, 1972) due to the availability of statistical software packages for implementation of the estimation and inference procedures. The accelerated failure time (AFT) models relates the logarithm of the failure time linearly to the covariates. The parameters in the model provides a direct interpretation. However, its estimation and inference procedures are challenging in the presence of censroing. Over the last forty years, there has been considerable research on the AFT models, see Buckely and James (1979), Jin
Let
where
It is assumed that conditional on the
Let
Currently, all available estimation method starts with following estimation equation, based on the Gehan’s statistic:
The function
where the notation
Or alternatively, the minimization of
where
Therefore, point estimator of
The general weighted log-rank estimating function for
or
where
Under regularity conditions, the root
In general, it is difficult to solve the equation
where
The
Similar to (
The iterative algorithm for estimating
where
and
When there is no censoring,
where
In the presence of censoring, we can only observe
where
The resulting Buckly-James estimator is the solution to the follwoing equation:
The difficulty to solve the
or equivalently
Set
It leads to an iterative algorithm
With the consistent initial estimator
where
Zeng and Lin (2007) proposed a kernel-smoothed profile likelihood estimator by an approximate nonparametric maximum likelihood method. Based on the semiparametric AFT model (
Zeng and Lin (2007) showed that the direct maximization of the log-likelihood does not yield a solution. It also does not lead to the maximization of a profile likelihood obtained by replacing the
by
where
The kernel-smoothed profile likelihood estimator
Given
Under regularity conditions, Zeng and Lin (2007) showed that the estimator
Except the kernel-smoothed profile likelihood estimator, the variance-covariance estimation of the Gehan-type rank estimator, weighted logrank-type estimator, and least-squares-type estimator is difficult because the corresponding variance-covariance matrices involve nonparametric estimation of the underlying probability density function. Here two computationally feasible approaches will be presented: resampling method and induced smoothing method.
The resampling method is similar to the resampling scheme similar to those in Rao and Zhao (1992), Parzen
Jin
The minimisation of
Jin
The developed methods are implemented in the Splus/R package lss and the design and application features of lss are provided in Huang and Jin (2007).
Due to the similarity between the objective function (
and define
Jin
and
Then,
where
The developed methods are implemented in the Splus/R package lss and the design and application features of lss are provided in Huang and Jin (2007).
Brown and Wang (2005) proposed a general variance estimation procedure based on an induced smoothing for non-smooth estimating functions. The approach is computationally efficient and easy to implement when the smoothing in terms of integration with respect to a Gaussian distribution has an explicit form. For the Gehan-type rank estimator, the induced smoothing in Brown and Wang (2005) yields an explicit form and its varaince-covariance matrix can be easily estimated as shown in Brown and Wang (2007). However, the induced smoothing in Brown and Wang (2005) for the general logrank-weighted estimator and least-squares-type estimator does not yield an explicit form and cannot be used directly. To overcome the difficulty, Jin
Brown and Wang (2007) and Johnson and Strawderman (2009) applied the induced smoothing method in Brown and Wang (2005) to the Gehan-type rank estimator
where
where Φ(·) is the cumulative distribution function of
where
For the weighted logrank-type rank estimator
The challenging part is to estimate
where
Based on the result (
Step 1: Calculate a consistent estimator
Step 2: Choose
Step 3: For the
Step 4: Calculate
Step 5: Repeat Step 3 and Step 4 for next
The covariance matrix of
In Jin
For the least-squares-type estimator
Again, the challenging part is to estimate
where
see Jin
In this paper, we have reviewed recently developed methods for point and variance estimation for the AFT model in the analysis of right censored data. The development is based on the assumption that errors are independent and identically distributed. The resampling approach offers valid inference but numerically intensive for large datasets, on the other hand, the induced smoothing approach is attractive due to its computational efficiency.
The paper of Jin and Ying (2004) studied asymptotic theory of rank estimation for AFT model under fixed censorship. Zhou (1992) and Jin (2007) studied
For the hypothesis testing framework, empirical likelihood approach has also been developed, see Zhou (2005a, 2005b), Zhou and Li (2008). For the heteroscedastic errors, Stute (1993, 1996) studied convergence properties of weighted estimators and Zhou
For time-dependent covariates, there has been theoretical development, see Robins and Tsiatis (1992) and Lin and Ying (1995). The profile nonparametric likelihood approach of Zeng and Lin (2007) is applicable theoretically, but there has been no numerical investigation.
There are still many research problems, such as model checking, variable selection and efficient and reliable software development.