Here we describe how the correlation between progression-free survival (PFS) and overall survival (OS) is computed.
The illness-death model used in simIDM
is designed to
jointly model PFS and OS as endpoints in an oncology clinical trial.
Within each treatment arm, the model is specified through the transition
hazards λ01(t), λ02(t) and λ12(t). This
approach allows us to consider the joint distribution of the two
endpoints and to derive the correlation between PFS and OS directly from
the assumed transition hazards.
Meller, Beyersmann, and Rufibach (2019) derive a closed formula for Cor(PFS, OS). The general formula for the correlation is $$ Cor(PFS, OS) = \frac{Cov(PFS, OS)}{\sqrt{Var(PFS) \, Var(OS)}}. $$ The expected values of PFS and OS can be derived via the survival functions for PFS and OS, respectively: 𝔼(PFS) = ∫0∞SPFS(u) du and 𝔼(OS) = ∫0∞SOS(u) du. The variance of PFS and OS are computed as follows: Var(PFS) = 𝔼(PFS2) − 𝔼(PFS)2 where 𝔼(PFS2) = 2 ⋅ ∫0∞u ⋅ SPFS(u) du and in a similar way for Var(OS). Cov(PFS, OS) is derived using Cov(PFS, OS) = 𝔼(PFS ⋅ OS) − 𝔼(PFS) ⋅ 𝔼(OS) together with the general formula for deriving expected survival times, and where $$ P(PFS \cdot OS > t) = P(PFS > \sqrt{t}) \, + \int_{\left(0, \sqrt{t} \right]} P_{11}(u,t/u;u) \cdot P(PFS>u-) \cdot \lambda_{01}(u) \, du. $$ This requires the transition probability P11(s, t; t1), which has the form of a standard survival function: P11(s, t; t1) = exp(−∫stλ12(u; t1) du).
We can compute the correlation based on assumptions or estimate it from data. Both work by simply plugging in assumed or estimated survival functions.
We consider three alternative scenarios to model transition hazards within a treatment arm:
library(simIDM)
# constant hazards:
transitionExp <- exponential_transition(h01 = 1.2, h02 = 1.5, h12 = 1.6)
# Weibull hazards:
transitionWeib <- weibull_transition(h01 = 1, h02 = 1.2, h12 = 1.3, p01 = 1.1, p02 = 0.8, p12 = 1.2)
# piecewise constant hazards:
transitionPwc <- piecewise_exponential(
h01 = c(1, 1.3), h02 = c(0.8, 1.5), h12 = c(1, 1),
pw01 = c(0, 3), pw02 = c(0, 1), pw12 = c(0, 8)
)
Now, we can compute the PFS-OS correlation with corTrans()
:
In case we are given trial data and want to estimate the PFS-OS correlation from the data, the following approach can be adopted:
We can estimate the parameters via maximum likelihood (ML), using the log-likelihood based on the counting process notation of Andersen et al. (1993), see also Meller, Beyersmann, and Rufibach (2019): $$ L(\theta) = \sum_{i=1}^{n} \sum_{k=1}^{3} \left( \log \left[ \lambda_{k}(t_{ik})^{d_{ik}} \frac{S_{k}(t_{ik})}{S_{k}(t_{0ik})} \right] \times \mathbb{I}(i \in Y_{ik}) \right), $$ where the sum is over all n individuals. k ∈ {1, 2, 3} is a simplified notation for the transitions 0 → 1, 0 → 2 and 1 → 2, λk is the corresponding transition hazard and Sk the survival function. dik is an indicator function, taking the value 1 if the i-th individual made the k-th transition and t0ik and tik are the time the i-th individual enters and exits, respectively, the root state of the k-th transition. The indicator function at the end of the formula is equal to 1 if the i-th individual is at risk for the transition k and 0 otherwise.
Currently, this package supports parameter estimation for assuming
either constant or Weibull transition hazards. The estimateParams()
function expects a data
argument of the same format as used
throughout the package, and a transition
argument of class
TransitionParameters
, specifying the assumed distribution
and desired starting values for ML estimation. To demonstrate this, we
simulate data using constant transition hazards:
transitionExp <- exponential_transition(h01 = 1.2, h02 = 1.5, h12 = 1.6)
simData <- getOneClinicalTrial(
nPat = c(500), transitionByArm = list(transitionExp),
dropout = list(rate = 0.8, time = 12),
accrual = list(param = "time", value = 1)
)
We can estimate the parameters as follows:
# Create TransitionParameters object with starting values for ML estimation:
transition <- exponential_transition(h01 = 1, h02 = 1, h12 = 1)
# Estimate parameters:
est <- estimateParams(data = simData, transition = transition)
# Get estimated transition hazards:
est$hazards
#> $h01
#> [1] 1.144761
#>
#> $h02
#> [1] 1.477287
#>
#> $h12
#> [1] 1.337076
Then, in a final step, we pass est
to corTrans()
to compute the PFS-OS correlation.
Alternatively, one can combine these steps efficiently via corPFSOS()
,
which has an additional bootstrap
argument to quantify the
uncertainty of the correlation estimate: