A note on promotion time cure models
with a new biological consideration

Zhi Zhao
Department of Biostatistics, University of Oslo, P.O.Box 1122 Blindern, 0317 Oslo, Norway

Fatih Kızılaslan
Department of Biostatistics, University of Oslo, P.O.Box 1122 Blindern, 0317 Oslo, Norway
Department for Method Development and Analytics, Norwegian Institute of Public Health, Oslo, Norway

Abstract

We introduce a generalized promotion time cure model motivated by a new biological consideration. The new approach is flexible to model heterogeneous survival data, in particular for addressing intra-sample heterogeneity. We also indicate that the new approach is suited to model a series or parallel system consisting of multiple subsystems in reliability analysis.

Keywords: Multiscale data integration; cell composition analysis; survival modeling; Weibull mixture models; multinomial-Poisson transformation; reliability analysis

1 Introduction

The promotion time cure model (PTCM) is one of the most important models in survival analysis, but it has not yet been studied much in the literature (Amico and Keilegom, 2018). The PTCM is motivated by biological considerations: it assumes that, after initial treatment, the time to recurrence of cancer is the result of a latent process in which residual tumor cells (i.e. clonogenic cells) propagate into a newly detectable tumor (Yakovlev, 1996). As shown in Yakovlev (1996) and Chen et al. (1999), the construction of the PTCM depends on a latent variable $N$, the number of clonogenic cells left active in a patient after initial treatment. Assume that $N$ is Poisson distributed with mean $\theta>0$, i.e. $\mathbb{P}(N=k)=\theta^{k}e^{-\theta}/k!$, $k\in\mathbb{N}$. Let another latent variable $Z_{j}$ ($j=1,...,N$) be the random time for the $j$-th clonogenic cell to produce a detectable tumor mass. Given $N$, the variables $Z_{j}$ are independently and identically distributed with cumulative distribution function (cdf) $F(t)=1-S(t)$. Here $F(t)$ is the promotion time distribution of any clonogenic cell and $S(t)$ is its corresponding survival function. The time to tumor recurrence can be defined by the random variable $T=\min_{0\leq j\leq N}\{Z_{j}\}$, i.e. the tumor recurs when the first of the clonogenic cells becomes activated, where $\mathbb{P}(Z_{0}=\infty)=1$ (i.e. no tumor recurrence in a finite time). Note that the time to tumor recurrence $T$ of a patient is observable, but $N$ and $Z_{j}$ are unobservable latent variables. The survival function of the population is the probability of no newly detectable tumor by time $t$, given by

\begin{align}
S_{pop}(t) &= \mathbb{P}(N=0)+\mathbb{P}(Z_{1}>t,\ldots,Z_{N}>t,N>0) \nonumber\\
&= e^{-\theta}+\sum_{k=1}^{\infty}\mathbb{P}(Z_{1}>t,\ldots,Z_{N=k}>t)\,\mathbb{P}(N=k) \nonumber\\
&= e^{-\theta}+\sum_{k=1}^{\infty}S(t)^{k}\frac{\theta^{k}e^{-\theta}}{k!}=e^{-\theta\left(1-S(t)\right)}=e^{-\theta F(t)}. \tag{1}
\end{align}
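As a quick sanity check of Eq. (1), the latent construction can be simulated directly and compared with the closed form. The following is a minimal sketch, assuming a Weibull promotion time distribution and arbitrary illustrative parameter values; the function names are hypothetical and not taken from any package.

```python
import math
import random


def weibull_surv(t, scale, shape):
    """Weibull survival S(t) = exp(-(t/scale)^shape); an assumed promotion time law."""
    return math.exp(-((t / scale) ** shape))


def ptcm_surv(t, theta, scale, shape):
    """Population survival of the PTCM, Eq. (1): exp(-theta * F(t))."""
    return math.exp(-theta * (1.0 - weibull_surv(t, scale, shape)))


def sample_poisson(mean, rng):
    """Poisson draw by Knuth's multiplication method (adequate for small means)."""
    limit, count, prod = math.exp(-mean), 0, rng.random()
    while prod > limit:
        count += 1
        prod *= rng.random()
    return count


def simulate_recurrence_time(theta, scale, shape, rng):
    """T = min of N i.i.d. promotion times; T is infinite when N = 0 (cured)."""
    n_cells = sample_poisson(theta, rng)
    return min((rng.weibullvariate(scale, shape) for _ in range(n_cells)),
               default=math.inf)


# Illustrative (assumed) values, not taken from the paper.
rng = random.Random(2024)
theta, scale, shape, t0 = 1.5, 2.0, 1.3, 1.0
draws = [simulate_recurrence_time(theta, scale, shape, rng) for _ in range(20000)]
empirical = sum(d > t0 for d in draws) / len(draws)
closed_form = ptcm_surv(t0, theta, scale, shape)
print(f"empirical {empirical:.3f} vs closed form {closed_form:.3f}")
```

The empirical proportion of simulated times exceeding `t0` should agree with `ptcm_surv` up to Monte Carlo error, and the closed form tends to the cure fraction $e^{-\theta}$ for large $t$.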

Covariates can be introduced in the parameter $\theta$ and may also be introduced in the proper baseline distribution function $F(t)$ . However, it is not clear how to include clonogenic cell data information in the PTCM (1) for better progression-free survival prediction or potentially for the identification of subclonal driver genes predictive of survival. Motivated by this, we propose the following generalized promotion time cure model (GPTCM) to integrate multiscale data, i.e. cancer patient data on multiple biological scales: individual-level survival data, cellular-level cell type proportions data, and subcellular-level cell-type-specific genetic variables.

2 The generalized promotion time cure model (GPTCM)

2.1 Formulation

In cancer cell biology, a tumor might contain a mixture of cell subtypes, for example, invasive tumor cells, non-invasive tumor cells and stromal cells (Trapnell, 2015). Motivated by the classical PTCM (1), we assume that all tumor cells are composed of multiple clonogenic cell groups (i.e. tumor cell subtypes or subclones). Suppose that, after an initial treatment, a patient has a total number of tumor cells $N=\sum_{l=1}^{L}N_{l}$, $L\geq 2$, where $N_{l}$ is the number of cells in the $l$-th cluster (e.g. tumor cell subtype). Similar to the PTCM, let the $l$-th cluster have multivariate random times for $N_{l}$ clonogenic cells propagating into a newly detectable tumor:

\bm{W}_{l}=\left(Z_{\sum_{j=1}^{l}N_{j-1}+1},\ldots,Z_{\sum_{j=1}^{l}N_{j-1}+N_{l}}\right),

where $l\in\{1,...,L\}$ and $N_{0}=0$ . For the $N_{l}$ homogeneous cells in the $l$ -th cluster, we assume cluster-specific promotion time distribution $F_{l}(t)=1-S_{l}(t)$ , and then we have

\mathbb{P}(\min\{\bm{W}_{l}\}>t)=S_{l}(t)^{N_{l}}.

The time to tumor recurrence can be defined as $T=\min\{\min\{\bm{W}_{1}\},...,\min\{\bm{W}_{L}\}\}$ . Then the survival function for the population is given by

\begin{align}
S_{pop}(t) &= \mathbb{P}(\text{no cancer by time }t) \nonumber\\
&= \mathbb{P}(N=0)+\mathbb{P}\left(\min\{\min\{\bm{W}_{1}\},\ldots,\min\{\bm{W}_{L}\}\}>t,N>0\right) \nonumber\\
&= \mathbb{P}(N=0)+\mathbb{P}(\min\{\bm{W}_{1}\}>t,\ldots,\min\{\bm{W}_{L}\}>t,N>0) \nonumber\\
&= \mathbb{P}(N=0)+\sum_{k=1}^{\infty}\mathbb{P}\left(\min\{\bm{W}_{1}\}>t,\ldots,\min\{\bm{W}_{L}\}>t\mid N=k\right)\cdot\mathbb{P}(N=k). \tag{2}
\end{align}

To compute the second term in (2), we use the multinomial theorem and the multinomial-Poisson transformation (Brookmeyer and Damiano, 1989). With the multinomial theorem, we can consider all configurations of $(N_{1},...,N_{L})$ such that their sum is $k$. If the $N_{l}$ are independent Poisson random variables with means $\theta_{l}$, denoted as $\mathcal{P}ois(N_{l};\theta_{l})$ ($l=1,...,L$), then by the multinomial-Poisson transformation the unconditional joint distribution of $(N_{1},...,N_{L})$ can be factorized into the product of a Poisson distribution for the total $N$ and a multinomial distribution. The multinomial distribution is $\mathbb{P}(N_{1},...,N_{L}|N=k)=:\mathcal{M}ult(k;\bm{p})$, where $\bm{p}=(p_{1},...,p_{L})$, $p_{l}=\theta_{l}/\theta$ and $\theta=\sum_{l}\theta_{l}$. Then we obtain

\begin{align*}
S_{>0}(t) &:= \sum_{k=1}^{\infty}\mathbb{P}\left(\min\{\bm{W}_{1}\}>t,\ldots,\min\{\bm{W}_{L}\}>t\mid N=k\right)\cdot\mathbb{P}(N=k)\\
&= \sum_{k=1}^{\infty}\sum_{\begin{subarray}{c}\text{All config.}\\ (N_{1},...,N_{L})\\ \text{with sum }k\end{subarray}}\mathbb{P}\left(\min\{\bm{W}_{1}\}>t,\ldots,\min\{\bm{W}_{L}\}>t\mid N=k,N_{1},...,N_{L}\right)\cdot\mathbb{P}(N=k,N_{1},...,N_{L})\\
&= \sum_{k=1}^{\infty}\sum_{\begin{subarray}{c}\text{All config.}\\ (N_{1},...,N_{L})\\ \text{with sum }k\end{subarray}}\mathbb{P}\left(\min\{\bm{W}_{1}\}>t,\ldots,\min\{\bm{W}_{L}\}>t\mid N=k,N_{1},...,N_{L}\right)\cdot\mathbb{P}(N=k)\,\mathbb{P}(N_{1},...,N_{L}\mid N=k)\\
&= \sum_{k=1}^{\infty}\sum_{\begin{subarray}{c}\text{All config.}\\ (N_{1},...,N_{L})\\ \text{with sum }k\end{subarray}}\mathbb{P}\left(\min\{\bm{W}_{1}\}>t,\ldots,\min\{\bm{W}_{L}\}>t\mid N=k,N_{1},...,N_{L}\right)\cdot\mathcal{P}ois(k;\theta)\,\mathcal{M}ult(k;\bm{p})\\
&= \sum_{k=1}^{\infty}\sum_{\begin{subarray}{c}\text{All config.}\\ (N_{1},...,N_{L})\\ \text{with sum }k\end{subarray}}\prod_{l=1}^{L}\mathbb{P}(\min\{\bm{W}_{l}\}>t\mid N_{l})\cdot\mathcal{P}ois(k;\theta)\,\mathcal{M}ult(k;\bm{p})\\
&= \sum_{k=1}^{\infty}\sum_{\begin{subarray}{c}\text{All config.}\\ (N_{1},...,N_{L})\\ \text{with sum }k\end{subarray}}\prod_{l=1}^{L}S_{l}(t)^{N_{l}}\cdot\mathcal{P}ois(k;\theta)\,\mathcal{M}ult(k;\bm{p})\\
&= \sum_{k=1}^{\infty}\sum_{\begin{subarray}{c}\text{All config.}\\ (N_{1},...,N_{L})\\ \text{with sum }k\end{subarray}}\prod_{l=1}^{L}S_{l}(t)^{N_{l}}\cdot\frac{\theta^{k}e^{-\theta}}{k!}\cdot\frac{k!}{N_{1}!\cdots N_{L}!}\,p_{1}^{N_{1}}\cdots p_{L}^{N_{L}}\\
&= \sum_{k=1}^{\infty}\frac{\theta^{k}e^{-\theta}}{k!}\sum_{\begin{subarray}{c}\text{All config.}\\ (N_{1},...,N_{L})\\ \text{with sum }k\end{subarray}}\frac{k!}{N_{1}!\cdots N_{L}!}\{p_{1}S_{1}(t)\}^{N_{1}}\cdots\{p_{L}S_{L}(t)\}^{N_{L}}\\
&= \sum_{k=1}^{\infty}\frac{\theta^{k}e^{-\theta}}{k!}\{p_{1}S_{1}(t)+\cdots+p_{L}S_{L}(t)\}^{k}\\
&= \left\{e^{\theta\sum_{l=1}^{L}p_{l}S_{l}(t)}-1\right\}e^{-\theta}.
\end{align*}

Finally, the population survival function is

\begin{align}
S_{pop}(t) &= \mathbb{P}(N=0)+S_{>0}(t) \nonumber\\
&= e^{-\theta}+\left\{e^{\theta\sum_{l=1}^{L}p_{l}S_{l}(t)}-1\right\}e^{-\theta} \nonumber\\
&= e^{-\theta\left\{1-\sum_{l=1}^{L}p_{l}S_{l}(t)\right\}}. \tag{3}
\end{align}
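The closed form (3) can be checked numerically by simulating the latent mechanism with independent cluster counts $N_{l}\sim\mathcal{P}ois(\theta p_{l})$. The sketch below assumes Weibull cluster-specific survival functions with a common shape and uses arbitrary illustrative values; all function names are hypothetical.

```python
import math
import random


def gptcm_surv(t, theta, props, scales, shape):
    """Eq. (3): exp(-theta * (1 - sum_l p_l * S_l(t))) with Weibull S_l."""
    mix = sum(p * math.exp(-((t / lam) ** shape)) for p, lam in zip(props, scales))
    return math.exp(-theta * (1.0 - mix))


def sample_poisson(mean, rng):
    """Poisson draw by Knuth's multiplication method (adequate for small means)."""
    limit, count, prod = math.exp(-mean), 0, rng.random()
    while prod > limit:
        count += 1
        prod *= rng.random()
    return count


def simulate_gptcm_time(theta, props, scales, shape, rng):
    """First activation over all clusters, with N_l ~ Poisson(theta * p_l)."""
    first = math.inf  # stays infinite when every cluster is empty (cured)
    for p, lam in zip(props, scales):
        for _ in range(sample_poisson(theta * p, rng)):
            first = min(first, rng.weibullvariate(lam, shape))
    return first


# Illustrative (assumed) values, not taken from the paper.
rng = random.Random(7)
theta, props, scales, shape, t0 = 2.0, (0.5, 0.3, 0.2), (1.0, 2.0, 4.0), 1.5, 1.0
draws = [simulate_gptcm_time(theta, props, scales, shape, rng) for _ in range(20000)]
empirical = sum(d > t0 for d in draws) / len(draws)
closed_form = gptcm_surv(t0, theta, props, scales, shape)
print(f"empirical {empirical:.3f} vs closed form {closed_form:.3f}")
```

Setting all entries of `scales` equal makes `gptcm_surv` coincide with the PTCM survival function (1), which is the reduction noted in the text.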

Let $F(t)=1-\sum_{l=1}^{L}p_{l}S_{l}(t)$; then $S_{pop}(t)=e^{-\theta F(t)}$. Hence, if $S_{l}(t)=S(t)$ for all $l\in\{1,...,L\}$ (i.e. no different types of cells), the population survival function (3) reduces to the PTCM (1), so we name the new model the generalized promotion time cure model (GPTCM).
Remark 1. Note that the proportions $(p_{1},...,p_{L})$ in the GPTCM (3) are patients’ cancer cell proportions data (i.e. $n$ -by- $L$ data matrix collected from $n$ patients) not simple weight parameters. Therefore, the GPTCM can integrate multiscale data [i.e. individual-level survival data, cellular-level cell type proportions data, and subcellular-level cell-type-specific molecular and genomic data (see Remark 4 below)] for joint modeling.
Remark 2. The GPTCM is similar to the general class of PTCM given by Equation 2 in Gómez et al. (2023), whose population survival function was derived from a compound Poisson distribution for the total number of clonogenic cells. However, they assumed a common promotion time distribution for all cells, i.e. without distinguishing heterogeneous cells. Gómez et al. (2023) also fixed the number of cells to be 1, 2 or $\infty$, regardless of the actual number of tumor cells in patients.
Remark 3. The GPTCM is also similar to a mixtures-of-experts model for survival analysis (Rosen and Tanner, 1999) or a generalized Weibull mixture model for reliability analysis (Jiang and Murthy, 1995). However, the (generalized) mixture models are designed for inter-sample heterogeneity, assuming every sample comes from one of the mixture clusters. In contrast, our GPTCM models intra-sample heterogeneity, since every sample/patient has data for multiple clusters $p_{l}S_{l}(t),l\in\{1,...,L\}$. In fact, a mixture model of the form $S_{pop}(t)=\mathbb{P}(N=0)+\sum_{l=1}^{L}p_{l}S_{l}(t)$ is not biologically meaningful when the time to recurrence is the result of a latent process for cancer recurrence; see Appendix A.
Remark 4. Similar to the PTCM, the GPTCM can introduce covariates $X$ through the Poisson rate parameter $\theta$, e.g. $\theta=\exp(\xi_{0}+X\xi)$. Thanks to the mixture part of the GPTCM, cluster-specific covariates $X_{l}$ (e.g. genetic variables from each tumor cell subtype) can be introduced in $S_{l}(t)$. For example, one can use a log-linear model for the mean survival time, i.e. $\log\mu_{l}=X_{l}\beta_{l},l\in\{1,...,L\}$, where $\mu_{l}$ is the mean of the Weibull distribution $S_{l}(t)=\exp\{-(t/\lambda_{l})^{\kappa}\}$, $\lambda_{l}=\mu_{l}/\Gamma(1+1/\kappa)$, $\kappa\in\mathbb{R}_{+}$, and $\Gamma(\cdot)$ is the gamma function. The modeling of tumor cell-type-specific genes has the potential to identify cell-type-specific drivers for cancer prognosis, and ultimately improve individualized cancer diagnosis and personalized cancer therapies. Furthermore, if we assume randomness in the proportions data (e.g. following a Dirichlet distribution), covariates may also be introduced to model the compositional data of cell proportions (Greenacre, 2021; Mangiola et al., 2023).
Remark 5. Identifiability is an important issue in the estimation of cure models. The GPTCM is identifiable when $\theta=\exp(\xi_{0}+X\xi)$ and $\lim_{t\to\infty}S_{l}(t)=0$ ($\forall l\in\{1,...,L\}$) according to Proposition 7 in Hanin and Huang (2014). In a finite mixture model $\sum_{l=1}^{L}p_{l}S_{l}(t)$, label switching is a common identifiability issue, since there is no prior information to distinguish between the clusters of the mixture. However, in applications to single-cell data, cell types can be predefined based on cell biology, and single-cell sequencing data usually result in well estimated cell type proportions. As mentioned in Remark 1, the proportions $(p_{1},...,p_{L})$ in the GPTCM are cancer cell proportions data collected from patients rather than weight parameters, so label switching is not an issue.
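The covariate parameterization described in Remark 4 can be sketched as follows. This is only an illustration under the stated Weibull assumption; the helper names and the numerical values are hypothetical. The last helper integrates $S_{l}(t)$ numerically to confirm that the scale $\lambda_{l}=\mu_{l}/\Gamma(1+1/\kappa)$ indeed yields mean $\mu_{l}$.

```python
import math


def poisson_rate(x, xi0, xi):
    """theta = exp(xi0 + x . xi) for one patient's covariate row x."""
    return math.exp(xi0 + sum(a * b for a, b in zip(x, xi)))


def weibull_scale(x_l, beta_l, kappa):
    """lambda_l = mu_l / Gamma(1 + 1/kappa), with log(mu_l) = x_l . beta_l."""
    mu_l = math.exp(sum(a * b for a, b in zip(x_l, beta_l)))
    return mu_l / math.gamma(1.0 + 1.0 / kappa)


def weibull_mean(scale, kappa, step=1e-3, upper=50.0):
    """Mean survival time as the integral of S(t), by the trapezoidal rule."""
    n = int(upper / step)
    surv = [math.exp(-((i * step / scale) ** kappa)) for i in range(n + 1)]
    return step * (sum(surv) - 0.5 * (surv[0] + surv[-1]))


# Illustrative (assumed) covariates and coefficients.
x_l, beta_l, kappa = (0.5, -1.0), (0.4, -0.3), 2.0
mu_l = math.exp(0.5 * 0.4 + (-1.0) * (-0.3))
lam_l = weibull_scale(x_l, beta_l, kappa)
print(f"target mean {mu_l:.4f}, numerical mean {weibull_mean(lam_l, kappa):.4f}")
```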

2.2 Connection to last-activation scheme and reliability analysis

The PTCM is also referred to as the first-activation scheme (Cooner et al., 2007). When all clonogenic cells are homogeneous (i.e. no different types of cells) and tumor recurrence occurs when the last clonogenic cell becomes activated (i.e. $T=\max_{0\leq j\leq N}\{Z_{j}\}$), Cooner et al. (2007) referred to this as the last-activation scheme, whose population survival function is $1+e^{-\theta}(1-e^{\theta F(t)})$. Similarly, the GPTCM can be extended so that recurrence is observed when the last class of clonogenic cells becomes activated. Then the time to tumor recurrence can be defined as $T=\max\{\max\{\bm{W}_{1}\},...,\max\{\bm{W}_{L}\}\}$ and the population survival function is given by (see Appendix B for details)

\tilde{S}_{pop}(t)=1+e^{-\theta}-e^{-\theta\sum_{l=1}^{L}p_{l}S_{l}(t)}. \tag{4}

Here $\tilde{S}_{pop}(t)$ is an improper survival function, with $\tilde{S}_{pop}(0)=1$ and cure rate $\tilde{S}_{pop}(\infty)=e^{-\theta}$.
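The boundary values of the last-activation survival function, and its relation to the first-activation form (3), can be verified numerically. The sketch below assumes Weibull cluster survival functions and arbitrary illustrative parameters; function names are hypothetical. Since the latest activation cannot precede the earliest one, (4) should never fall below (3); algebraically, with $a=e^{-\theta}$ and $m=\sum_{l}p_{l}S_{l}(t)$, the difference equals $(1-a^{m})(1-a^{1-m})\geq 0$.

```python
import math


def mixture_surv(t, props, scales, shape):
    """sum_l p_l * S_l(t) with Weibull S_l (assumed form)."""
    return sum(p * math.exp(-((t / lam) ** shape)) for p, lam in zip(props, scales))


def first_activation_surv(t, theta, props, scales, shape):
    """Eq. (3): exp(-theta * (1 - sum_l p_l S_l(t)))."""
    return math.exp(-theta * (1.0 - mixture_surv(t, props, scales, shape)))


def last_activation_surv(t, theta, props, scales, shape):
    """Eq. (4): 1 + exp(-theta) - exp(-theta * sum_l p_l S_l(t))."""
    return 1.0 + math.exp(-theta) - math.exp(-theta * mixture_surv(t, props, scales, shape))


# Illustrative (assumed) values.
theta, props, scales, shape = 2.0, (0.5, 0.3, 0.2), (1.0, 2.0, 4.0), 1.5
grid = [0.1 * i for i in range(0, 101)]
gaps = [last_activation_surv(t, theta, props, scales, shape)
        - first_activation_surv(t, theta, props, scales, shape) for t in grid]
print(f"smallest gap on the grid: {min(gaps):.3e}")
```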

From the perspective of system reliability, the PTCM can be interpreted as analogous to a series system with a random number of units under random shock (Cha and Finkelstein, 2018). In such a system, failure occurs as soon as one unit fails, making the PTCM conceptually similar to a reliability structure where the weakest link dictates the overall system failure. Our proposed GPTCM is suited to modeling a system consisting of multiple heterogeneous subsystems (Fig. 1A), as discussed in Wei and Liu (2023), who investigate the reliability of the time until one critical subsystem fails.

In reliability engineering, a natural extension of the series system is the latent parallel system model, in which failure occurs only after all latent factors have been activated, known as the last-activation scheme defined in Cooner et al. (2007). It represents a contrasting mechanism where the survival time depends on the activation of all latent processes, rather than being dictated by the earliest activation. Therefore, the last-activation scheme model (4) can be used for a parallel system with multiple subsystems (Fig. 1B). The cure rate $\tilde{S}_{pop}(\infty)=e^{-\theta}$ means that a harmful event does not result in an ultimate system failure. Further extensions are possible for a parallel-series system (Fig. 1C) with the failure time $T=\max\{\min\{\bm{W}_{1}\},...,\min\{\bm{W}_{L}\}\}$, i.e. the failure occurs when all of the parallel subsystems fail, and for a series-parallel system (Fig. 1D) with the failure time $T=\min\{\max\{\bm{W}_{1}\},...,\max\{\bm{W}_{L}\}\}$, i.e. the failure occurs when one of the parallel subsystems fails.

Figure 1: Illustration of series system, parallel system, parallel-series system and series-parallel system.

2.3 Statistical characteristics of the GPTCM

The GPTCM (for the first-activation scheme) and the classical PTCM have similar statistical properties. For example, both the PTCM and GPTCM do not have proper survival functions, since their cure fraction is $S_{pop}(\infty)=e^{-\theta}>0$ . The survival function of the noncured population of the GPTCM is a proper survival function, i.e. $S^{*}(t)=S_{>0}(t)/(1-e^{-\theta})$ , $S^{*}(0)=1$ and $S^{*}(\infty)=0$ . Assuming all covariates $X$ are time-independent, the population probability density function (pdf) of the GPTCM is given by

f_{pop}(t|X)=-\frac{\operatorname{d}\!{S}_{pop}(t|X)}{\operatorname{d}\!{t}}=\theta f(t)e^{-\theta F(t)},

where $F(t)=1-\sum_{l=1}^{L}p_{l}S_{l}(t)=\sum_{l=1}^{L}p_{l}F_{l}(t)$, $f(t)=(\operatorname{d}\!{/}\operatorname{d}\!{t})F(t)=\sum_{l=1}^{L}p_{l}f_{l}(t)$, and $f_{l}(t)$ and $F_{l}(t)$ are the cluster-specific promotion time pdf and cdf, respectively. Note that here $f_{pop}(t|X)$ is not a proper pdf, since $S_{pop}(t|X)$ is not a proper survival function.

The hazard functions of the entire population and the noncured population of the GPTCM are

\begin{align*}
h_{pop}(t\mid X) &= \frac{-\operatorname{d}\!{S}_{pop}(t\mid X)/\operatorname{d}\!{t}}{S_{pop}(t\mid X)}=\theta f(t)=\theta\sum_{l=1}^{L}p_{l}f_{l}(t),\\
h^{*}(t\mid X) &= \frac{-\operatorname{d}\!{S}^{*}(t\mid X)/\operatorname{d}\!{t}}{S^{*}(t\mid X)}=\left\{1-e^{-\theta\sum_{l=1}^{L}p_{l}S_{l}(t)}\right\}^{-1}\theta\sum_{l=1}^{L}p_{l}f_{l}(t).
\end{align*}

Similar to Chen et al. (1999), we also have

h_{pop}(t|X)=h^{*}(t|X)\left\{1-e^{-\theta\sum_{l=1}^{L}p_{l}S_{l}(t)}\right\}\leq h^{*}(t|X),

i.e. the hazard function of the noncured population is never smaller than that of the entire population. We can also obtain the population cumulative hazard function $H_{pop}(t)=\int_{0}^{t}h_{pop}(s)\operatorname{d}\!{s}=\theta\sum_{l=1}^{L}p_{l}F_{l}(t)$, and $S_{pop}(t)=e^{-H_{pop}(t)}$. Fig. 2 shows the population survival curves and population hazards of the GPTCM when assuming two clusters and Weibull distributed survival. It is interesting that the population hazard function of the GPTCM can be multimodal, because $h_{pop}(t|X)$ is proportional to a mixture of $L$ density functions. This goes beyond the basic shapes of the hazard function [e.g. constant, decreasing, increasing, unimodal (up-then-down), or bathtub (down-then-up) shape] (Christen and Rubio, 2025).
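The hazard expression $h_{pop}(t)=\theta\sum_{l}p_{l}f_{l}(t)$ and the possibility of multimodality can be explored numerically. The sketch below assumes two Weibull clusters with a common shape and well-separated scales; the parameter values and function names are illustrative assumptions and do not reproduce the settings behind Fig. 2. The hazard is compared with a numerical derivative of $-\log S_{pop}(t)$, and local maxima are counted on a grid.

```python
import math


def weibull_pdf(t, scale, shape):
    """Weibull density (shape/scale) * (t/scale)^(shape-1) * exp(-(t/scale)^shape)."""
    return (shape / scale) * (t / scale) ** (shape - 1.0) * math.exp(-((t / scale) ** shape))


def pop_hazard(t, theta, props, scales, shape):
    """h_pop(t) = theta * sum_l p_l f_l(t)."""
    return theta * sum(p * weibull_pdf(t, lam, shape) for p, lam in zip(props, scales))


def log_pop_surv(t, theta, props, scales, shape):
    """log S_pop(t) = -theta * (1 - sum_l p_l S_l(t)), from Eq. (3)."""
    mix = sum(p * math.exp(-((t / lam) ** shape)) for p, lam in zip(props, scales))
    return -theta * (1.0 - mix)


def count_local_maxima(values):
    """Number of interior grid points strictly larger than both neighbours."""
    return sum(values[i - 1] < values[i] > values[i + 1]
               for i in range(1, len(values) - 1))


# Illustrative (assumed) values: two clusters with well-separated scales.
theta, props, scales, shape = 1.0, (0.5, 0.5), (1.0, 4.0), 4.0
grid = [0.01 * i for i in range(1, 801)]
hazard = [pop_hazard(t, theta, props, scales, shape) for t in grid]
n_modes = count_local_maxima(hazard)

# Central difference of -log S_pop as an independent check of the hazard.
t0, eps = 1.5, 1e-5
numeric = -(log_pop_surv(t0 + eps, theta, props, scales, shape)
            - log_pop_surv(t0 - eps, theta, props, scales, shape)) / (2.0 * eps)
print(n_modes, numeric, pop_hazard(t0, theta, props, scales, shape))
```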

An importance measure of cluster-specific survival can provide valuable insight for developing effective strategies to improve, or intervene in, the entire system, and is applicable to both biomedical applications and reliability engineering. Such measures help to identify which clusters should receive attention in survival improvement efforts. The population survival function at a given time $t$ is expressed as a function of the $L$ clusters’ survival at that time, i.e.

S_{pop}(t)=f(S_{1}(t),S_{2}(t),...,S_{L}(t)).

The Birnbaum measure (Birnbaum, 1969) can be used to evaluate the survival importance of the different clusters, and is given by

\frac{\partial S_{pop}(t)}{\partial S_{l}(t)}=S_{pop}(t)\theta p_{l},\ l=1,...,L.

Note that the Birnbaum measure does not account for any time-dependent covariate. A systematic overview of different importance measures can be found in Wu and Coolen (2022).
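The Birnbaum measure stated above follows from differentiating (3) with respect to $S_{l}(t)$, and it can be sanity-checked with finite differences. The sketch below treats the cluster survival values at a fixed time as plain numbers; the inputs and function names are illustrative assumptions.

```python
import math


def pop_surv_from_clusters(cluster_surv, theta, props):
    """S_pop as a function of the cluster survival values (S_1, ..., S_L), Eq. (3)."""
    mix = sum(p * s for p, s in zip(props, cluster_surv))
    return math.exp(-theta * (1.0 - mix))


def birnbaum_measure(cluster_surv, theta, props):
    """Analytic partial derivatives dS_pop/dS_l = S_pop * theta * p_l."""
    s_pop = pop_surv_from_clusters(cluster_surv, theta, props)
    return [s_pop * theta * p for p in props]


def birnbaum_finite_difference(cluster_surv, theta, props, eps=1e-6):
    """Central finite differences, used only to check the analytic form."""
    grads = []
    for l in range(len(props)):
        up, down = list(cluster_surv), list(cluster_surv)
        up[l] += eps
        down[l] -= eps
        grads.append((pop_surv_from_clusters(up, theta, props)
                      - pop_surv_from_clusters(down, theta, props)) / (2.0 * eps))
    return grads


# Illustrative (assumed) values at one fixed time point.
theta, props, cluster_surv = 1.8, (0.5, 0.3, 0.2), (0.4, 0.7, 0.9)
analytic = birnbaum_measure(cluster_surv, theta, props)
numeric = birnbaum_finite_difference(cluster_surv, theta, props)
print(analytic, numeric)
```

Because the measure equals $S_{pop}(t)\theta p_{l}$, at any fixed $t$ the ranking of clusters under this measure coincides with the ranking of their proportions $p_{l}$.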

3 Simulation study

We provide insights into the parameter estimation of the GPTCM proposed in Section 2.1 by using Monte Carlo simulations. We consider sample sizes of $n=200,\;500$ and $1000$. Each sample/patient has two clinical covariates (i.e. one row of the clinical data matrix $\mathbf{X}_{0}\in\mathbb{R}^{n\times 2}$), and has cells belonging to $L=3$ tumor cell subtypes, with each subtype having two cell-type-specific covariates (i.e. one row of the data matrix $\mathbf{X}_{l}\in\mathbb{R}^{n\times 2}$, $l\in\{1,...,L\}$). Each sample also has tumor cell subtype proportions data (i.e. one row of the proportions data matrix $\mathbf{p}\in[0,1]^{n\times 3}$). Every covariate is generated independently from the standard normal distribution, except the first clinical variable, which is generated from the Bernoulli distribution. The tumor cell subtype proportions of each sample are generated independently from the Dirichlet distribution. The survival times are generated based on the population survival function (3) using the rate parameter $\theta=\exp(\xi_{0}+\mathbf{X}_{0}\bm{\xi})$ and Weibull distributed survival functions with mean parameters $\log\bm{\mu}_{l}=\mathbf{X}_{l}\bm{\beta}_{l}$, $l\in\{1,...,L\}$. Censoring is generated through an exponential distribution with approximately $50\%$ censoring rate. The true values of all parameters are shown in Table 1.
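A data-generating process of this kind can be sketched as below. This is not the authors' simulation code: it draws survival times through the latent first-activation mechanism (which has population survival function (3)) rather than by inverting (3), and the Bernoulli probability, the Dirichlet concentration, the censoring rate, and the mapping of the true values in Table 1 to $(\xi_{0},\bm{\xi})$ are all assumptions made for illustration.

```python
import math
import random


def sample_poisson(mean, rng):
    """Poisson draw by Knuth's multiplication method (adequate for small means)."""
    limit, count, prod = math.exp(-mean), 0, rng.random()
    while prod > limit:
        count += 1
        prod *= rng.random()
    return count


def sample_dirichlet(alpha, rng):
    """Dirichlet draw by normalizing independent gamma variates."""
    gammas = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(gammas)
    return [g / total for g in gammas]


def simulate_patient(rng, xi0, xi, betas, kappa, censor_rate, alpha):
    """One right-censored observation from the GPTCM (first-activation scheme)."""
    # Clinical covariates: Bernoulli(0.5) (assumed probability) and standard normal.
    x0 = [float(rng.random() < 0.5), rng.gauss(0.0, 1.0)]
    props = sample_dirichlet(alpha, rng)
    theta = math.exp(xi0 + sum(a * b for a, b in zip(x0, xi)))
    event_time, cluster_x = math.inf, []
    for p, beta in zip(props, betas):
        x_l = [rng.gauss(0.0, 1.0) for _ in beta]
        cluster_x.append(x_l)
        mu_l = math.exp(sum(a * b for a, b in zip(x_l, beta)))
        lam_l = mu_l / math.gamma(1.0 + 1.0 / kappa)
        # N_l ~ Poisson(theta * p_l) cells, each with a Weibull promotion time.
        for _ in range(sample_poisson(theta * p, rng)):
            event_time = min(event_time, rng.weibullvariate(lam_l, kappa))
    censor_time = rng.expovariate(censor_rate)
    return {"time": min(event_time, censor_time),
            "delta": int(event_time <= censor_time),
            "x0": x0, "props": props, "cluster_x": cluster_x}


rng = random.Random(123)
betas = ((0.40, -0.30), (0.25, -0.45), (-0.20, 0.30))
data = [simulate_patient(rng, xi0=-0.80, xi=(0.90, 0.60), betas=betas,
                         kappa=math.exp(1.10), censor_rate=0.5, alpha=(1.0, 1.0, 1.0))
        for _ in range(200)]
print(sum(d["delta"] for d in data) / len(data))  # observed event proportion
```

Cured patients (all $N_{l}=0$) have an infinite event time and are therefore always censored. In practice the hypothetical `censor_rate` would need to be tuned to reach the targeted censoring proportion.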

Maximum likelihood (ML) estimation maximizes the likelihood function for right-censored survival data (or, equivalently, its logarithm), i.e.

\mathcal{L}(\bm{\vartheta}|\mathcal{D})=\prod_{i=1}^{n}f_{pop}(t_{i}|\mathbf{X}_{0},\mathbf{X}_{1},\mathbf{X}_{2},\mathbf{X}_{3},\mathbf{p})^{\delta_{i}}S_{pop}(t_{i}|\mathbf{X}_{0},\mathbf{X}_{1},\mathbf{X}_{2},\mathbf{X}_{3},\mathbf{p})^{1-\delta_{i}},

where $\bm{\vartheta}$ consists of all unknown parameters and $\mathcal{D}=\{t_{i},\delta_{i},\mathbf{X}_{0},\mathbf{X}_{1},\mathbf{X}_{2},\mathbf{X}_{3},\mathbf{p}\}_{i=1}^{n}$ consists of all data information, including each sample’s observed survival time $t_{i}$, censoring indicator $\delta_{i}$, covariates, and cell subtype proportions. The optimization is performed with the R function nlminb, using the adaptive nonlinear least-squares algorithm. We repeat the scenario for each sample size $1000$ times to obtain the mean and standard error of the ML estimates.
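Because $f_{pop}(t)=\theta f(t)e^{-\theta F(t)}$ and $S_{pop}(t)=e^{-\theta F(t)}$, each subject's log-likelihood contribution simplifies to $\delta_{i}\{\log\theta_{i}+\log f(t_{i})\}-\theta_{i}F(t_{i})$. The sketch below evaluates this for Weibull clusters; it is an illustrative Python translation under stated assumptions (the paper's fit uses R), and the function names and example record are hypothetical.

```python
import math


def cluster_terms(t, props, scales, shape):
    """Return F(t) = sum_l p_l F_l(t) and f(t) = sum_l p_l f_l(t) for Weibull clusters."""
    big_f = small_f = 0.0
    for p, lam in zip(props, scales):
        z = (t / lam) ** shape
        big_f += p * (1.0 - math.exp(-z))
        small_f += p * (shape / t) * z * math.exp(-z)  # Weibull density, t > 0
    return big_f, small_f


def loglik_contribution(t, delta, theta, props, scales, shape):
    """delta * log f_pop(t) + (1 - delta) * log S_pop(t) for one subject."""
    big_f, small_f = cluster_terms(t, props, scales, shape)
    return delta * (math.log(theta) + math.log(small_f)) - theta * big_f


def gptcm_loglik(records):
    """Sum of the contributions over subjects given as dictionaries."""
    return sum(loglik_contribution(**r) for r in records)


# Hypothetical records: one event and one censored observation.
records = [
    {"t": 1.2, "delta": 1, "theta": 1.5, "props": (0.5, 0.3, 0.2),
     "scales": (1.0, 2.0, 4.0), "shape": 3.0},
    {"t": 2.5, "delta": 0, "theta": 0.8, "props": (0.2, 0.2, 0.6),
     "scales": (1.5, 2.5, 3.0), "shape": 3.0},
]
print(gptcm_loglik(records))
```

In a full fit, the subject-specific `theta` and `scales` would be computed from the covariates and the parameter vector $\bm{\vartheta}$, and the negative of `gptcm_loglik` would be passed to a general-purpose numerical optimizer.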

Table 1 shows that the mean squared error (MSE) of each estimate decreases as the sample size increases, as expected. The ML estimates of all parameters, except for the Weibull shape parameter $\log(\kappa)$, are close to their true values. However, further investigations with more simulation scenarios (e.g. more covariates and model misspecification) and applications to real data are needed to better understand the implications of the proposed model.

Table 1: Simulation results with maximum likelihood estimates (standard errors in parentheses) and the mean squared errors for different sample sizes

Parameter	Truth	Estimate ($n=200$)	MSE ($n=200$)	Estimate ($n=500$)	MSE ($n=500$)	Estimate ($n=1000$)	MSE ($n=1000$)
$\log(\kappa)$	1.10	1.69 (0.098)	0.361	1.64 (0.060)	0.293	1.62 (0.041)	0.271
$\xi_{1}$	-0.80	-0.81 (0.193)	0.037	-0.80 (0.120)	0.014	-0.79 (0.082)	0.007
$\xi_{2}$	0.90	1.05 (0.242)	0.080	1.05 (0.150)	0.044	1.03 (0.100)	0.028
$\xi_{3}$	0.60	0.71 (0.134)	0.031	0.71 (0.080)	0.020	0.71 (0.054)	0.015
$\beta_{11}$	0.40	0.29 (0.086)	0.018	0.29 (0.049)	0.014	0.29 (0.034)	0.013
$\beta_{12}$	-0.30	-0.22 (0.084)	0.013	-0.22 (0.049)	0.009	-0.22 (0.034)	0.008
$\beta_{21}$	0.25	0.18 (0.076)	0.010	0.18 (0.046)	0.007	0.18 (0.032)	0.005
$\beta_{22}$	-0.45	-0.33 (0.084)	0.021	-0.33 (0.051)	0.017	-0.33 (0.036)	0.016
$\beta_{31}$	-0.20	-0.15 (0.075)	0.008	-0.15 (0.048)	0.005	-0.15 (0.033)	0.004
$\beta_{32}$	0.30	0.22 (0.080)	0.012	0.22 (0.050)	0.008	0.22 (0.035)	0.007

4 Conclusion

We have presented a new promotion time cure model, the GPTCM, which generalizes the classical PTCM. The new formulation includes a mixture of survival distributions that is motivated by the biological intra-tumor heterogeneity of patients rather than by the mathematical construction of a mixture model. The new modeling framework is flexible enough to model survival data with intra-sample heterogeneity in biomedicine or intra-system heterogeneity in reliability engineering. Note that both the PTCM and GPTCM assume that the latent variable $N$ (i.e. number of clonogenic cells) is independent of time $t$. Cha and Finkelstein (2018) presented various shock models, including the PTCM as a special case, from the counting process point of view. Therefore, a future direction for extending the GPTCM is to model the dynamics of the number of clonogenic cells by treating $N(t)$ as a counting process, which can better mimic tumor evolution.

Acknowledgments

This work was supported by the University of Oslo innovation funds, ERA PerMed under the ERA-NET Cofund scheme of the European Union’s Horizon 2020 research and innovation framework program (grant ‘SYMMETRY’ ERAPERMED2021-330). The authors would like to thank Manuela Zucknick for discussions.

Appendix A Mixture survival model

When we assume only one tumor cell left active after an initial treatment, this tumor cell belongs to one of the $L$ tumor cell subtypes, with probability $\mathbb{P}(N_{l}=1)=p_{l}$ , $l\in\{1,2,...,L\}$ , $\sum_{l=1}^{L}p_{l}=1$ . Using the same notations as Section 2.1, now the population survival function is

\begin{align*}
S_{pop}(t) &= \mathbb{P}(N=0)+\mathbb{P}\left(\bm{W}_{1}>t,\ldots,\bm{W}_{L}>t,N=\sum_{l=1}^{L}N_{l}=1\right)\\
&= \mathbb{P}(N=0)+\sum_{l=1}^{L}\mathbb{P}(\bm{W}_{l}>t\mid N_{l}=1)\cdot\mathbb{P}(N_{l}=1)\\
&= \mathbb{P}(N=0)+\sum_{l=1}^{L}p_{l}S_{l}(t).
\end{align*}

This is a classical mixture model. However, the assumption with only one tumor cell left active after an initial treatment is not biologically meaningful in cancer research.

Appendix B Last-activation scheme

Our approach in Section 2.1 can be adapted straightforwardly for the last-activation scheme. The population survival function is now

\begin{align*}
\tilde{S}_{pop}(t) &= \mathbb{P}(N=0)+\mathbb{P}\left(\max\{\max\{\bm{W}_{1}\},\ldots,\max\{\bm{W}_{L}\}\}>t,N>0\right)\\
&= \mathbb{P}(N=0)+\mathbb{P}\left(\max\{\max\{\bm{W}_{1}\},\ldots,\max\{\bm{W}_{L}\}\}>t\mid N>0\right)\mathbb{P}(N>0)\\
&= \mathbb{P}(N=0)+\left[1-\mathbb{P}\left(\max\{\max\{\bm{W}_{1}\},\ldots,\max\{\bm{W}_{L}\}\}\leq t\mid N>0\right)\right]\mathbb{P}(N>0)\\
&= \mathbb{P}(N=0)+\mathbb{P}(N>0)-\mathbb{P}\left(\max\{\max\{\bm{W}_{1}\},\ldots,\max\{\bm{W}_{L}\}\}\leq t\mid N>0\right)\mathbb{P}(N>0)\\
&= 1-\mathbb{P}(\max\{\bm{W}_{1}\}\leq t,\ldots,\max\{\bm{W}_{L}\}\leq t,N>0)\\
&= 1-\sum_{k=1}^{\infty}\mathbb{P}\left(\max\{\bm{W}_{1}\}\leq t,\ldots,\max\{\bm{W}_{L}\}\leq t\mid N=k\right)\cdot\mathbb{P}(N=k)\\
&= 1-\sum_{k=1}^{\infty}\sum_{\begin{subarray}{c}\text{All config.}\\ (N_{1},...,N_{L})\\ \text{with sum }k\end{subarray}}\mathbb{P}\left(\max\{\bm{W}_{1}\}\leq t,\ldots,\max\{\bm{W}_{L}\}\leq t\mid N=k,N_{1},...,N_{L}\right)\cdot\mathbb{P}(N=k,N_{1},...,N_{L}).
\end{align*}

Denote the component-specific cdf as $F_{l}(t)=1-S_{l}(t)$ . By using the multinomial-Poisson transformation, the multinomial theorem and the power series of the exponential function, we obtain

\begin{align*}
\tilde{S}_{pop}(t) &= 1-\sum_{k=1}^{\infty}\sum_{\begin{subarray}{c}\text{All config.}\\ (N_{1},...,N_{L})\\ \text{with sum }k\end{subarray}}\prod_{l=1}^{L}\mathbb{P}(\max\{\bm{W}_{l}\}\leq t\mid N_{l})\cdot\mathcal{P}ois(k;\theta)\,\mathcal{M}ult(k;\bm{p})\\
&= 1-\sum_{k=1}^{\infty}\sum_{\begin{subarray}{c}\text{All config.}\\ (N_{1},...,N_{L})\\ \text{with sum }k\end{subarray}}\prod_{l=1}^{L}F_{l}(t)^{N_{l}}\cdot\mathcal{P}ois(k;\theta)\,\mathcal{M}ult(k;\bm{p})\\
&= 1-\left\{e^{\theta\sum_{l=1}^{L}p_{l}F_{l}(t)}-1\right\}e^{-\theta}\\
&= 1+e^{-\theta}-e^{-\theta\{1-\sum_{l=1}^{L}p_{l}F_{l}(t)\}}\\
&= 1+e^{-\theta}-e^{-\theta\sum_{l=1}^{L}p_{l}S_{l}(t)}.
\end{align*}
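For completeness, the step from the double sum to the exponential form, which the derivation above compresses, can be spelled out. It mirrors the corresponding steps in Section 2.1 with $S_{l}(t)$ replaced by $F_{l}(t)$, and uses only the multinomial theorem and the power series of the exponential function:

```latex
\begin{align*}
\sum_{k=1}^{\infty}\sum_{\begin{subarray}{c}(N_{1},...,N_{L})\\ \text{with sum }k\end{subarray}}
  \prod_{l=1}^{L}F_{l}(t)^{N_{l}}\cdot\frac{\theta^{k}e^{-\theta}}{k!}\cdot
  \frac{k!}{N_{1}!\cdots N_{L}!}\,p_{1}^{N_{1}}\cdots p_{L}^{N_{L}}
&= \sum_{k=1}^{\infty}\frac{\theta^{k}e^{-\theta}}{k!}
  \sum_{\begin{subarray}{c}(N_{1},...,N_{L})\\ \text{with sum }k\end{subarray}}
  \frac{k!}{N_{1}!\cdots N_{L}!}\prod_{l=1}^{L}\{p_{l}F_{l}(t)\}^{N_{l}}\\
&= \sum_{k=1}^{\infty}\frac{\theta^{k}e^{-\theta}}{k!}
  \Big\{\sum_{l=1}^{L}p_{l}F_{l}(t)\Big\}^{k}\\
&= e^{-\theta}\Big\{e^{\theta\sum_{l=1}^{L}p_{l}F_{l}(t)}-1\Big\}.
\end{align*}
```

The final expression then follows from $\sum_{l=1}^{L}p_{l}F_{l}(t)=1-\sum_{l=1}^{L}p_{l}S_{l}(t)$, since the proportions sum to one.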

References

Amico, M. and Keilegom, I. V. (2018). Cure models in survival analysis. Annual Review of Statistics and Its Application, 5(1):311–342.
Birnbaum, Z. W. (1969). On the importance of different components in a multicomponent system. In Krishnaiah, P., editor, Multivariate Analysis - II, pages 581–592, New York, USA. Academic Press.
Brookmeyer, R. and Damiano, A. (1989). Statistical methods for short-term projections of AIDS incidence. Statistics in Medicine, 8(1):23–34.
Cha, J. H. and Finkelstein, M. (2018). Point Processes for Reliability Analysis. Springer.
Chen, M.-H., Ibrahim, J. G., and Sinha, D. (1999). A new Bayesian model for survival data with a surviving fraction. Journal of the American Statistical Association, 94(447):909–919.
Christen, J. and Rubio, F. (2025). On harmonic oscillator hazard functions. Statistics & Probability Letters, 217:110304.
Cooner, F., Banerjee, S., Carlin, B. P., and Sinha, D. (2007). Flexible cure rate modeling under latent activation schemes. Journal of the American Statistical Association, 102(478):560–572.
Greenacre, M. (2021). Compositional data analysis. Annual Review of Statistics and Its Application, 8(1):271–299.
Gómez, Y. M., Gallardo, D. I., Bourguignon, M., Bertolli, E., and Calsavara, V. F. (2023). A general class of promotion time cure rate models with a new biological interpretation. Lifetime Data Analysis, 29(1):66–86.
Hanin, L. and Huang, L.-S. (2014). Identifiability of cure models revisited. Journal of Multivariate Analysis, 130:261–274.
Jiang, R. and Murthy, D. (1995). Modeling failure-data by mixture of 2 Weibull distributions: a graphical approach. IEEE Transactions on Reliability, 44(3):477–488.
Mangiola, S., Roth-Schulze, A. J., Trussart, M., Zozaya-Valdés, E., Ma, M., Gao, Z., Rubin, A. F., Speed, T. P., Shim, H., and Papenfuss, A. T. (2023). sccomp: Robust differential composition and variability analysis for single-cell data. Proceedings of the National Academy of Sciences, 120(33).
Rosen, O. and Tanner, M. (1999). Mixtures of proportional hazards regression models. Statistics in Medicine, 18(9):1119–1131.
Trapnell, C. (2015). Defining cell types and states with single-cell genomics. Genome Research, 25(10):1491–1498.
Wei, Y. and Liu, S. (2023). Reliability analysis of series and parallel systems with heterogeneous components under random shock environment. Computers & Industrial Engineering, 179:109214.
Wu, S. and Coolen, F. (2022). Importance measures in reliability engineering: An introductory overview. In Salhi, S. and Boylan, J., editors, The Palgrave Handbook of Operations Research, pages 659–674, Cham, Switzerland. Springer.
Yakovlev, A. (1996). Threshold models of tumor recurrence. Mathematical and Computer Modelling, 23(6):153–164.
