Joint and Survivor Annuity Valuation With A Bivariate Reinforced Urn Process
Joint and Survivor Annuity Valuation With A Bivariate Reinforced Urn Process
a r t i c l e i n f o a b s t r a c t
Article history: We introduce a novel way of modeling the dependence of coupled lifetimes, for the pricing of joint and
Received July 2020 survivor annuities. Using a well-known Canadian data set, our results are analyzed and compared with
Received in revised form February 2021 the existing literature, mainly relying on copulas. Based on urn processes and a one-factor construction,
Accepted 6 April 2021
the proposed model is able to improve its performances over time, in line with the machine learning
Available online 20 April 2021
paradigm, and it also allows for the use of experts’ judgements, to complement the empirical data.
JEL classification: © 2021 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license
G22 (http://creativecommons.org/licenses/by/4.0/).
C11
E17
Keywords:
Annuity
Bivariate survival function
Reinforced urn process
Bayesian nonparametrics
Right-censoring
https://doi.org/10.1016/j.insmatheco.2021.04.004
0167-6687/© 2021 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
of an a priori is not compulsory, it is only a relevant plus, and a We can then look at the probability that at least one of the two
B-RUP can also be used in an “objective” empirically-based way. annuitants survives another k years, i.e.
Another interesting feature of reinforced urn processes, when (x, y )
seen through statistical learning glasses, is that they are somehow P L S (k) := P [( X > x + k) ∪ (Y > y + k)| X > x, Y > y ]
immune to the “black box” argument (Knight, 2017) commonly (x, y ) (x, y ) (x, y )
= SX (k) + S Y (k) − S X Y (k, k), (2)
used to criticize machine/deep learning. In fact, differently from
other approaches, a B-RUP can be controlled and studied in its where the subscript L S is the acronym of “last survivor”, with
(x, y )
probabilistic details, controlling its features and allowing for fine S Y (k) = P (Y > y + k| X > x, Y > y ) indicating the marginal sur-
tuning. vival probability of the second annuitant given the present situa-
(x, y )
The paper is organized as follows. In Section 2 we briefly re- tion of the couple, and similarly S X (k) for the first annuitant.
call joint and last-survivor annuities, while in Section 3 we shortly It is important to stress that, in Equation (2), both marginal prob-
deal with the problem of right-censoring. In Section 4, we describe abilities are conditioned on the ages of both annuitants. This is
the B-RUP model, focusing on its ability of dealing with censored because, in the general situation where the lifetimes X and Y
(x, y ) (x,0)
observations. In Section 5, we discuss our results, using artificial share some kind of dependence, we have that S X (k) = S X (k)
and real data, and providing suggestions on how to use the B-RUP (Youn et al., 2002).
in practice, in terms of prior elicitation and parameter tuning. The If we assume a constant interest rate r, the basic one-unit pric-
real data are the same used by Frees et al. (1996), that we thank ing formula for a 100% joint and last-survivor annuity is given by
for sharing them with us, and that will be a benchmark for us. In-
∞ (x, y )
terestingly, as we shall see, the B-RUP is able to obtain the same P LS(k)
C (x, y ) = . (3)
results of Frees et al. (1996), while guaranteeing more flexibility. (1 + r )k
k =0
Finally, Section 6 concludes the work and looks at future research.
An appendix extends some aspects of Section 5. In the 100p% situation, with 0 < p < 1, this formula is easily ad-
justed (Winklevoss, 1977).
2. Joint and survivor annuities Equations (2) and (3) imply that, in order to evaluate joint and
survivor annuities, one needs to specify the dependence between
An annuity is a financial contract between two counterparties: the lifetimes of the annuitants. If we assume that their lifetimes
(x, y ) (x) ( y)
an individual, called the annuitant, and a financial institution, like are independent, then S X Y (k, k) = S X (k) S Y (k), and the estima-
an insurance company. In such a contract, the former pays a given tion of the marginal probabilities of survival is all that is needed.
amount of money (in full or periodically) to the latter and, in re- Since the estimation of univariate survival functions is a widely
turn, they receive periodic payments, usually starting from a given studied and well-known topic (e.g. Cox and Oakes (1984)), pricing
time point in the future, and commonly until death. The goal annuities under independence makes both modeling and compu-
of annuities is therefore to provide the annuitant with a steady tations considerably easier (Winklevoss, 1977), but not necessarily
stream of income, typically during retirement, and to offer in- realistic.
surance against lifetime uncertainty. It goes without saying that That of independence is indeed always a strong assumption. As
annuities are strictly connected to private pensions and other sim- Frees et al. (1996); Carriere (2000); Luciano et al. (2008); Sanders
ilar products (Sheshinski, 2007). and Melenberd (2016) and others have shown, the lifetimes of
A simple and quite popular extension of the single annuitant several couples in annuity contracts present a non-negligible pos-
case we have just described is given by annuities involving more itive dependence. Moreover, some studies suggest that annuitants
annuitants, most of the times just two. These are often the mem- tend to have higher individual survival probabilities with respect
bers of a married couple or of a partnership, but parent-child and to people who do not buy annuities (Mitchell et al., 1999). Such a
relative-relative combinations are also common (Frees et al., 1996; result opens to the possibility that also the dependence structure
Brown and Poterba, 2000). An example is offered by joint and last- among couples that buy annuities, and couples that do not, dif-
survivor annuities, in which the insurance company pays as long fers. Although there is no conclusive study about differences in the
as at least one of the two annuitants is alive. In this kind of con- coupled lifetimes of annuitants versus non-annuitants, Sanders and
tracts, the death of one of the annuitants may have an impact on Melenberd (2016) observe that the magnitude of the positive de-
the stream of payments that the survivor receives. For instance, for pendence for a random subset of 50, 000 couples–not necessarily
p ∈ [0, 1], in a 100p% (e.g. 50%) joint and survivor annuity contract, annuitants–sampled from the whole Dutch population is signifi-
payments are made in full while both annuitants are alive, but if cantly smaller than what Frees et al. (1996) and Luciano et al.
one of the two dies, only 100p% (e.g. 50%) of the original amount is (2008) find for about 15000 couples of annuitants in Canada.1
paid to the survivor (Sheshinski, 2007; Winklevoss, 1977). A pop- Regardless of its magnitude, there are several reasons that ex-
ular version is the contingent joint and survivor annuity, which plain, at least qualitatively, why the dependence in the lifetimes
pays the full benefit as long as the first annuitant (plan member) of couples (especially married couples and partnerships) is posi-
tive. Some of them are rather intuitive, like the possibility of both
is alive, and then changes the benefit to a portion which is usually
individuals dying at the same time because of an accident or a
one-half or two-thirds.
contagious disease, or the common habits they might share as a
From an actuarial point of view, to evaluate a joint and sur-
consequence of living together, like the food they eat or the en-
vivor annuity, one needs the marginal and joint survival functions
vironment they live in. Others are more linked to emotional, psy-
of both annuitants. Let X and Y be two random variables rep-
chological and medical aspects, like the “broken heart syndrome”,
resenting the lifetimes of two annuitants in a joint and survivor a well-known phenomenon in medicine, in which the sudden pass-
annuity. Assume that today the age (say in years) of the first an- ing of one spouse greatly increases (at least temporarily) the prob-
nuitant is observed to be x, while for the second we have y. The ability of death of the surviving one because of bereavement and
joint probability of survival for another k years, given the current sadness (Jagger and Sutton, 1991).
ages, is
(x, y )
S X Y (k, k) := P ( X > x + k, Y > y + k| X > x, Y > y ), (1) 1
Naturally an analysis taking into consideration the differences in the two coun-
tries and the related markets would be necessary to clean the effects, but the idea
where k = 0, 1, ..., K . of some selection mechanism does not seem ludicrous.
175
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
A question that naturally arises, when we consider the several copulas, but, at least within the realm of positive dependence, it
possibilities for the modeling of joint mortality, is how these affect also allows for a more flexible modeling, as it improves its per-
the annuity price in Equation (3). This question is well-addressed formances over time, and it allows for the exploitation of experts’
in Frees et al. (1996), where they compute the ratio between the judgements.3
annuity price under positive dependence and the price under inde- To describe the model, we start by defining its main building
pendence (we shall call it annuity ration from now on). To model blocks: the beta-Stacy process of Walker and Muliere (1997), and
dependence, they make use of copula models with Gompertz and the Reinforced Urn Process (RUP) of Muliere et al. (2000). Then
Weibull marginals, and they find out that the annuity ratio varies we show an intuitive representation of RUPs via urns. Finally we
between approximately 0.95 and 1.05, depending on the initial discuss how several RUPs can be easily combined into a one-factor
ages of the annuitants at contract initiation. This means that, if construction leading to the B-RUP.
one assumes independence to simplify modeling, they may end
up with the underestimation or the overestimation of the annu- 4.1. Reinforced Urn Processes and beta-Stacy processes
ity price by as much as a 5%. Compatible results are more recently
obtained by Sanders and Melenberd (2016), even if the error they The Reinforced Urn Process was first described in Walker and
find is smaller than 5% on average, in absolute terms, for different Muliere (1997), where the results of Blackwell and MacQueen
types of contracts. Furthermore, the same authors underline that (1973) and Ferguson (1973) are extended to right-censored data,
the error can be particularly larger and relevant for specific types thus defining the so-called beta-Stacy process, a neutral-to-the-
of annuities, in which the joint survival distribution plays a major right (Doksum, 1974) generalization of the Dirichlet process (Fer-
role, like for instance joint annuities. guson, 1973).
The RUP is a combinatorial stochastic process, and it can be
3. Right-censoring seen as a reinforced random walk over a state space of urns. De-
pending on how its parameters are specified, it can generate a
In survival analysis, right-censoring is a well-known problem large number of interesting models. Essential references on the
in estimation (Kaplan and Meier, 1958). This usually occurs when topic are Muliere et al. (2000, 2003) and Fortini and Petrone
individuals drop from the study before the event of interest has (2012). Examples of applications can be found for example in Cir-
happened. If X is our age variable, when right-censoring occurs, illo et al. (2013) and Peluso et al. (2015).
the minimum X ∗ = min( X , T X ) between X and a random censor- In this paper we specify a reinforced urn process able to gener-
ing time T X is observed.2 Every potentially censored observation ate a discrete beta-Stacy process, a particular random distribution
is therefore represented by the couple ( X ∗ , δ), where the indica- over the space of discrete distributions.
tor δ tells if a given observation is censored (δ = 0) or not (δ = 1).
Left-censoring can also be defined, but it is less relevant here. For Definition 4.1 (Walker and Muliere (1997)). A random distribution
more details see Klein and Moeschberger (2003). function F is a discrete beta-Stacy process with jumps at j ∈ N
While univariate censoring has been extensively studied in the and parameters {β j , ω j ∈ R+ , j ∈ N}, if there exist mutually in-
literature, e.g. in Kaplan and Meier (1958); Elandt-Johnson and dependent random variables { V j } j ∈N , each beta distributed with
Johnson (1980) or Cox and Oakes (1984), far less progress has been parameters (β j , ω j ), such that therandom mass assigned by F to
made in the bivariate case. One of the first works to study this { j }, written F ({ j }), is given by V j i < j (1 − V i ).
problem is that of Dabrowska (1988), in which the Kaplan-Meier
estimator is extended to the two-dimensional framework. How- Following Definition 4.1, we introduce couples {β j , ω j } ∈
ever, this estimator is known to produce negative masses when R+ × R + , with j ∈ N , such that β , ω ≥ 0, β + ω > 0, and
j j j j
a large amount of censoring is present (Pruitt, 1991). In Wang and n ω
limn→∞ j =0 β +jω = 0. Then, given a discrete beta-Stacy process
j j
Wells (1997) or Gribkova and Lopez (2015), the censoring condi-
F with parameters {β j , ω j ∈ R+ , j ∈ N}, and a sample ( X n∗ , δn ) of
tions are relaxed to obtain better estimators. For example, in the
context of coupled lifetimes, Gribkova and Lopez (2015) assume exchangeable4 and potentially censored observations, with X n∗ =
that the age difference between individuals is known and related { Xn∗ , n ≥ 1}, the sequence X n = { Xn , n ≥ 1} is a RUP if
to the censoring variables; in Lin (1993), conversely, the censoring
Ŝ X (x) = P ( X n+1 > x| X n∗ = xn∗ , δn = dn )
variable is taken to be the same for both lifetimes. Other recent
articles on nonparametric estimators under bivariate censoring are
x
β j + m∗j (xn∗ , dn )
= 1− , (4)
those of Lopez (2012) and Shen and Yan (2008). In all these works, β j + ω j + s j (xn∗ )
censoring is assumed to be independent from the variables of in- j =0
n
terest, an assumption we will also make in our construction. where m∗j (xn , dn ) = i =1 1{xi = j ,di =1} is the number of exact ob-
In line with the univariate case, in the presence of bivariate n
servations at x = j and s j (xn ) = i =1 1{xi ≥ j } is the number of
censoring one plays with the quartet ( X ∗ , δ, Y ∗ , ), where is the
observations at x ≥ j. In the context of survival analysis, Equation
censoring variable associated to Y , and Y ∗ = min(Y , T Y ). The vari-
(4) gives the probability of survival of a new individual ( X n+1 ),
ables ( X , Y ) are assumed independent from ( T X , T Y ).
conditioned on the lifetime observations of previous individuals
( X n∗ ).
4. The Bivariate Reinforced Urn Process
Notice that, by defining β ∗j = β j + m∗j (xn , dn ) and ω∗j = ω j +
The Bivariate Reinforced Urn Process (B-RUP) is our proposal for s j (xn ) − m∗j (xn , dn ), we can obtain a new beta-Stacy process F ∗
the modeling of joint and survivor annuities. Originally introduced
by Bulla (2005); Bulla et al. (2007), it is here extended and adapted
3
to the annuity framework. It is worth noticing that copulas can also deal with negative dependence, so
they are more flexible in that sense. However, when positive dependence can be
We will show that this model is not only able to replicate
safely assumed, the B-RUP needs less and weaker hypotheses about the shape of
the results one can obtain with other existing methodologies, like the joint distribution.
4
Exchangeability is a common assumption in Bayesian statistics (de Finetti,
2017), and it represents a relaxation of the stronger hypothesis of independence.
2
A common assumption (Gribkova and Lopez, 2015) is that the support of the A sequence of random variables is exchangeable if their joint distribution is im-
distribution of X is included in the support of the censoring variable T X . mune to permutations in the order of appearance of the variables.
176
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
177
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
with × denoting the convolution operation, so that both F X and 5. Empirical results
F Y are convolutions of beta-Stacy processes. Furthermore, if P is
the probability function corresponding to F , one has In this section we apply the B-RUP model to calculate the de-
pendence of coupled lifetimes, in order to price joint and survivor
annuities.
min(x, y )
P X Y (x, y ) = P A (a) P B (x − a) P C ( y − a), ∀x, y ∈ N02 . We first consider an analytical example, where we sample pos-
a =0
sibly censored observations from a known distribution. Knowing
the original distribution will not only allow us to estimate the
(9)
difference between the original distribution and the posterior com-
Although the joint posterior distribution of Equation (9) can in puted through MCMC, but also to study how these differences
principle be computed analytically5 (Bulla et al., 2007), its com- affect the final price of the annuity.
plexity grows quickly with the number of observations, making In the second part of the section, the B-RUP is tested on a well-
it already unfeasible for a relatively small sample. A convenient known Canadian data set originally used by Frees et al. (1996),
way of by-passing the problem is therefore to use a Markov Chain which we will also use as a benchmark.
Monte Carlo (MCMC) approach, also known as Gibbs sampler, to
obtain the joint probability distribution from the posterior distri- 5.1. Analytical example
butions of A n , B n and C n , using the one-factor construction.
The MCMC method consists of the following steps: Assume that X and Y are linked through the one-factor model
of Equation (7). Assume that A follows a Poisson distribution with
[k]
1. Generate a realization of A n , with the superscript referring to intensity parameter λ equal to 25 (from now on we use the no-
the iteration number, at the k-th iteration conditioned on the tation A ∼ Poi(25)), and that B ∼ Poi(35) and C ∼ Poi(40). This
[k] implies that the marginals of X and Y are Poisson distributions
available data of the previous iteration. That is, sample A n
from its conditional distribution, i.e. with parameters 60 and 65, respectively; and that Cov( X , Y ) =
Var( A ) = 25. The correlation between X and Y is therefore ap-
P [ An[k] = an | An[k−−11] , X n∗ , δn , Y n∗ , n ] ∝ (10) proximately 0.4.
[k−1] Moreover, while not necessary true in general, we will make
P [ B n = xn − an , δn = dn | B n−1 = X n∗−1 − A n −1 , δ n −1 ] some assumptions about the dependence between the censoring
[k−1]
P [C n = yn − an , n = en |C n−1 = Y n∗−1 − A n −1 , n −1 ] variables, T X and T Y (see Section 3), so that we can generate
censored observations and better study the properties of our con-
P [ An = an | An[k−−11] ],
[k]
struction. We start from the naive observation that the two an-
nuitants of a joint and survivor annuity enter the contract at the
where, using the fact that P [ X ≥ k] − P [ X > k] = P [ X = k],
same time; say that X 0 is the age of the first annuitant at the date
one has
of the signature, while Y 0 is that of the second person. If we de-
note the observation period by , it is clear that T X = X 0 + and
5
Also observe that the conjugacy of the constituent RUPs { An }, { B n } and {C n } T Y = Y 0 + . Moreover, X 0 = Y 0 + θ , where θ is the age difference
allows for the continuous updating of the B-RUP as well. between the annuitants, which can be negative if Y 0 > X 0 .
178
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
Fig. 2. Comparison of the marginal distributions of X and Y . In green we show the prior distribution, the Kaplan-Meier estimators in red, the B-RUP solution in blue and the
original distribution in black. All distributions are conditioned on the minimum value of the uncensored observations. For this sample those were Xmin = 34 and Y min = 39.
Table 1 so that, for this particular example, we generate 104 samples from
Proposed strength of belief scenarios. Notice that, since we are considering a data a Poi(20).
set of size 104 , even under the high belief scenario, the prior will not be able to
In Fig. 2 we show the recovered marginal distribution using the
actually affect those areas where the observations are more concentrated, but it
will definitely influence the areas with fewer data points. B-RUP via MCMC, as well as the KM estimators and the original
distribution. Notice how, due to the lack of exact samples on the
Low belief High belief
right tail of the distribution, the KM estimator is not properly de-
c A , c B , cC 10−6 102 fined on the whole support of the original distribution, but only up
to the maximum value of the uncensored observations. The B-RUP
model, on the contrary, is able to recover the right tail of the dis-
Therefore, to obtain right-censored observations, we have to tribution, although the fitting is clearly worse in that part of the
choose the distributions of T X , and θ . In what follows we take curve, because of the evident lack of observations. If available, a
T X ∼ Poi(50), ∼ Poi(2) and θ = θ ∗ − θ0 , with θ ∗ ∼ Poi(7) and better a priori could here be used to improve tail fitting.
θ0 = 5. We then sample n = 104 couples of observations, of which We also present the joint distribution in Figs. 3a and 3b, for the
almost 90% turn out to be at least partially censored (i.e. at least low and high belief scenarios, respectively. Notice how, in Fig. 3a,
one of the annuitants in the couple is right-censored). the fitting is worse in the upper-right corner than in the lower-left
Following the procedure defined in Section 4, the first step to corner, in accordance with the right-censoring effect. For the high
use the B-RUP is to define prior distributions for the target vari- belief scenario, however, the prior distribution dominates in those
ables according to Equation (6), as well as the strength of belief areas of few observations and we recover a smoother surface, as
per Fig. 3b.
parameters. The priors we choose are G A = Poi(20), G B = Poi(20)
A more quantitative comparison can be found in Table 2, where
and G C = Poi(20), respectively. For the strengths of belief, ck with
we show the means, the variances and the correlation for both
k ∈ { A , B , C }, we consider two different scenarios: one where the
B-RUP scenarios, as well as for the KM estimator, the theoretical
strengths of belief are very small, so that the posterior distribution
analytical solution, and the data. In this last case we provide the
is strongly affected by the incoming data, as if we did not trust our
sample estimates under two different points of view: 1) correctly
a priori; and another scenario where the strengths of belief are big
using only the truly uncensored observations (“Uncensored”), and
enough to keep memory of the a priori—which we trust—and, in
2) taking into consideration the entire data set, ignoring the pres-
particular, to influence those areas in the data with less observa-
ence of censoring (“Whole”). While this second approach is not
tions. As said, the a priori can indeed be used to complement the
correct (Klein and Moeschberger, 2003), it can be heuristically use-
data with information about rarely observed events and trends. In
ful to better understand the impact of right-censoring in the data.
Table 1 the values of the strengths of belief for both scenarios6 are
For example we can clearly see that not considering censoring can
provided. In the following we will refer to them as the “low be-
lead to a serious overestimation of the dependence in the data.
lief” and the “high belief” scenarios, or with the subscripts l and h, Always in Table 2, observe how the B-RUP is able to recover the
when referring to the results obtained with the B-RUP model. For correlation, under both scenarios,7 something not possible when
example, when we write B-RUPl , we refer to the B-RUP estimators using the KM approach. Moreover, observe how the B-RUP, in par-
in the low belief scenario. B-RUPh is defined analogously for the ticular under the low strength of belief, better captures the vari-
high belief scenario. ability of both X and Y .
In order to initialize the Gibbs sampler, we also need to define In Table 3 we show permutation tests performed on the means
an initial sample for A, which is not observable in practice. There- and the variances. Under the null hypothesis of no difference be-
fore, this sample will be “artificial”, and we need a reasonable way tween the obtained estimates and the true analytical values of
of generating it, taking into consideration the characteristics of A, Table 2, the tests never reject the null for the B-RUP estimators
like for example its positiveness. Moreover, the resulting posterior under the low strength of belief scenario, at a standard 5% signif-
distribution will be affected by this artificial sample, and thus in icance level. These permutation tests where performed simulating
practice it is recommended to compute the posterior for several samples of size 103 from both the B-RUP (low and high strength
appropriate initial samples. In our case, we will sample observa- of belief) and the KM marginals. For the mean, the test statistic
tions from the same distribution that we chose for the prior of A, used is the absolute difference between the means, while for the
variance Good’s test (Good, 1994) is employed. Looking at Table 3,
6
Note that we could go as far as to define different strengths of belief for each
7
urn within a RUP (Muliere et al., 2000). In this work, however, we will always as- Although both scenarios give reasonable results, the low belief scenario returns
sume the same strength of belief for each RUP. values closer to the actual ones, being more data-driven, as expected.
179
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
Fig. 3. Contour plot comparing the analytical distribution with the one obtained through the B-RUP model for the low (a) and high (b) belief scenarios. All distributions are
conditioned on the minimum value of the uncensored observations: X min = 34 and Y min = 39.
Fig. 4. Ratio between the annuity values of the joint and independent survival models. The left plot (a) shows the surface obtained using the B-RUP construction with low
strength of belief, while the right plot (b) shows the price ratio using the actual distribution (via the known analytics). Note how the ratio differs more between the two
plots for high initial ages, since, because of the censored observations, the B-RUP fitting is worse in that area. The result is nevertheless quite positive.
180
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
Fig. 5. Ratio between the annuity values of the joint and independent survival models. The left plot (a) shows the surface obtained using the B-RUP construction with low
strength of belief, while the right plot (b) shows the price ratio using the true (analytical) distribution. A visual comparison suggests that the B-RUP definitely performs well.
181
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
Fig. 6. Comparison of the marginals of X and Y using a Poisson prior. In green we show the prior distribution, the Kaplan-Meier estimators in red and the B-RUP solution in
blue. The upper row shows the results for the low belief scenario and the lower row the same for the high belief scenario. All distributions are conditioned on the minimum
value of the uncensored observations. For this example those were X min = 51 and Y min = 46.
Table 5 females are three years younger than males, when the observation
Comparison of the means, variances and correlation of X and Y using: the uncen- period starts. This combination of younger entry age plus higher
sored data, the whole data set (wrongly assuming no censoring), the KM estimator
life expectancy could be the reason why censoring affects more
and the B-RUP estimator for both low and high strength of belief, and the copula
model, respectively. the distribution of females than that of males for this particular
data set.
Unc. Whole KM Poil Poih
Some additional numbers can give insight about the lack of
Mean(X) 74.514 71.710 86.769 85.183 85.801 data in the tails. Only 48 females have indeed survived until at
Mean(Y) 74.011 68.871 89.625 85.509 87.108
Var(X) 52.049 42.821 75.553 57.556 74.305
least 90 years. Conversely, the number of males observed in the
Var(Y) 61.707 54.118 56.316 37.034 54.947 same range is 77. While this difference may not seem significant,
Corr(X,Y) 0.820 0.776 – 0.477 0.421 notice that the life expectancy of males, according to the results in
Table 5, is 3 years lower than that of women. Therefore one would
expect to observe more women at high ages than men, whereas
5.2.1. B-RUP results for the joint survival function the opposite happens in our data. This could also be explained by
Fig. 6 shows the B-RUP marginal distributions for both the low the lower entry ages of females with respect to males.
and high strengths of belief. We observe that the marginal distri- Table 5 presents the means and the variances of X and Y , and
bution for the first annuitants (males) is nicely recovered. However, their correlation, for the raw uncensored data, the KM estimator
the differences are clearly bigger9 for the marginal of Y –the sec- (notice that in this case correlation is not obtained because inde-
ond annuitant (females)–since censoring seems to affect more the pendence is assumed), and the B-RUP. For completeness, we also
right-tail. provide all quantities when taking all observations as uncensored
The lack of information about the right-tail of the distribution (“Whole”), as we did in the analytical example.
of female lifetimes is so large that the KM cumulative probability First notice that using only uncensored observations leads to an
at the age of 97 is around 0.8, while for the age of 98 the cu- underestimation of the average lifetimes and to a likely overesti-
mulative distribution function (cdf) reaches its maximum value of mation of dependence. But this is no surprise, as it is known that
1. Taken literally, this would mean that there is a 20% probabil- ignoring censoring is highly misleading in survival analysis (Klein
ity for females of dying between 97 and 98 years, something we and Moeschberger, 2003).
know is not true. This unrealistic behavior could be due to the fact In Figs. 7a and 7b we show the joint distributions obtained
that, as per Table 5, female annuitants have a slightly longer life with the B-RUP model for the low and high belief scenarios, re-
expectancy than males, while, from the initial ages at which they spectively, as well as the prior distribution for comparison pur-
enter the contract, we see that on average (Luciano et al., 2008) poses. Notice how, although their marginals are very similar–see
Fig. 6–the shape of the contours is considerably different around
the areas of lower probability. Conversely the joints are more com-
9
We observe that the differences are slightly smaller for the high strength of
patible around the inner contours, for the bulk of the distribution,
belief. This could be due to the fact that a strong a priori helps filling the gaps in sign that the strength of belief is not large enough to affect the ar-
the right tail of the marginals, which, because of censoring, are highly unrealistic. eas where data are concentrated. In particular, for high strengths of
182
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
Fig. 7. Contour plot showing the joint distribution obtained via the B-RUP model for the low (a) and high (b) belief scenarios and their respective densities via bivariate
kernel estimation ((c) and (d)) using a Poisson prior. All distributions are conditioned on the minimum value of the uncensored observations, i.e. Xmin = 51 and Y min = 46.
Fig. 8. Surface plot showing the relative difference between the joint survival distribution and the product of the marginal survival distributions (independence) for the low
(a) and high (b) belief scenarios.
belief the contours are somewhat smoother, a phenomenon which symmetric than those of the low belief kernel estimate, in accor-
we already observed in the analytical example. This makes the dis- dance with the results of Figs. 7a and 7b.
tribution of Fig. 7b easier to interpret. Nevertheless, notice that Notice that, by increasing the strength of belief, the areas with
this smoothing induced by the strength of belief is a secondary fewer observations are dominated by the prior, and this causes a
role, while the main purpose of a reliable prior is to embed behav- decline in the correlation, which goes from 0.477 to 0.421, consis-
iors that are not captured by the data. Moreover, whereas further tently with the findings in Table 5.
increasing the strength of belief would surely result in smoother As observed in Carriere (2000) for this very same data set–
distributions, we could end up giving a larger weight to the prior and more recently in Sanders and Melenberd (2016) for a data set
distribution than originally intended. concerning the whole Dutch population–the positive dependence
A way to improve our previous results, obtaining a more between the lifetimes of married couples increases with the ages
tractable joint distribution, while maintaining the desired strength of its members. In other words, people in their forties are expected
of belief, is to use a kernel bivariate density estimation. For ex- to present a smaller positive dependence than a couple in their
ample, we can employ a bivariate normal kernel, with bandwidth eighties. To check for this phenomenon, as in Sanders and Melen-
parameters consistent with Sylverman’s rule of thumb (Sylverman, berd (2016) we compute the relative difference between the joint
1986). The obtained distributions are presented in Figs. 7c and 7d, survival function and the product of the marginal survival distri-
for low and high strengths of belief, respectively. Note that the butions. The results are in Figs. 8a and 8b for the low and high
contours of the high belief kernel estimate are considerably more belief scenarios, respectively, and they are clearly in line with an
183
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
Fig. 9. Ratio between the annuity values of the joint and independent survival models using the B-RUP estimators. The left plot (a) shows the dependency with the initial
age of the annuitants an interest rate of 0.05 for low strength of belief. The right plot (b) shows similar results for high strength of belief.
Fig. 10. Ratio between the annuity values of the joint and independent survival models using the B-RUP with different strengths of belief ((a) and (b)).
increase of dependence with age. Moreover, from the same figures more relevant in the near future than in the long term—according
we can conclude that the relative difference is bigger under the to the meaning of “high” entry ages, there is no long term survival
low belief scenario, which would explain the slight increase in the to be expected in the first place—and the annuity ratio keeps on
overall correlation (Table 5). decreasing below the unit. All these findings are in line with the
results obtained in Frees et al. (1996) for the same data set with
5.2.2. Annuity pricing with the B-RUP copula models.
Now that we have estimated the joint probability distribution, Also notice the similarity of these surfaces with those of Fig. 4:
we can calculate the annuity ratio. We still assume a fixed interest qualitatively the behavior is mostly the same. This is because in
rate of 0.05 and show the annuity ratio for several initial ages of both cases we have positive dependence, and thus all the reason-
the annuitants. Next, when we want to compute the annuity ratio ing developed in this section also applies for the analytical exam-
as a function of the interest rate, we will hypothesize that annui- ple. Moreover, since the correlation levels are very similar (around
tants enter the contract at the same age to simplify the exposition, 0.5), we obtain annuity ratios of comparable orders.
so that we can easily plot the results as a function of the interest
The results of the annuity ratio as a function of the interest
rate only.
rate can be found in Fig. 10. Please notice how the overall shape is
Fig. 9 shows the results for the low and high strengths of be-
in accordance with what we already observed in Fig. 5. For small
lief, and the fixed interest rate. In line with expectations, even for
interest rates and entry ages, the annuity ratio is below one, mean-
the annuity ratio, the high belief scenario gives again a smoother
ing that the annuity price with independent mortality is overesti-
surface.
mated. As we increase the entry age, the annuity ratio decreases
Notice that, in both cases, the annuity ratio increases with the
up to a minimum (0.9527 for and interest rate equal to 0 and an
age difference between the annuitants. When one of the annui-
tants dies at a very old age, the probability for the surviving spouse entry age of 71 years), from where it starts to increase again, also
to also die at an old age is larger under positive correlation than reaching values above the unit.
under independence. This effect seems to be bigger when the fe- We would like to stress that, while annuity ratios of 1.02 or 0.98
male is the surviving annuitant, probably because of the larger life may seem close enough to 1, and this could lead to the conclusion
expectancy of females with respect to males (Carriere, 2000). that it is ok to assume independence, the nominal value for an
When both spouses are around the same age, the annuity ra- annuity contract is usually considerably large, and thus the final
tio takes values below one, specially for very high entry ages. If difference in the prices may not be negligible in monetary terms.
one annuitant dies at a young age, the probability of the remain- In Table 6 we show some values of the annuity ratio for particular
ing annuitant to also die in the near future is higher when positive entry ages.
dependence is assumed, but in the long term the surviving annu- The results above are in line with what Frees et al. (1996) and
itant will count as being independent from the deceased spouse, Luciano et al. (2008) found on the same data, using different ap-
and the annuity ratio is closer to 1. Conversely, for high entry ages, proaches like copulas, suggesting that the B-RUP can be safely used
the impact of the deceased spouse upon the surviving annuitant is as an alternative model.
184
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
Fig. 11. Comparison of the marginals of X and Y using a Poisson prior. In red we show the Kaplan-Meier estimators and the copula distribution in black. All distributions
are conditioned on the minimum value of the uncensored observations. For this example those were Xmin = 51 and Y min = 46.
Table 6 Table 7
Annuity ratio obtained with the B-RUP model with low strength of belief parame- Calibration of the copula model via MLE. The subscripts X , Y refer to the marginals
ters for several initial ages of the annuitants with an interest rate of 0.05. Assuming of the male and female annuitants, respectively. Notice that α is negative and away
independence can definitely be a bad choice in pricing. from 0, so according to Equation (12) there is positive dependence.
Y μX σX μY σY α
40 50 60 70 80
X
88.783 5.927 90.118 5.145 −4.144
40 0.993 0.996 0.999 1.001 1.007
50 0.996 0.989 0.992 1.000 1.012
60 0.999 0.993 0.979 0.988 1.019
70 1.001 1.001 0.990 0.966 1.009 Table 8
80 1.007 1.013 1.021 1.015 0.977 Comparison of the means, variances and correlation of X and Y using: the uncen-
sored data, the whole data set (wrongly assuming no censoring), the KM estimator
and the copula model, respectively.
In the next Subsection we compare the B-RUP and a copula Unc. Whole KM Copula
approach in some more detail.
Mean(X) 74.514 71.710 86.769 85.427
Mean(Y) 74.011 68.871 89.625 87.167
5.2.3. What about copulas? Var(X) 52.049 42.821 75.553 55.199
We finally compare the performance of the B-RUP with copu- Var(Y) 61.707 54.118 56.316 43.090
las. In particular, we choose the Frank copula model of Frees et al. Corr(X,Y) 0.820 0.776 – 0.512
(1996), which proved to give good results for the annuity problem.
In Frees et al. (1996), Gompertz distributions are used for the
individual lifetimes, and the Frank copula to model the depen-
dence. The Gompertz distribution is given by
μ x
Gomp(x; μ, σ ) = 1 − exp e − σ (1 − e σ ) , (11)
1 (e α u − 1)(e α v − 1)
C (u , v ; α ) = log 1 + , (12)
α eα − 1
where u , v are the marginal distributions for the male and female
annuitant, respectively, and α is the parameter controlling the de- Fig. 12. Joint lifetime distribution for the copula construction of Frees et al. (1996).
pendence. A negative value of α indicates positive dependence,
while α = 0 means that we have independence (Nelsen, 2006).
It was already shown in Frees et al. (1996) that this model can earlier, for this particular data set there is barely any information
be calibrated via maximum likelihood estimation (MLE), and we about the right tail of the females’ distribution, and so there is no
refer to the original paper for all the details regarding the estima- surprise in models yielding slightly different results.
tion of the parameters. In Table 7 we present the results obtained In Fig. 12 we show the joint distribution given by the copula,
using MLE, where (μ X , σ X ) are the Gompertz estimates for the with the B-RUP results already presented in Fig. 7. As expected,
male annuitant, and (μY , σY ) the estimates for the female annui- the parametric copula produces a less “cloudy” and elliptical dis-
tant. Since the value of α is highly negative, we expect a strong tribution, with less mass on “peripheral” couples like (70, 85). The
positive dependence. It can be seen in Table 8 that this corre- copula also gives a slightly stronger correlation (0.51 versus 0.48),
sponds to a Pearson correlation of 0.51. but we can consider this difference as negligible in practice.
In Figs. 11a and 11b we present the marginal distributions for Finally, we check if the small discrepancies in the survival func-
this model, with also the KM estimator for comparison. If we com- tions of the B-RUP and the copula model are also observed once
pare these plots with those of Fig. 6, we see that both the B-RUP we compute the annuity ratio. The annuity results for the Frank
and the copula model return very similar curves. We can also com- copula are in Figs. 13a and 13b. Apart from the areas where both
pare the moments using Tables 5 and 8. For the male annuitants, annuitants are considerably old (> 75) at the time of contract ini-
both the mean and the variance are remarkably similar, while tiation, we see that the B-RUP and copula models tend to give very
there are some discrepancies for female annuitants. As discussed similar results.
185
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
Fig. 13. Ratio between the annuity values of the joint and independent survival models using the copula construction. The left plot (a) shows the annuity ratio for a fixed
interest rate r = 0.05. Similar results when the couples have the same age are presented on the right plot (b).
All in all the B-RUP and the Frank copula of Frees et al. (1996) In terms of performances, the model provides results in a very
show similar performances–but this holds true also for the other reasonable time, spanning from a few minutes to a couple of
models of Frees et al. (1996) and Luciano et al. (2008)–so why hours, depending on the size of the data set and the amount of
should one prefer the B-RUP for annuity pricing? predictions required. In particular, for the Canadian data set con-
While working nicely on data, copulas do not have the same sidered and 104 iterations of the Gibbs sampler, the algorithm took
flexibility of the B-RUP. In particular when a priori knowledge can approximately 1 minute of computing time in a C++ environment
be used to improve fitting, or to deal with specific characteris- and an Intel(R) Core(TM) i7-7700HQ CPU @ 2.80 GHz processor.
tics of the data—like a severe lack of observations in the tails—the In the future, it would be extremely useful to go deeper into
B-RUP clearly has an edge over the other commonly used ap- the study of credible and reliable a prioris for annuity modeling,
proaches. And this without taking into consideration the model by gathering opinions and recommendations from experts. From
risk involved in every parametric choice behind the use of copulas a computational point of view, conversely, it could be meaning-
(Hull, 2015; Mikosch, 2006). This is why we believe that the non- ful to find ways of introducing parallelization in the simulations of
parametric approach the B-RUP proposes, which combines a priori the B-RUP. While not immediately relevant to the application de-
knowledge (when meaningful) and the ability of learning from the scribed in this paper, such a possibility could dramatically expand
data, represents a viable and powerful alternative to the existing the applicability and the performances of reinforced urn models
approaches to the modeling of coupled lifetimes. The fact that one in insurance and finance (Amerio et al., 2004; Cheng and Cirillo,
can get the same results, but also extend them with experts’ judge- 2018, 2019; Peluso et al., 2015).
ments (eliciting a different a priori as in Appendix A) is a plus we
deem worth using. Declaration of competing interest
186
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
Fig. 14. Comparison of the marginals of X and Y using a Gompertz prior. In green we show the prior distribution, the Kaplan-Meier estimators in red and the B-RUP solution
in blue. The upper row shows the results for the low belief scenario and the lower row the same for the high belief scenario. All distributions are conditioned on the
minimum value of the uncensored observations. For this example, X min = 51 and Y min = 46.
Fig. 15. Comparison of the marginals of X and Y using an Uniform prior. In green we show the prior distribution, the Kaplan-Meier estimators in red and the B-RUP solution
in blue. The upper row shows the results for the low belief scenario and the lower row the same for the high belief scenario. All distributions are conditioned on the
minimum value of the uncensored observations. Here X min = 51 and Y min = 46.
187
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
Table 9 Derbyshire, J., 2017. The siren call of probability: dangers associated with using
Comparison of the means, variances and correlation of X and Y using: the uncen- probability for consideration of the future. Futures 88, 43–54. https://doi.org/
sored data, the whole data set (wrongly assuming no censoring), the KM estimator 10.1016/j.futures.2017.03.011.
and the B-RUP estimator with Gompertz and Uniform priors, respectively. The sub- Doksum, K., 1974. Tailfree and neutral random probabilities and their posterior
script l (h) indicates low (high) strength of belief. distributions. Annals of Probability 2 (2), 183–201. https://doi.org/10.1214/aop/
1176996703.
Unc. Whole KM Gompl Gomph Unil Unih
Elandt-Johnson, R.C., Johnson, N.L., 1980. Survival Models and Data Analysis. John
Mean(X) 74.514 71.710 86.769 85.907 85.865 87.457 90.819 Wiley. ISBN 9780471031741.
Mean(Y) 74.011 68.871 89.625 84.950 88.208 89.696 94.295 Ferguson, T.S., 1973. A Bayesian analysis of some nonparametric problems. The An-
Var(X) 52.049 42.821 75.553 65.519 67.886 106.766 208.802 nals of Statistics 1 (2), 209–230. https://doi.org/10.1214/aos/1176342360.
Var(Y) 61.707 54.118 56.316 45.047 64.267 91.114 217.951 Fortini, S., Petrone, S., 2012. Hierarchical reinforced urn processes. Statistics & Prob-
Corr(X,Y) 0.820 0.776 – 0.475 0.414 0.611 0.810 ability Letters 82 (8), 1521–1529. https://doi.org/10.1016/j.spl.2012.04.012.
Frees, E.W., Carriere, J., Valdez, E., 1996. Annuity valuation with dependent mortal-
ity. The Journal of Risk and Insurance 63 (2), 229–261. https://doi.org/10.2307/
253744.
are significantly different, especially for high strengths of belief–
Good, P., 1994. Permutation, Parametric and Bootstrap Tests of Hypotheses. Springer,
see Figs. 15c and 15d–where the B-RUP and KM marginals have New York.
very little in common. Trivially, as one grows older the probability Gribkova, S., Lopez, O., 2015. Non-parametric copula estimation under bivariate cen-
of demise increases, and therefore imposing a uniform distribu- soring. Scandinavian Journal of Statistics 42 (4), 925–946. https://doi.org/10.
tion for the lifetime of an individual is highly unrealistic, so this 1111/sjos.12144.
Hull, J.C., 2015. Risk Management and Financial Institutions, 4th edition. Wiley, New
mismatch was actually to be expected. In this sense, the B-RUP
York. ISBN 9781119448112.
inherits the pros and the cons of the prior setting used, for sure Jagger, C., Sutton, C.J., 1991. Death after marital bereavement—is the risk increased?
under high strengths of belief. Naturally and conversely, for low Statistics in Medicine 10 (3), 395–404. https://doi.org/10.1002/sim.4780100311.
strengths of belief, the B-RUP is able to capture both marginals, Kaplan, E.L., Meier, P., 1958. Nonparametric estimation from incomplete observa-
tions. Journal of the American Statistical Association 53 (282), 457–481. https://
except for the areas with very few observations, so that the model
doi.org/10.2307/2281868.
is able to override a wrong prior given a sufficient amount of ob- Klein, J.P., Moeschberger, M.L., 2003. Survival Analysis Techniques for Censored and
servations. Truncated Data, second edition.
In Table 9 we show the first two moments and the correla- Knight, W., 2017. The dark secret at the heart of AI. Technology Review 120, 54–63.
tion, similar to what we did in Table 5 for the Poisson prior. As Lin, D.Y., 1993. A simple nonparametric estimator of the bivariate survival function
under univariate censoring. Biometrika 80, 573–581. https://doi.org/10.2307/
before, the subscript l (h) denotes low (high) strength of belief.
2337178.
Notice that, for low strengths of belief, the difference in the B-RUP Lopez, O., 2012. A generalization of Kaplan-Meier estimator for analyzing bivariate
results are mainly due to the choice of the initial sample for A in mortality under right-censoring and left-truncation with applications to model-
the Gibbs sampler, which in our case is generated from its prior checking for survival copula models. Insurance. Mathematics & Economics 51
(3), 505–516. https://doi.org/10.1016/j.insmatheco.2012.07.009.
distribution. Consistently with what we have already said, the uni-
Luciano, E., Spreeuw, J., Vigna, E., 2008. Modelling stochastic mortality for depen-
form scenario with a high strength of belief generates the most dent lives. Insurance. Mathematics & Economics 43, 234–244. https://doi.org/10.
radical results, including a correlation close to the one (0.8) we 1016/j.insmatheco.2008.06.005.
can estimate from the uncensored observations. All the other B- Mahmoud, H., 2008. Polya Urn Models. Chapman & Hall/CRC, Boca Raton.
RUPs tend to suggest a less extreme correlation, in the vicinity of ISBN 9781420059830.
Mikosch, T., 2006. Copulas: tales and facts. Extremes 9, 3–20. https://doi.org/10.
0.5, in line with Frees et al. (1996) and Luciano et al. (2008). In
1007/s10687-006-0015-x.
any case, a correlation of 0.5 should definitely not be ignored by Mitchell, O., Poterba, J.M., Warshawsky, M., Brown, J., 1999. New evidence on the
assuming independence, as per the KM approach. money’s worth of individual annuities. The American Economic Review 89 (5),
1299–1318. http://www.jstor.org/stable/117059.
Muliere, P., Secchi, P., Walker, S.G., 2000. Urn schemes and reinforced random walks.
References
Stochastic Processes and Their Applications 88 (1), 59–78. https://doi.org/10.
1016/S0304-4149(99)00119-2. ISSN 03044149.
Amerio, E., Muliere, P., Secchi, P., 2004. Reinforced urn processes for modeling credit Muliere, P., Secchi, P., Walker, S.G., 2003. Reinforced random processes in continuous
default distributions. International Journal of Theoretical and Applied Finance 07 time. Stochastic Processes and Their Applications 104 (1), 117–130. https://doi.
(04), 407–423. https://doi.org/10.1142/S0219024904002505. org/10.1016/S0304-4149(02)00234-X.
Blackwell, D., MacQueen, J.B., 1973. Ferguson distributions via Pólya urn schemes. Murphy, K.P., 2012. Machine Learning: A Probabilistic Perspective. The MIT Press,
The Annals of Statistics 1 (2), 353–355. https://doi.org/10.1214/aos/1176342372. Cambridge MA. ISBN 0262018020. 9780262018029.
Brown, J.R., Poterba, J.M., 2000. Joint life annuities and annuity demand by married Nelsen, R.B., 2006. An Introduction to Copulas. Springer, New York. http://www.
couples. The Journal of Risk and Insurance 74 (4), 527–553. https://doi.org/10. worldcat.org/search?qt=worldcat_org_all&q=0387286594.
2307/253849. Peluso, S., Mira, A., Muliere, P., 2015. Reinforced urn processes for credit risk models.
Bulla, P., 2005. Application of Reinforced Urn Processes to Survival Analysis. PhD Journal of Econometrics 184 (1), 1–12. https://doi.org/10.1016/j.jeconom.2014.
thesis. Bocconi University. 08.003.
Bulla, P., Muliere, P., Walker, S., 2007. Bayesian nonparametric estimation of a bi- Pruitt, R.C., 1991. On negative mass assigned by the bivariate Kaplan-Meier
variate survival function. Statistica Sinica 17 (2), 427–444. http://www.jstor.org/ estimator. The Annals of Statistics 19, 443–453. https://doi.org/10.1214/aos/
stable/24307726. 1176347992.
Carriere, J.F., 2000. Bivariate survival models for coupled lives. Scandinavian Actuar- Sanders, L., Melenberd, B., 2016. Estimating the joint survival probabilities of mar-
ial Journal 1, 17–32. https://doi.org/10.1080/034612300750066700. ried individuals. Insurance. Mathematics & Economics 67, 88–106. https://doi.
Cheng, D., Cirillo, P., 2018. A reinforced urn process modeling of recovery rates and org/10.1016/j.insmatheco.2015.12.006.
recovery times. Journal of Banking & Finance (ISSN 0378-4266) 96, 1–17. http:// Shackle, G.L.S., 1955. Uncertainty in Economics and Other Reflections. Cambridge
www.sciencedirect.com/science/article/pii/S037842661830178X. University Press, Cambridge. ISBN 9780521153317.
Cheng, D., Cirillo, P., 2019. A urn-based nonparametric modeling of the dependence Shen, P.-S., Yan, Y.-F., 2008. Nonparametric estimation of the bivariate survival func-
between PD and LGD with an application to mortgages. https://doi.org/10.2139/ tion with left-truncated and right-censored data. Journal of Statistical Planning
ssrn.3360531. and Inference 138 (12), 4041–4054. https://doi.org/10.1016/j.jspi.2008.02.007.
Cirillo, P., Hüsler, J., Muliere, P., 2013. Alarm systems and catastrophes from a di- Sheshinski, E., 2007. The Economic Theory of Annuities. Economics Books, vol. 8536.
verse point of view. Methodology and Computing in Applied Probability 15 (4), Princeton University Press. ISBN 9780691133058.
821–839. Sylverman, B.W., 1986. Density Estimation for Statistics and Data Analysis. Chapman
Cox, D.R., Oakes, D., 1984. Analysis of Survival Data. Chapman & Hall. & Hall/CRC. ISBN 9780412246203.
ISBN 9780412244902. Taleb, N.N., 2007. The Black Swan: The Impact of the Highly Improbable. Random
Dabrowska, D.M., 1988. Kaplan-Meier estimate on the plane. The Annals of Statis- House. ISBN 978-0812973815.
tics 16 (4), 1475–1489. https://doi.org/10.1214/aos/1176351049. Walker, S., Muliere, P., 1997. Beta-Stacy processes and a generalization of the Pólya-
de Finetti, B., 2017. Theory of Probability: A Critical Introductory Treatment. Wiley urn scheme. The Annals of Statistics 25 (4), 1762–1780. https://doi.org/10.1214/
Series in Probability and Statistics. Wiley. ISBN 9781119286370. aos/1031594741.
188
L.A. Souto Arias and P. Cirillo Insurance: Mathematics and Economics 99 (2021) 174–189
Wang, W., Wells, M.T., 1997. Nonparametric estimators of the bivariate survival func- STATISTICAL-ASPECTS-OF-JOINT-LIFE-INSURANCE-PRICING-Youn-Shemyakin/
tion under simplified censoring conditions. Biometrika 84, 863–880. https:// a77552c3400d4d49cd97777bea63524f54299b64.
doi.org/10.1093/biomet/84.4.863. Youn, H., Shemaykin, A., 2001. Pricing practices for joint last survivor
Winklevoss, H., 1977. Pension Mathematics with Numerical Illustrations. insurance. In: Actuarial Research Clearing House, 2001.1. https://
Pension Research Council, Wharton School, University of Pennsylvania. www.soa.org/globalassets/assets/library/research/actuarial-research-clearing-
ISBN 9780812231960. house/2000-09/2001/arch-1/arch01v19.pdf.
Youn, H., Shemaykin, A., 1999. Statistical aspects of joint life insurance pricing. Youn, H., Shemyakin, A., Herman, E., 2002. A re-examination of the joint mortality
In: 1999 Proceedings of the Business and Statistics Section of the Ameri- functions. North American Actuarial Journal 6 (1), 166–170. https://doi.org/10.
can Statistical Association, pp. 34–38. https://www.semanticscholar.org/paper/ 1080/10920277.2002.10596035.
189