Introduction: Models for aggregate losses #

A portfolio of contracts or a contract will potentially experience a sequence of losses: $Y_{1}, Y_{2}, Y_{3}, \dots$ We are interested in the aggregate sum $S$ of these losses over a certain period of time.

How many losses will occur?
- if deterministic $(n)$ $⟶$ individual risk model
- if random $(N)$ $⟶$ collective risk model
How do they relate to each other?
- usual assumption: iid
When do these losses occur?
- usual assumption: no time value of money
  $⟶$ short term models
How big are these losses?

The Individual Risk Model #

Definition #

The Individual Risk Model #

In the Individual Risk Model $S = Y_{1} + \dots + Y_{n} = \sum_{i = 1}^{n} Y_{i},$ where $Y_{i}$ , $i = 1, 2, \dots, n$ , are iid claims. There are several methods to get probabilities about $S$ :

get the whole distribution of $S$ (if possible)
- Convolutions
- Generating functions
( $✠$ ) approximate with the help of the moments of $S$ (Module 4)

Convolutions of random variables #

In probability, the operation of determining the distribution of the sum of two random variables is called a convolution. It is denoted by $F_{X + Y} = F_{X} * F_{Y} .$ The result can then be convolved with the distribution of another random variable. For instance, $F_{X + Y + Z} = F_{Z} * F_{X + Y} .$ This can be done for both discrete and continuous random variables. It is also possible for mixed rv’s, but it is more complicated.

Formulas #

In short

Discrete case:
- df: $F_{X + Y} (s) = \sum_{x} F_{Y} (s - x) f_{X} (x)$
- pmf: $f_{X + Y} (s) = \sum_{x} f_{Y} (s - x) f_{X} (x)$
Continuous case (with $Y \geq 0$ , so that the integrand vanishes for $x > s$ ):
- cdf: $F_{X + Y} (s) = \int_{- \infty}^{s} F_{Y} (s - x) f_{X} (x) d x$
- pdf: $f_{X + Y} (s) = \int_{- \infty}^{s} f_{Y} (s - x) f_{X} (x) d x$

Examples:

discrete case: Bowers et al. (1997) Example 2.3.1 on page 35
continuous case: Bowers et al. (1997) Example 2.3.2 on page 36

Numerical example #

Consider 3 discrete r.v.’s with probability mass functions

$\begin{array}{rcll} f_{1} (y) & = & \frac{1}{4}, \frac{1}{2}, \frac{1}{4} & \text{for } y = 0, 1, 2 \\ f_{2} (y) & = & \frac{1}{2}, \frac{1}{2} & \text{for } y = 0, 2 \\ f_{3} (y) & = & \frac{1}{4}, \frac{1}{2}, \frac{1}{4} & \text{for } y = 0, 2, 4 \end{array}$

Calculate the pmf $f_{1 + 2 + 3}$ and the df $F_{1 + 2 + 3}$ of the sum of the three random variables.

Solution #

$y$	$f_{1} (y)$	$f_{2} (y)$	$f_{1 + 2} (y)$	$f_{3} (y)$	$f_{1 + 2 + 3} (y)$	$F_{1 + 2 + 3} (y)$
$0$	$1 / 4$	$1 / 2$	$1 / 8$	$1 / 4$	$1 / 32$	$1 / 32$
$1$	$1 / 2$	$0$	$2 / 8$	$0$	$2 / 32$	$3 / 32$
$2$	$1 / 4$	$1 / 2$	$2 / 8$	$1 / 2$	$4 / 32$	$7 / 32$
$3$	$0$	$0$	$2 / 8$	$0$	$6 / 32$	$13 / 32$
$4$	$0$	$0$	$1 / 8$	$1 / 4$	$6 / 32$	$19 / 32$
$5$	$0$	$0$	$0$	$0$	$6 / 32$	$25 / 32$
$6$	$0$	$0$	$0$	$0$	$4 / 32$	$29 / 32$
$7$	$0$	$0$	$0$	$0$	$2 / 32$	$31 / 32$
$8$	$0$	$0$	$0$	$0$	$1 / 32$	$32 / 32$

$\begin{array}{rcl} f_{1 + 2} (2) & = & 1 / 4 \cdot 1 / 2 + 1 / 2 \cdot 0 + 1 / 4 \cdot 1 / 2 \\ f_{1 + 2 + 3} (4) & = & 1 / 8 \cdot 1 / 4 + 2 / 8 \cdot 0 + 2 / 8 \cdot 1 / 2 + 2 / 8 \cdot 0 + 1 / 8 \cdot 1 / 4 \end{array}$

Using generating functions #

There is a 1-1 relation between a distribution and its mgf or pgf.

Because $M_{S} (t) = E [e^{t S}] = E [e^{t (Y_{1} + \dots + Y_{n})}] = E [e^{t Y_{1}} \dots e^{t Y_{n}}]$ and if losses are independent then we have $M_{S} (t) = E [e^{t S}] = E [e^{t Y_{1}}] \dots E [e^{t Y_{n}}] = M_{Y_{1}} (t) \dots M_{Y_{n}} (t) .$ The same argument holds for the pgf’s.

Sometimes, $M_{S} (t)$ or $p_{S} (t)$ can be recognised: this is the case for infinitely divisible distributions (e.g. Normal, Poisson, Inverse Gaussian) and certain other distributions (Binomial, Negative binomial).
Otherwise, $M_{S} (t)$ or $p_{S} (t)$ can be expanded numerically to get moments and/or probabilities.

Example #

Consider a portfolio of 10 contracts. The losses $Y_{i}$ ’s for these contracts are iid rv’s with mean 100 and variance 100. Determine the distribution, the expected value and the variance of $S$ if these losses are

Normal;
Gamma;
Poisson.
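
A worked sketch of the solution, using the product rule for mgfs stated above. The symbols $α$ and $β$ (Gamma shape and rate) are notation assumed here; they do not appear in the example itself:

$\begin{array}{lll} \text{Normal:} & M_{Y} (t) = e^{100 t + 50 t^{2}} & \Rightarrow M_{S} (t) = e^{1000 t + 500 t^{2}}, \text{ so } S \sim N (1000, 1000); \\ \text{Gamma:} & α / β = 100, \; α / β^{2} = 100 \Rightarrow α = 100, \; β = 1, \; M_{Y} (t) = (1 - t)^{- 100} & \Rightarrow M_{S} (t) = (1 - t)^{- 1000}, \text{ so } S \sim Γ (1000, 1); \\ \text{Poisson:} & M_{Y} (t) = e^{100 (e^{t} - 1)} & \Rightarrow M_{S} (t) = e^{1000 (e^{t} - 1)}, \text{ so } S \sim \text{Poi} (1000) . \end{array}$

In all three cases $E [S] = 10 \cdot 100 = 1000$ and $\text{Var} (S) = 10 \cdot 100 = 1000$ ; only the shape of the distribution differs.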

Using R #

Contrary to Excel, convolutions are extremely easy to implement in R using vectors.

# pmfs of Y1 and Y2 on the support 0, 1, 2, 3, 4 (padded with zeros)
f1 <- c(1/4, 1/2, 1/4, 0, 0)
f2 <- c(1/2, 0, 1/2, 0, 0)
# f12[s] = sum over x of f1(x) * f2(s - 1 - x): pmf of Y1 + Y2 at s - 1
f12 <- sapply(1:5, function(s) sum(f1[1:s] * f2[s:1]))
f12

## [1] 0.125 0.250 0.250 0.250 0.125

The example above is generalised in Exercise los9R.
A more advanced R function is convolve. It actually involves the Fast Fourier Transform (a method that is related to that of the mgf’s) for efficiency. We do not discuss this here, but it is used in the implementation of convolutions in the function aggregateDist of the package actuar (introduced later).
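
As a possible illustration, the numerical example with three pmfs can be reproduced with a small helper. The function name `conv_pmf` is hypothetical (it is not part of base R or actuar); the last lines compare its result with base R's FFT-based `convolve`, which gives the usual convolution when the second argument is reversed and `type = "open"` is used.

```r
# Convolution of two pmfs given as vectors on the supports 0, 1, ..., length - 1
# (conv_pmf is a hypothetical helper written for this sketch)
conv_pmf <- function(f, g) {
  out <- numeric(length(f) + length(g) - 1)
  for (i in seq_along(f)) {
    idx <- i + seq_along(g) - 1      # positions reached when the first rv equals i - 1
    out[idx] <- out[idx] + f[i] * g
  }
  out
}

f1 <- c(1/4, 1/2, 1/4)         # support 0, 1, 2
f2 <- c(1/2, 0, 1/2)           # support 0, 1, 2 (no mass at 1)
f3 <- c(1/4, 0, 1/2, 0, 1/4)   # support 0, ..., 4 (mass at 0, 2, 4)

f123 <- conv_pmf(conv_pmf(f1, f2), f3)   # pmf of the sum on 0, ..., 8
F123 <- cumsum(f123)                     # df of the sum
cbind(y = 0:8, f = f123 * 32, F = F123 * 32)   # in 32nds, as in the solution table

# Same pmf via the FFT-based base R function (equal up to rounding error)
f123_fft <- convolve(convolve(f1, rev(f2), type = "open"), rev(f3), type = "open")
all.equal(f123, f123_fft)
```

The explicit loop keeps the arithmetic exact and readable; the FFT route matters mostly for long vectors.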

The Collective Risk Model (Compound distributions, MW 2.1) #

Definition #

Introduction #

Two models, depending on the assumption on the number of losses:

deterministic - $n$
- main focus on the claims of individual policies (whose number is a priori known)
- $⟶$ Individual Risk Model
- discussed in previous sections
random - $N$
- main focus on claims of a whole portfolio (whose number is a priori unknown)
- $⟶$ Collective Risk Model
- this is another way of separating frequency and severity

In this section we focus on the Collective Risk Model.

Definition #

In the Collective Risk Model, aggregate losses become $S = Y_{1} + \dots + Y_{N} = \sum_{i = 1}^{N} Y_{i} .$ This is a random sum. We make the following assumptions:

$N$ is the number of claims
$Y_{i}$ is the amount of the $i$ th claim
the $Y_{i}$ ’s are iid with
- (c)df $G (y)$
- p(d/m)f $g (y)$
the $Y_{i}$ ’s and $N$ are mutually independent

Moments of $S$ #

We have $E [S] = E [E [S | N]] = E [N E [Y]] = E [N] E [Y],$ and

$\begin{array}{rcl} \text{Var} (S) & = & E [\text{Var} (S | N)] + \text{Var} (E [S | N]) \\ & = & E [N \text{Var} (Y)] + \text{Var} (E [Y] N) \\ & = & E [N] \text{Var} (Y) + E [Y]^{2} \text{Var} (N) \\ & = & E [N] (E [Y^{2}] - E [Y]^{2}) + E [Y]^{2} \text{Var} (N) \\ & = & E [N] E [Y^{2}] + E [Y]^{2} (\text{Var} (N) - E [N]) . \end{array}$
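
A minimal numerical sketch of these moment formulas. The frequency and severity pmfs are the ones used in the Bowers 12.2.2 example later in these notes; the variable names are chosen here for illustration.

```r
fn <- c(0.1, 0.3, 0.4, 0.2)   # Pr[N = 0], ..., Pr[N = 3]
fy <- c(0, 0.5, 0.4, 0.1)     # Pr[Y = 0], ..., Pr[Y = 3]
n <- seq_along(fn) - 1
y <- seq_along(fy) - 1

EN  <- sum(n * fn)
VarN <- sum(n^2 * fn) - EN^2
EY  <- sum(y * fy)
EY2 <- sum(y^2 * fy)
VarY <- EY2 - EY^2

ES    <- EN * EY                          # E[S] = E[N] E[Y]
VarS  <- EN * VarY + EY^2 * VarN          # third line of the derivation
VarS2 <- EN * EY2 + EY^2 * (VarN - EN)    # last line of the derivation
c(ES = ES, VarS = VarS, VarS2 = VarS2)
```

Both expressions for the variance should agree, which is a quick way to check the algebra.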

Moment generating function of $S$ #

It is possible to get $M_{S} (t)$ as a function of $M_{Y} (t)$ and $M_{N} (t)$ :

$\begin{array}{rcl} M_{S} (t) & = & E [e^{t S}] = E [E [e^{t (Y_{1} + Y_{2} + \dots + Y_{N})} | N]] \\ & = & E [M_{Y} (t)^{N}] = E [e^{N \ln M_{Y} (t)}] \\ & = & M_{N} (\ln M_{Y} (t)) . \end{array}$

Example (Bowers et al. (1997), 12.2.1) #

Assume that $N$ is geometric with probability of success $p$ : $Pr [N = n] = p q^{n}, n = 0, 1, \dots,$ where $0 < q < 1$ and $p = 1 - q$ . We have then $M_{N} (t) = E [e^{t N}] = \sum_{n = 0}^{\infty} p q^{n} e^{t n} = \frac{p}{1 - q e^{t}},$ and thus $M_{S} (t) = M_{N} (\ln M_{Y} (t)) = \frac{p}{1 - q e^{\ln M_{Y} (t)}} = \frac{p}{1 - q M_{Y} (t)} .$

Distribution of $S$ #

It is possible to get a fairly general expression for the df of $S$ by conditioning on the number of claims:

$F_{S} (x) = \sum_{n = 0}^{\infty} \Pr [S \leq x | N = n] \Pr [N = n] = \sum_{n = 0}^{\infty} G^{* n} (x) \Pr [N = n], \qquad (1)$

where $G^{* n} (x)$ is the $n$ -fold convolution of $G$ .

Note that

$N$ will always be discrete, so this works for any type of rv $Y$ (continuous, discrete or mixed).
However, the type of $S$ will depend on the type of $Y$ .

Distribution of $S$ if $Y$ is continuous #

If $Y$ is continuous, $S$ will generally be mixed:

with a mass at 0 because of $Pr [N = 0]$ (if positive)
continuous elsewhere, but with a density integrating to $1 - Pr [N = 0]$

Example, continued (Bowers et al. (1997), 12.2.3) #

Assume now that $G (y) = 1 - e^{- y}$ for $y \geq 0$ , and hence $M_{Y} (t) = \frac{1}{1 - t}$ for $t < 1$ . Recall that $\Pr [N = 0] = p$ and $M_{S} (t) = \frac{p}{1 - q M_{Y} (t)} .$ It follows that $M_{S} (t) = \frac{p}{1 - q \frac{1}{1 - t}} = p + q \frac{p}{p - t} = p E [e^{t \cdot 0}] + (1 - p) E [e^{t Z}],$ where $Z$ is an exponential rv with parameter $p$ . Therefore, $f_{S} (s) = \begin{cases} p = \Pr [N = 0] \text{ (probability mass)} & s = 0; \\ (1 - p) p e^{- p s} \text{ (probability density)} & s > 0. \end{cases}$
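
A small numerical check of this result against formula (1), as a sketch. The value $p = 0.4$ , the evaluation points and the truncation at 500 terms are arbitrary choices made here. Given $N = n \geq 1$ , the sum of $n$ unit exponentials is Gamma with shape $n$ and rate 1, so $G^{* n}$ is available through `pgamma`.

```r
p <- 0.4
q <- 1 - p
x <- c(0.5, 1, 2, 5, 10)   # points at which the df of S is evaluated
n <- 1:500                 # truncation of the infinite sum (q^500 is negligible)

# Formula (1): mass p at zero plus sum over n >= 1 of G^{*n}(x) Pr[N = n]
F_series <- sapply(x, function(xx) p + sum(p * q^n * pgamma(xx, shape = n, rate = 1)))

# Closed form from the mgf argument: mass p at 0 and, with weight q, an Exp(p)
F_closed <- p + q * pexp(x, rate = p)

max(abs(F_series - F_closed))
```

The two vectors should agree up to numerical error, which confirms the mixture representation of $S$ .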

Distribution of $S$ if $Y$ is mixed #

If $Y$ is mixed, $S$ will generally be mixed:

with a mass at 0 because of $Pr [N = 0]$ and $Pr [Y = 0]$ (if positive)
mixed (if $Y$ is not continuous for $x > 0$ ) or continuous elsewhere
with a density integrating to something $\leq 1 - Pr [N = 0]$

Distribution of $S$ if $Y$ is discrete #

For discrete $Y$ ’s we can get an expression similar to (1) for the pmf of $S$ :

$f_{S} (s) = \sum_{n = 0}^{\infty} \Pr [S = s | N = n] \Pr [N = n] = \sum_{n = 0}^{\infty} g^{* n} (s) \Pr [N = n], \qquad (2)$

where $g^{* 0} (0) = 1$ (and thus 0 anywhere else).

This can be implemented in a table and/or in a program.
However, if the range of $N$ is unbounded, calculating $f_{S} (s)$ may require infinitely many convolutions of $Y$ .
This formula is more efficient if the number of possible outcomes for $N$ is small.
$✠$ The pmf $g^{* n} (s)$ can be calculated using de Pril’s algorithm.
(see Module 4)

Example with tabular approach #

From Bowers et al. (1997), 12.2.2:

The convolutions are done in the usual way.
The number of columns depends on the range of $N$ .
The $f_{S} (x)$ are the sumproduct of the row $x$ and row $Pr [N = n]$ :

$f_{S} (3) = 0 \cdot 0.1 + 0.1 \cdot 0.3 + 0.4 \cdot 0.4 + 0.125 \cdot 0.2 .$
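
The tabular approach can be sketched in base R as follows, using the claim size and claim number pmfs of this example (the same vectors as in the aggregateDist call of the next section). The helper `conv_trunc` and the matrix name `G` are hypothetical names introduced for this sketch.

```r
fy <- c(0, 0.5, 0.4, 0.1)     # Pr[Y = 0], ..., Pr[Y = 3]
fn <- c(0.1, 0.3, 0.4, 0.2)   # Pr[N = 0], ..., Pr[N = 3]
s_max <- (length(fy) - 1) * (length(fn) - 1)   # largest possible value of S (here 9)

# Convolution of two pmfs on 0, ..., s_max, truncated at s_max
conv_trunc <- function(f, g) {
  sapply(0:s_max, function(s) sum(f[1:(s + 1)] * g[(s + 1):1]))
}

g1 <- c(fy, rep(0, s_max + 1 - length(fy)))    # pmf of Y padded with zeros

# Column n + 1 of G holds g^{*n}(x) for x = 0, ..., s_max
G <- matrix(0, nrow = s_max + 1, ncol = length(fn))
G[1, 1] <- 1                                   # g^{*0} is a point mass at 0
for (n in seq_len(length(fn) - 1)) G[, n + 1] <- conv_trunc(G[, n], g1)

fS <- as.vector(G %*% fn)                      # sumproduct of each row with Pr[N = n]
cbind(s = 0:s_max, fS = fS, FS = cumsum(fS))
```

Each row of `G` corresponds to one row of the table, and the matrix product performs all the sumproducts at once.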

Using R #

We will make extensive use of the function aggregateDist from the package actuar (Dutang, Goulet, and Pigeon 2008):

This function allows for several different aggregate distribution approaches, which will be introduced here (and in Module 4 as the associated theory is presented).
Here, we show how the function can be used to implement formulas (1) and (2) (using the function convolve in the background). This corresponds to the method="convolution" approach.

actuar::aggregateDist(method="convolution"):

A discrete distribution for $Y$ is required. Note that discretisation methods are discussed in Module 4. This is input as a vector of claim amount probability masses after the argument model.sev=. The first element must be $Pr [Y = 0]$ .
There is no restriction on the shape of the frequency distribution, but it must have a finite range. This is input as a vector of claim number probability masses after the argument model.freq=. The first element must be $Pr [N = 0]$ .
The outcome of the function is (1). Additional outputs:
- plot: to get a pretty plot of the df
- summary: to get summary statistics
- mean: to get the mean
- diff: to get the pmf
Additional options are:
- x.scale: currency units per unit of sev in the severity model (this allows calculations on multiples of $1)

# Bowers 12.2.2
library(actuar)
fy <- c(0, 0.5, 0.4, 0.1)    # Pr[Y = 0], ..., Pr[Y = 3]
fn <- c(0.1, 0.3, 0.4, 0.2)  # Pr[N = 0], ..., Pr[N = 3]
Fs <- aggregateDist("convolution", model.freq = fn, model.sev = fy)
mean(Fs)
## [1] 2.72
pmf <- c(Fs(0), diff(Fs(0:9)))
cbind(s = c(0:9), fs = pmf, Fs = Fs(0:9))
##       s     fs     Fs
##  [1,] 0 0.1000 0.1000
##  [2,] 1 0.1500 0.2500
##  [3,] 2 0.2200 0.4700
##  [4,] 3 0.2150 0.6850
##  [5,] 4 0.1640 0.8490
##  [6,] 5 0.0950 0.9440
##  [7,] 6 0.0408 0.9848
##  [8,] 7 0.0126 0.9974
##  [9,] 8 0.0024 0.9998
## [10,] 9 0.0002 1.0000

summary(Fs)
## Aggregate Claim Amount Empirical CDF:
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##    0.00    2.00    3.00    2.72    4.00    9.00
plot(Fs)

Explicit claims count distributions (MW 2.2) #

Introduction #

Exposure #

It makes no sense to talk about frequency in an insurance portfolio without considering exposure. Chapter 4 of Werner and Modlin (2010) defines exposure as “the basic unit that measures a policy’s exposure to loss”.
One primary criterion for choosing an exposure base is that it “should be directly proportional to expected loss”. Here we are focussing on frequency, so exposure should be something directly proportional to the expected frequency.
Wuthrich (2023) calls exposure “volume”, denoted $v$ , and defines the claims frequency as $\frac{N}{v} .$

Basic models for claims frequency #

In our case, we assume that the volume $v$ directly affects the likelihood of a claim occurring (the frequency), so that $N / v$ is normalised.
MW defines $p_{k} = \Pr [N = k]$ for $k \in A \subset \mathbb{N}_{0}$ , where $A$ is the set of possible frequency outcomes.
There are three main assumptions for $p_{k}$ :
- binomial (with variance less than mean)
- Poisson (with variance equal to the mean)
- negative-binomial (a Poisson with random mean, so that variance is more than the mean)
A summary table of those distributions is also given in Bowers et al. (1997), see Table 12.3.1 on page 376.
These all belong to the $(a, b)$ class of distributions, discussed in a later section.

Binomial distribution #

fixed volume $v \in \mathbb{N}$
fixed default probability $p \in (0, 1)$ (expected claims frequency)
pmf of $N \sim \text{Binom} (v, p)$ is $p_{k} = \Pr [N = k] = \binom{v}{k} p^{k} (1 - p)^{v - k}$ for all $k \in \{0, \dots, v\} = A .$
same as a sum of $v$ independent Bernoulli variables (the Bernoulli being the case $v = 1$ )
makes sense for a homogeneous portfolio with unique possible events, such as credit defaults, or deaths in a life insurance model
In R: dbinom, pbinom, qbinom, rbinom, where size is $v$ , and where prob is $p$
Note that $\binom{v}{k}$ can be computed with the R function choose.

Compound binomial model #

The total claim amount $S$ has a compound binomial distribution $S \sim CompBinom (v, p, G)$ if $S$ has a compound distribution with $N \sim Binom (v, p)$ for given $v \in N$ and $p \in (0, 1)$ and individual claim size distribution $G$ .

Corollary 2.7: Assume $S_{1}, \dots, S_{n}$ are independent with $S_{j} \sim CompBinom (v_{j}, p, G)$ for all $j = 1, \dots, n$ . The aggregated claim has a compound binomial distribution with $S = \sum_{j = 1}^{n} S_{j} \sim CompBinom (\sum_{j = 1}^{n} v_{j}, p, G) .$

Exercise NLI3 considers the decomposition of $S$ into small and large claims. It shows that $S_{lc}$ (the sum of only those claims exceeding a certain threshold $M$ ; see the notation in Example 2.16 later in these slides) is compound binomial again.

Poisson distribution #

fixed volume $v > 0$
expected claims frequency $λ > 0$
pmf of $N \sim \text{Poi} (λ v)$ is $p_{k} = \Pr [N = k] = e^{- λ v} \frac{(λ v)^{k}}{k!}$ for all $k \in A = \mathbb{N}_{0} .$
Lemma 2.9: increasing the volume $v$ in a binomial model while keeping $E [N] = v p$ fixed leads to a Poisson distribution in the limit (the approximation is better the smaller $p$ is relative to $v$ ).
In R: dpois, ppois, qpois, rpois, where lambda is $λ v$
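
A short sketch of the binomial and Poisson pmfs in R, and of the limit in Lemma 2.9. The mean of 2 and the grid of volumes are arbitrary values chosen for illustration.

```r
k <- 0:15

# The binomial pmf from dbinom agrees with the explicit formula using choose
pmf_formula <- choose(10, k) * 0.2^k * 0.8^(10 - k)
all.equal(dbinom(k, size = 10, prob = 0.2), pmf_formula)

# Lemma 2.9: keep E[N] = v * p = mu fixed and let the volume v grow
mu <- 2
v_grid <- c(10, 100, 10000)
max_diff <- sapply(v_grid, function(v) {
  max(abs(dbinom(k, size = v, prob = mu / v) - dpois(k, lambda = mu)))
})
cbind(v = v_grid, max_abs_diff = max_diff)
```

The largest pointwise gap between the two pmfs shrinks as $v$ grows, which is the convergence stated in the lemma.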

Compound Poisson model #

The total claim amount $S$ has a compound Poisson distribution $S \sim CompPoi (λ v, G)$ if $S$ has a compound distribution with $N \sim Poi (λ v)$ for given $λ, v > 0$ and individual claim size distribution $G$ .

The compound Poisson distribution has nice properties such as:
- The aggregation property $↑$
- The disjoint decomposition property $↓$
These are reviewed in the next section, along with related new techniques for computing the distribution of $S$ .

Mixed Poisson distribution #

Inhomogeneous portfolio #

So far we have seen distributions with variance less (binomial) or exactly equal (Poisson) to the mean.
In reality, actuarial data is often overdispersed, that is, variance is larger than mean.
This could be due to frequency or severity, but it makes sense that some of this extra variability would come from frequency.
If we believe in a Poisson frequency for known frequency parameter, then additional uncertainty such as heterogeneity of risks in a portfolio, uncertain conditions (weather, for instance) could be modelled with a random Poisson parameter, and could explain the extra variability.
This is the idea of a mixed Poisson.

The mixed Poisson distribution #

Assume random $Λ \sim H$ with $H (0) = 0$ , $E [Λ] = λ$ , and $V a r (Λ) > 0$ .
Conditionally, given $Λ$ , $N \sim Poi (Λ v)$ for fixed volume $v > 0$ .

We have then

$\begin{array}{rcl} Pr [N = n] & = & \int_{0}^{\infty} Pr [N = n | Λ = λ] d H (λ) = \int_{0}^{\infty} \frac{e^{- λ v} (λ v)^{n}}{n!} d H (λ); \\ E [N] & = & E [E [N | Λ]] = E [Λ] v = λ v; \\ V a r (N) & = & E [V a r (N | Λ)] + V a r (E [N | Λ]) = λ v + v^{2} V a r (Λ) > λ v; \\ M_{N} (t) & = & E [e^{t N}] = E [E [e^{t N} | Λ]] = E [e^{Λ v (e^{t} - 1)}] = M_{Λ} (v [e^{t} - 1]) . \end{array}$

Example #

If $Λ \sim \text{inverse Gaussian} (α, β)$ (Example 12.3.2):

$N$ is Poisson Inverse Gaussian.
This distribution is the pig distribution in actuar (so that you can use dpig, ppig, etc.); see Section 5 of the vignette “distribution” of actuar.
$⟶$ $S$ will be compound Poisson inverse Gaussian.

Another example, which is very famous, is $Λ \sim Γ$ , which leads to the negative-binomial distribution.

Negative-binomial distribution #

Assume $λ$ is the mean, and will be “spread” according to a gamma distribution:

Define $Λ = λ Θ$ .
Now, $Θ \sim Γ (γ, γ)$ such that $E [Θ] = 1$ and $\text{Var} (Θ) = \frac{1}{γ}$ , and hence $E [Λ] = λ$ and $\text{Var} (Λ) = \frac{λ^{2}}{γ} .$
If conditionally, given $Θ$ , $N \sim \text{Poi} (Θ λ v)$ , then $N \sim \text{NegBin} (λ v, γ)$ with volume $v > 0$ , expected claims frequency $λ > 0$ , and dispersion parameter $γ > 0$ .

Proof:

$\begin{array}{rcl} M_{N} (t) & = & E [e^{t N}] = E [E [e^{t N} | Θ]] = E [e^{Θ λ v (e^{t} - 1)}] \\ & = & {(\frac{γ}{γ - λ v (e^{t} - 1)})}^{γ} = {(\frac{γ}{γ + λ v - λ v e^{t}})}^{γ} = {(\frac{\frac{γ}{λ v + γ}}{1 - \frac{λ v}{λ v + γ} e^{t}})}^{γ}, \end{array}$

which can be recognised as a negative-binomial with probability of “failure” $p = \frac{λ v}{λ v + γ}$ (if we count failures until the $γ$ -th success) so that $p_{k} = \Pr [N = k] = \binom{k + γ - 1}{k} p^{k} (1 - p)^{γ} .$ In R, use dnbinom, pnbinom, qnbinom, rnbinom, where size is $γ$ and prob is the probability of success $1 - p$ (note that the volume is hidden in $p$ and will affect the scale of the distribution).
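
The mixing argument can be checked numerically, as a sketch: integrate the conditional Poisson pmf against the $Γ (γ, γ)$ density of $Θ$ and compare with `dnbinom`. The parameter values are arbitrary, and the name `gam` stands for $γ$ (to avoid masking the base R function `gamma`).

```r
lambda <- 0.1   # expected claims frequency (illustrative)
v <- 50         # volume (illustrative)
gam <- 2.5      # dispersion parameter (illustrative)
p <- lambda * v / (lambda * v + gam)   # probability of "failure"
k <- 0:20

# Pr[N = k] as a mixture: integral of the Poisson pmf against the Gamma density
mixed <- sapply(k, function(kk) {
  integrate(function(th) dpois(kk, th * lambda * v) * dgamma(th, shape = gam, rate = gam),
            lower = 0, upper = Inf)$value
})

# Negative-binomial pmf with size = gam and probability of success 1 - p
nb <- dnbinom(k, size = gam, prob = 1 - p)

max(abs(mixed - nb))
```

The same probabilities can also be obtained with `dnbinom(k, size = gam, mu = lambda * v)`, which makes the role of the mean $λ v$ explicit.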

Interpretation #

$Θ$ reflects the uncertainty about the “true” parameter of the Poisson distribution.
Alternatively, it describes the distributions of “$\lambda$’s” in the population.
In the end we have

$\begin{array}{rcl} E [N] & = & λ v, \\ V a r (N) & = & λ v (1 + \frac{λ v}{γ}) > λ v, \\ Vco (\frac{N}{v}) & = & \sqrt{(λ v)^{- 1} + γ^{- 1}} . \end{array}$

This additional uncertainty is not diversifiable (it remains even for large $v$ ): $\text{Vco} (\frac{N}{v}) = \sqrt{(λ v)^{- 1} + γ^{- 1}} \to γ^{- 1 / 2} > 0$ for $v \to \infty .$

Compound negative-binomial model #

The total claim amount $S$ has a compound negative-binomial distribution $S \sim CompNB (λ v, γ, G)$ if $S$ has a compound distribution with $N \sim NegBin (λ v, γ)$ for given $λ, v, γ > 0$ and individual claim size distribution $G$ .

Additional properties and applications of Poisson frequencies #

Theorem 2.12: Aggregation property $↑$ #

Assume $S_{1}, \dots, S_{n}$ are independent with $S_{j} \sim \text{CompPoi} (λ_{j} v_{j}, G_{j})$ for all $j = 1, \dots, n$ . Aggregated claims have a compound Poisson distribution $S = \sum_{j = 1}^{n} S_{j} \sim \text{CompPoi} (λ v, G),$ with $v = \sum_{j = 1}^{n} v_{j}, \quad λ = \sum_{j = 1}^{n} \frac{v_{j}}{v} λ_{j}, \quad G = \sum_{j = 1}^{n} \frac{λ_{j} v_{j}}{λ v} G_{j} .$

So what?

$n$ independent portfolios of losses can be easily aggregated.
Alternatively (or in addition), total claims paid over $n$ years are compound Poisson, even if the severity and frequency of losses vary across years.
“Bottom-up” modelling
In Bowers et al. (1997), this is Theorem 12.4.1.
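
A minimal sketch of the aggregation formulas for two portfolios with discrete claim sizes. All numbers below are hypothetical and chosen only for illustration.

```r
# Two hypothetical portfolios (illustrative numbers only)
lambda_j <- c(0.10, 0.05)          # expected claims frequencies
v_j <- c(100, 400)                 # volumes
y <- 1:3                           # common support of the claim sizes
g_j <- rbind(c(0.5, 0.3, 0.2),     # pmf of claim sizes in portfolio 1
             c(0.2, 0.3, 0.5))     # pmf of claim sizes in portfolio 2

# Parameters of the aggregate compound Poisson distribution (Theorem 2.12)
v <- sum(v_j)
lambda <- sum(v_j / v * lambda_j)
w <- lambda_j * v_j / (lambda * v)   # mixing weights for the claim size distribution
g <- as.vector(w %*% g_j)            # pmf of a claim in the aggregate portfolio

c(v = v, lambda = lambda, lambda_v = lambda * v)
g

# Consistency check on the mean: sum of E[S_j] versus E[S] in the aggregate model
ES_sum <- sum(lambda_j * v_j * as.vector(g_j %*% y))
ES_agg <- lambda * v * sum(y * g)
c(ES_sum = ES_sum, ES_agg = ES_agg)
```

The mixing weights are the shares of expected claim counts, so the portfolio that produces more claims has more influence on the aggregate severity.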

Example 12.4.1 of Bowers et al. (1997) #

Suppose that $N_{1}, N_{2}, \dots, N_{m}$ are independent random variables. Further, suppose that $N_{i}$ follows Poisson($\lambda_i$). Let $y_{1}, y_{2}, \dots, y_{m}$ be deterministic numbers. What is the distribution of $y_{1} N_{1} + \dots + y_{m} N_{m} ?$

Theorem 2.14: Disjoint decomposition property $↓$ #

Preliminary 1: Add LoBs in the CompPoi formulation #

Let us introduce Lines of Business (“LoB”) in the notation:

Let the set $\{1, \dots, m\}$ be a partition of the portfolio, or different lines of business (“LoB” hereafter). For instance, we could have $j \in \{1, 2, 3\}$ for car $(j = 1)$ , building $(j = 2)$ and liability $(j = 3)$ LoBs.
Let ${(p_{j}^{+})}_{j = 1, \dots, m}$ be a discrete probability distribution on the finite set of sub-portfolios/LoBs $\{1, \dots, m\}$ (hereafter just “LoB”).
We assume $p_{j}^{+} > 0$ for all $j$ , that is, the probability of having claims in any of the $m$ LoBs is strictly positive.
We further assume that $G_{j}$ is the claim size distribution of LoB $j$ , with $G_{j} (0) = 0$ .

Finally, we define the mixture distribution by $G (y) = \sum_{j = 1}^{m} p_{j}^{+} G_{j} (y)$ for $y \in \mathbb{R} .$ This is the distribution of a claim, if we don’t know which LoB it comes from.
Note that this matches the formulation in the aggregation property Theorem 2.12 with $p_{j}^{+} = \frac{λ_{j} v_{j}}{λ v} .$
Now, define a discrete random variable $I$ which indicates which sub-portfolio/LoB a randomly selected claim $Y$ belongs to: $\Pr [I = j] = p_{j}^{+}$ for all $j \in \{1, \dots, m\} .$

We are now ready to define the following extended compound Poisson model:

The total claims $S = \sum_{i = 1}^{N} Y_{i}$ has a compound Poisson distribution as defined earlier.
In addition, we assume that $(Y_{i}, I_{i})_{i \geq 1}$ are
- mutually i.i.d. and independent of $N$ ,
- with $Y_{i}$ having marginal distribution function $G$ with $G (0) = 0$ , and
- $I_{i}$ having marginal distribution function given by $\Pr [I = j] = p_{j}^{+}$ for all $j \in \{1, \dots, m\}$ .

Preliminary 2: Partition #

The random vector $(Y_{1}, I_{1})$ takes values in $\mathbb{R}_{+} \times \{1, \dots, m\}$ .
On this set we choose a finite sequence of sets $A_{1}, \dots, A_{n}$ such that

$\begin{array}{rcl} A_{k} \cap A_{l} & = & \emptyset \text{ for all } k \neq l \text{ (no overlap);} \\ \cup_{k = 1}^{n} A_{k} & = & \mathbb{R}_{+} \times \{1, \dots, m\} \text{ (all-inclusive).} \end{array}$

Such a sequence is called a “measurable disjoint decomposition” or “partition” of $\mathbb{R}_{+} \times \{1, \dots, m\}$ .
This partition is called “admissible” for $(Y_{1}, I_{1})$ if for all $k = 1, \dots, n$ $p^{(k)} = \Pr [(Y_{1}, I_{1}) \in A_{k}] > 0.$ Note $\sum_{k = 1}^{n} p^{(k)} = 1$ due to the properties of the partition above (no overlap and all-inclusive).

We have two levels of partition:

Into LoBs:
- Claims are classified according to a sub-portfolio or LoB
- For instance: domestic motor and commercial motor
- The probability of a claim being in LoB $j$ is $p_{j}^{+}$
- The indicator for claim $i$ to be in LoB $j$ is $1_{\{I_{i} = j\}}$ (with probability $p_{j}^{+}$ of being 1)
Into a second level:
- Claims are classified according to another set of criteria
- For instance: geographical areas NSW and VIC
- The probability of a claim being in geographical area $k$ is $p^{(k)}$

Theorem 2.14: Disjoint decomposition $↓$ #

Assume $S$ is “doubly partitioned” as described above:

$S$ fulfills the extended compound Poisson model assumptions above (Preliminary 1).
We choose an admissible partition $A_{1}, \dots, A_{n}$ for $(Y_{1}, I_{1})$ (Preliminary 2).

Then the random variable (sum of claims for partition $k$ ): $S_{k} = \sum_{i = 1}^{N} Y_{i} 1_{\{(Y_{i}, I_{i}) \in A_{k}\}} \sim \text{CompPoi} (λ_{k} v_{k}, G_{k}),$ for $k = 1, \dots, n$ , with $λ_{k} v_{k} = λ v p^{(k)} > 0$ and $G_{k} (y) = \Pr [Y_{1} \leq y | (Y_{1}, I_{1}) \in A_{k}] .$ Furthermore, the $S_{k}$ ’s are independent (over $k$ ).

Thinning of the Poisson process #

Assume that $m = 1$ (only one LoB)
The disjoint decomposition theorem implies that $Y_{i} = Y_{i} 1_{\{Y_{i} \in A_{1}\}} + \dots + Y_{i} 1_{\{Y_{i} \in A_{n}\}} .$
For each partition $A_{k}$ (defined on the claims) a natural choice is
- $v_{k} = v$
- $λ_{k} = λ p^{(k)}$
This means that the volume remains constant in each partition, but the expected claims frequencies $λ_{k}$ change proportionally to the probabilities of falling in partition $A_{k}$ , $k = 1, \dots, n$ .
This is called thinning of the Poisson process.

$✠$ Sparse vector algorithm #

If $S \sim \text{compound Poisson} (λ, g (y_{i}) = π_{i})$ , $i = 1, \dots, m$ , then $S = y_{1} N_{1} + \dots + y_{m} N_{m},$ where the $N_{i}$ ’s

represent the number of claims of amount $y_{i}$ ;
are mutually independent;
are $\text{Poi} (λ_{i} = λ π_{i}) .$

Proof: see tutorial exercise los18. Note also that this is a special case of Theorem 2.14, and is Theorem 12.4.2 of Bowers et al. (1997).

So what?

Sparse vector algorithm: this allows us to develop an alternative method for tabulating the distribution of $S$ that is more efficient when $m$ is small.
$S$ can be used to approximate the Individual Risk Model if $X = I b$ (see Module 3).

$✠$ The sparse vector algorithm #

(Bowers et al. 1997, Example 12.4.2) Suppose $S$ has a compound Poisson distribution with $λ = 0.8$ and individual claim amount distribution

$y_i$		$Pr [Y = y_{i}]$
1		0.250
2		0.375
3		0.375

Compute $f_{S} (s) = \Pr [S = s]$ for $s = 0, 1, \dots, 6$ .

This can be done in two ways:

Basic method (seen earlier in the lecture): requires calculating up to the 6th convolution of $Y$ .
Sparse vector algorithm: requires no convolution of $Y$ .

Solution - Basic Method

$x$	$g^{* 0} (x)$	$g (x)$	$g^{* 2} (x)$	$g^{* 3} (x)$	$g^{* 4} (x)$	$g^{* 5} (x)$	$g^{* 6} (x)$	$f_{S} (x)$
0	1	-	-	-	-	-	-	0.4493
1	-	0.250	-	-	-	-	-	0.0899
2	-	0.375	0.0625	-	-	-	-	0.1438
3	-	0.375	0.1875	0.0156	-	-	-	0.1624
4	-	-	0.3281	0.0703	0.0039	-	-	0.0499
5	-	-	0.2813	0.1758	0.0234	0.0010	-	0.0474
6	-	-	0.1406	0.2637	0.0762	0.0073	0.0002	0.0309
$n$	0	1	2	3	4	5	6
$Pr [N = n] = e^{- 0.8} \frac{{(0.8)}^{n}}{n!}$	0.4493	0.3595	0.1438	0.0383	0.0077	0.0012	0.0002

The convolutions are done in the usual way.
The $f_{S} (x)$ are the sumproduct of the row $x$ and row $Pr [N = n]$ .
The number of convolutions (and thus of columns) will increase by 1 for each new value of $f_{S} (x)$ , without bound!

Solution - Sparse vector algorithm

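Thanks to Theorem 2.14 (see also Theorem 12.4.2 of Bowers et al. (1997)), we can write $S = N_{1} + 2 N_{2} + 3 N_{3}$ , where the $N_{i}$ ’s are independent Poisson with means $λ π_{i}$ .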

$x$	$Pr [N_{1} = x]$	$Pr [2 N_{2} = x]$	$Pr [3 N_{3} = x]$	$Pr [N_{1} + 2 N_{2} = x]$	$f_{S} (x)$
0	0.818731	0.740818	0.740818	0.606531	0.449329
1	0.163746	0	0	0.121306	0.089866
2	0.016375	0.222245	0	0.194090	0.143785
3	0.001092	0	0.222245	0.037201	0.162358
4	0.000055	0.033337	0	0.030974	0.049906
5	0.000002	0	0	0.005703	0.047360
6	0.000000	0.003334	0.033337	0.003288	0.030923
$x_{i}$	1	2	3
$λ_{i} = λ π_{i}$	0.2	0.3	0.3
$Pr [N_{i} = x / i]$	$e^{- 0.2} \frac{{(0.2)}^{x}}{x!}$	$e^{- 0.3} \frac{{(0.3)}^{x / 2}}{(x / 2)!}$	$e^{- 0.3} \frac{{(0.3)}^{x / 3}}{(x / 3)!}$

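Columns (5) and (6), that is $\Pr [N_{1} + 2 N_{2} = x]$ and $f_{S} (x)$ , are obtained by convolution (the column of $x$ values counts as column (1)). For instance, for $x = 3$ :

$\begin{array}{rcl} (5) [3] & = & .818731 \cdot 0 + .163746 \cdot .222245 + .016375 \cdot 0 + .001092 \cdot .740818 \\ (6) [3] & = & .740818 \cdot .037201 + 0 \cdot .194090 + 0 \cdot .121306 + .222245 \cdot .606531 \end{array}$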

Note that only two convolutions are needed: columns (5) and (6).
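
The sparse vector algorithm for this example can be sketched in R as follows. The helper `conv_trunc` and the variable names are hypothetical, introduced for this sketch only; the data are those of the example.

```r
lambda <- 0.8
y <- c(1, 2, 3)                    # possible claim amounts
pi_y <- c(0.250, 0.375, 0.375)     # Pr[Y = y_i]
s_max <- 6                         # largest value of s required

# Convolution of two pmfs on 0, ..., s_max, truncated at s_max
conv_trunc <- function(f, g) {
  sapply(0:s_max, function(s) sum(f[1:(s + 1)] * g[(s + 1):1]))
}

fS <- c(1, rep(0, s_max))          # start from a point mass at 0
for (i in seq_along(y)) {
  # pmf of y_i * N_i on 0, ..., s_max, with N_i ~ Poi(lambda * pi_i):
  # mass only at multiples of y_i (hence the "sparse vector")
  fi <- numeric(s_max + 1)
  k <- 0:floor(s_max / y[i])
  fi[k * y[i] + 1] <- dpois(k, lambda * pi_y[i])
  fS <- conv_trunc(fS, fi)
}
round(cbind(s = 0:s_max, fS = fS), 6)
```

The loop performs one convolution per distinct claim amount, regardless of how many values of $s$ are needed, whereas the basic method needs one more convolution of $Y$ for every additional value of $s$ .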

Example 2.16: Large claim separation #

This is a very important (and convenient) application of the Disjoint decomposition property (Theorem 2.14).
Attritional and catastrophic claims often have very different distributions (different $G$ ’s); see also https://www.actuaries.digital/2022/01/10/catastrophe-vs-standard-loss-modelling/
The idea here is to divide the claims into different layers with different distributions:
- Small claims are modelled using a parametric distribution for which it is easy to obtain the distribution of the compound distribution, potentially even approximated with a normal distribution thanks to volume and light right tail;
- Large claims are typically modelled with a Pareto distribution with threshold $M$ and tail parameter $α > 1$ (see Module 6 for a justification of this, and for the choice of an appropriate $M$ ). They could also be “modelled” (see article above).

Assuming two layers:

We choose a large claims threshold $M > 0$ such that $0 < G (M) < 1,$ that is, there is probability mass on either side of $M$ .
We define the partition $A = A_{sc} = \{Y_{1} \leq M\}$ and $A^{c} = A_{lc} = \{Y_{1} > M\} .$
Assume that $S \sim \text{CompPoi} (λ v, G) .$
We now define the small and large claims layers as $\begin{array}{rcl} S_{sc} & = & \sum_{i = 1}^{N} Y_{i} 1_{\{Y_{i} \leq M\}}, \text{ and} \\ S_{lc} & = & \sum_{i = 1}^{N} Y_{i} 1_{\{Y_{i} > M\}}, \end{array}$ respectively.

Theorem 2.14 implies that $S_{sc}$ and $S_{lc}$ are independent and compound Poisson distributed with $\begin{array}{rcl} S_{sc} & \sim & \text{CompPoi} (λ_{sc} v = λ G (M) v, \; G_{sc} (y) = \Pr [Y_{1} \leq y | Y_{1} \leq M]), \text{ and} \\ S_{lc} & \sim & \text{CompPoi} (λ_{lc} v = λ (1 - G (M)) v, \; G_{lc} (y) = \Pr [Y_{1} \leq y | Y_{1} > M]), \end{array}$ respectively.
The distribution of $S = S_{sc} + S_{lc}$ can then be obtained by a simple convolution of the distributions of $S_{sc}$ and $S_{lc}$ (thanks to independence); see Module 4 for examples.
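
A simulation sketch of the large claim separation. The lognormal severity, the value $λ v = 50$ , the choice $G (M) = 0.95$ and the seed are all assumptions made for illustration; nothing here is specific to the Pareto model mentioned above.

```r
set.seed(2024)
n_sim <- 20000        # number of simulated periods
lambda_v <- 50        # expected number of claims per period (illustrative)
G_M <- 0.95           # probability that a claim is "small" (illustrative)
M <- qlnorm(G_M, meanlog = 0, sdlog = 1)   # threshold with G(M) = 0.95

N <- rpois(n_sim, lambda_v)                # claim counts per period
period <- rep(seq_len(n_sim), times = N)   # period label of each claim
Y <- rlnorm(sum(N), meanlog = 0, sdlog = 1)

# Number of large and small claims in each period
N_lc <- tabulate(period[Y > M], nbins = n_sim)
N_sc <- N - N_lc

# Thinning: both counts should look Poisson, with means lambda * v * G(M)
# and lambda * v * (1 - G(M)), and they should be uncorrelated
c(mean_sc = mean(N_sc), target_sc = lambda_v * G_M)
c(mean_lc = mean(N_lc), var_lc = var(N_lc), target_lc = lambda_v * (1 - G_M))
cor(N_sc, N_lc)
```

A sample correlation near zero is consistent with (but of course does not prove) the independence of the two layers stated in Theorem 2.14.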

$✠$ Parameter estimation (MW 2.3) #

$✠$ Introduction #

$✠$ Estimation methods #

You should be familiar with the main estimation methods:

Method of moments
Maximum likelihood estimation

Here the problem is slightly complicated because our observations may not be directly comparable due to varying exposures $v$ ’s.

Assume that $(N_{1}, \dots, N_{T})^{'}$ is the vector of observations.

$✠$ What to do with volumes? Lemma 2.26 #

The key idea here is to find the minimum variance method of moments estimator, when the volumes across the observations can vary.
This is what is different from a straight method of moments estimator, and explains why we need to think it through: how to deal with those volumes?
Assume there exist strictly positive volumes $v_{1}, \dots, v_{T}$ such that the components of $(N_{1} / v_{1}, \dots, N_{T} / v_{T})$ are independent with $λ = E [\frac{N_{t}}{v_{t}}]$ and $τ_{t}^{2} = \text{Var} (\frac{N_{t}}{v_{t}}) \in (0, \infty)$ for all $t = 1, \dots, T$ .

Lemma 2.26 states that the unbiased, linear estimator for $λ$ with minimal variance is given by ${\hat{λ}}_{T}^{MV} = {(\sum_{t = 1}^{T} \frac{1}{τ_{t}^{2}})}^{- 1} \sum_{t = 1}^{T} \frac{N_{t} / v_{t}}{τ_{t}^{2}},$ with variance $V a r ({\hat{λ}}_{T}^{MV}) = {(\sum_{t = 1}^{T} \frac{1}{τ_{t}^{2}})}^{- 1} .$
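
A minimal sketch of the estimator in Lemma 2.26. The function name `lambda_mv` and the data are hypothetical. In the Poisson case $τ_{t}^{2} = λ / v_{t}$ , so the weights are proportional to $v_{t}$ and the unknown $λ$ cancels.

```r
# Hypothetical observations (illustrative numbers only)
N <- c(90, 140, 120, 150, 100)          # claim counts per period
v <- c(1000, 1200, 1500, 1100, 1300)    # volumes per period

# Minimal variance linear unbiased estimator with weights proportional to 1 / tau_t^2
lambda_mv <- function(N, v, tau2) {
  w <- (1 / tau2) / sum(1 / tau2)
  sum(w * N / v)
}

# Poisson case: tau_t^2 is proportional to 1 / v_t
est_mv <- lambda_mv(N, v, tau2 = 1 / v)
est_pooled <- sum(N) / sum(v)
c(est_mv = est_mv, est_pooled = est_pooled)
```

Only the relative sizes of the $τ_{t}^{2}$ matter for the weights, which is why passing `1 / v` is enough here.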

Note:

We haven’t made any distributional assumption yet - this estimates $E [\frac{N_{t}}{v_{t}}]$ via method of moments, taking the $v_{t}$ ’s into account in an optimal way (in the sense that it minimises the variance of the estimator).
The superscript “MV” stands for “minimal variance”.

$✠$ Method of moments #

$✠$ Binomial and Poisson cases #

Unbiased, minimal variance estimators:

binomial case for $p$ : ${\hat{p}}_{T}^{MV} = \frac{1}{\sum_{s = 1}^{T} v_{s}} \sum_{t = 1}^{T} N_{t} = \sum_{t = 1}^{T} \frac{v_{t}}{\sum_{s = 1}^{T} v_{s}} \frac{N_{t}}{v_{t}} .$ Furthermore, $\sum_{t = 1}^{T} N_{t} \sim \text{Binom} (\sum_{s = 1}^{T} v_{s}, p)$ , which means we know the distribution of ${\hat{p}}_{T}^{MV}$ .
Poisson case for $λ$ : ${\hat{λ}}_{T}^{MV} = \frac{1}{\sum_{s = 1}^{T} v_{s}} \sum_{t = 1}^{T} N_{t} = \sum_{t = 1}^{T} \frac{v_{t}}{\sum_{s = 1}^{T} v_{s}} \frac{N_{t}}{v_{t}}$ Here, $\sum_{t = 1}^{T} N_{t} \sim Poi (λ \sum_{s = 1}^{T} v_{s})$ .

$✠$ Negative binomial case #

More complicated, because $E [\frac{N_{t}}{v_{t}}] = λ$ and $\text{Var} (\frac{N_{t}}{v_{t}}) = λ / v_{t} + λ^{2} / γ = τ_{t}^{2}$ , so the optimal weights depend on the unknown parameters. Unbiased (but not guaranteed minimal variance): ${\hat{λ}}_{T}^{NB} = \frac{1}{\sum_{s = 1}^{T} v_{s}} \sum_{t = 1}^{T} N_{t} = \sum_{t = 1}^{T} \frac{v_{t}}{\sum_{s = 1}^{T} v_{s}} \frac{N_{t}}{v_{t}} .$

$✠$ We need a sense of the dispersion for estimating the dispersion parameter $γ$ .

Let the weighted sample variance be ${\hat{V}}_{T}^{2} = \frac{1}{T - 1} \sum_{t = 1}^{T} v_{t} {(\frac{N_{t}}{v_{t}} - {\hat{λ}}_{T}^{NB})}^{2} .$ Then we have ${\hat{γ}}_{T}^{NB} = \frac{({\hat{λ}}_{T}^{NB})^{2}}{{\hat{V}}_{T}^{2} - {\hat{λ}}_{T}^{NB}} \frac{1}{T - 1} (\sum_{t = 1}^{T} v_{t} - \frac{\sum_{t = 1}^{T} v_{t}^{2}}{\sum_{t = 1}^{T} v_{t}}),$ but only if ${\hat{V}}_{T}^{2} > {\hat{λ}}_{T}^{NB}$ . Otherwise use Poisson or binomial.
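
The method of moments estimators for the negative-binomial case can be sketched as follows. The data are hypothetical and `T_obs` stands for $T$ (to avoid masking R's shorthand `T` for `TRUE`).

```r
# Hypothetical observations (illustrative numbers only)
N <- c(90, 140, 120, 150, 100)          # claim counts per period
v <- c(1000, 1200, 1500, 1100, 1300)    # volumes per period
T_obs <- length(N)

lambda_hat <- sum(N) / sum(v)                               # frequency estimator
V2_hat <- sum(v * (N / v - lambda_hat)^2) / (T_obs - 1)     # weighted sample variance

if (V2_hat > lambda_hat) {
  # Overdispersion detected: the dispersion estimator is well defined
  gamma_hat <- lambda_hat^2 / (V2_hat - lambda_hat) *
    (sum(v) - sum(v^2) / sum(v)) / (T_obs - 1)
} else {
  gamma_hat <- NA   # no overdispersion: use a Poisson or binomial model instead
}
c(lambda_hat = lambda_hat, V2_hat = V2_hat, gamma_hat = gamma_hat)
```

The explicit branch mirrors the condition in the text: when the weighted sample variance does not exceed the estimated frequency, the formula would give a negative or infinite dispersion estimate.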

$✠$ Maximum likelihood estimators #

$✠$ Binomial and Poisson cases #

Estimators are identical to the method of moments estimators. In particular, the MLEs are unbiased here.

binomial case for $p$ : ${\hat{p}}_{T}^{MLE} = \frac{1}{\sum_{s = 1}^{T} v_{s}} \sum_{t = 1}^{T} N_{t} = \sum_{t = 1}^{T} \frac{v_{t}}{\sum_{s = 1}^{T} v_{s}} \frac{N_{t}}{v_{t}} = {\hat{p}}_{T}^{MV}$
Poisson case for $λ$ : ${\hat{λ}}_{T}^{MLE} = \frac{1}{\sum_{s = 1}^{T} v_{s}} \sum_{t = 1}^{T} N_{t} = \sum_{t = 1}^{T} \frac{v_{t}}{\sum_{s = 1}^{T} v_{s}} \frac{N_{t}}{v_{t}} = {\hat{λ}}_{T}^{MV}$

$✠$ Negative binomial case #

Assume $N_{1}, \dots, N_{T}$ are independent with $N_{t} \sim NegBin (λ v_{t}, γ)$ . The MLEs $({\hat{λ}}_{T}^{MLE}, {\hat{γ}}_{T}^{MLE})$ are the solution of $\frac{\partial}{\partial (λ, γ)} \sum_{t = 1}^{T} \left[ \log \binom{N_{t} + γ - 1}{N_{t}} + γ \log (1 - p_{t}) + N_{t} \log p_{t} \right] = 0,$ with $p_{t} = λ v_{t} / (γ + λ v_{t}) \in (0, 1)$ .
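The score equations have no closed-form solution, so in practice the log-likelihood is maximised numerically. The following is a minimal sketch using `optim` from base R. The helper name `nb_negloglik`, the simulated data and the starting values are assumptions for this illustration. Note that R's `dnbinom(size = γ, mu = λ v_t)` has success probability $γ / (γ + λ v_{t}) = 1 - p_{t}$ , which corresponds to the pmf in the likelihood above.

```r
# Negative log-likelihood; parameters are on the log scale so that
# lambda and gamma stay positive during the optimisation
nb_negloglik <- function(theta, N, v) {
  lambda <- exp(theta[1])
  gamma <- exp(theta[2])
  -sum(dnbinom(N, size = gamma, mu = lambda * v, log = TRUE))
}

# Hypothetical data: volumes and true parameters are made up
set.seed(123)
v <- rep(c(1000, 2000, 1500, 2500), 10)
N <- rnbinom(length(v), size = 10, mu = 0.05 * v)

# Start from the moment estimate of lambda and an arbitrary gamma = 1
start <- c(log(sum(N) / sum(v)), log(1))
fit <- optim(start, nb_negloglik, N = N, v = v)
mle <- c(lambda = exp(fit$par[1]), gamma = exp(fit$par[2]))
mle
```

It is worth checking `fit$convergence` (0 indicates success) and trying a few starting values, since the likelihood can be quite flat in $γ$ when over-dispersion is weak.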

The $(a, b, 0)$ and $(a, b, 1)$ classes of distributions #

The $(a, b)$ class of Panjer distributions (4.2.1) #

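Consider the class of distributions with the following property:

$Pr [N = n] = (a + \frac{b}{n}) Pr [N = n - 1], \quad n = 1, 2, \dots, \quad \text{or equivalently} \quad \frac{p_{k}}{p_{k - 1}} = a + \frac{b}{k}, \quad k = 1, 2, \dots,$ where $p_{k} = Pr [N = k]$ .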

This is the $(a, b)$ class of “Panjer distributions”. This means that $Pr [N = n]$ can be obtained recursively with initial value $Pr [N = 0]$ ; see Wuthrich (2023), Definition 4.6.

The exhaustive list of its members (see Wuthrich 2023 Lemma 4.7) is

Distribution	$a$	$b$	$Pr [N = 0]$
Poisson $(λ)$	$0$	$λ$	$e^{- λ}$
Neg Bin $(γ, p)$	$p$	$(γ - 1) p$	$(1 - p)^{γ}$
Binomial $(m, p)$	$- p / (1 - p)$	$(m + 1) p / (1 - p)$	$(1 - p)^{m}$

Exercise: prove the results in the above table!

(Note the Negative Binomial is parametrised as per Proposition 2.20 in Wuthrich (2023) (second definition))
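A numerical check is not a proof, but it is a quick way to confirm the table. The sketch below builds each pmf from the recursion and compares it with the base R densities. The helper `panjer_pmf` and the parameter values are assumptions for this illustration. With the parametrisation used here, R's `dnbinom` must be called with `prob = 1 - p`.

```r
# Build Pr[N = 0], ..., Pr[N = kmax] from the (a, b) recursion
panjer_pmf <- function(a, b, p0, kmax) {
  p <- numeric(kmax + 1)
  p[1] <- p0                      # p[k + 1] stores Pr[N = k]
  for (k in seq_len(kmax)) p[k + 1] <- (a + b / k) * p[k]
  p
}

k <- 0:10
lambda <- 2.5          # Poisson parameter
gam <- 3; p <- 0.4     # negative binomial parameters
m <- 10; pb <- 0.3     # binomial parameters

pois <- panjer_pmf(0, lambda, exp(-lambda), 10)
nb   <- panjer_pmf(p, (gam - 1) * p, (1 - p)^gam, 10)
bin  <- panjer_pmf(-pb / (1 - pb), (m + 1) * pb / (1 - pb), (1 - pb)^m, 10)

all.equal(pois, dpois(k, lambda))
all.equal(nb, dnbinom(k, size = gam, prob = 1 - p))
all.equal(bin, dbinom(k, m, pb))
```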

First three cumulants of the $(a, b)$ family #

Distribution	$E [N]$	$\operatorname{Var} (N)$	$E [(N - E [N])^{3}]$
Poisson $(λ)$	$λ$	$λ$	$λ$
Neg Bin $(γ, p)$	$\frac{γ p}{1 - p}$	$\frac{γ p}{(1 - p)^{2}}$	$\frac{γ p (1 + p)}{(1 - p)^{3}}$
Binomial $(m, p)$	$m p$	$m p (1 - p)$	$m p (1 - p) (1 - 2 p)$
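As a sanity check of the negative binomial row, the moments can be computed by brute-force summation over a long truncated support. The parameter values and the truncation point are arbitrary choices for this illustration.

```r
gam <- 3; p <- 0.4
k <- 0:2000                                  # truncated support; the tail is negligible
f <- dnbinom(k, size = gam, prob = 1 - p)    # R's prob is 1 - p in this parametrisation

m1 <- sum(k * f)                             # mean
m2 <- sum((k - m1)^2 * f)                    # variance
m3 <- sum((k - m1)^3 * f)                    # third central moment

numeric_moments <- c(m1, m2, m3)
table_moments <- c(gam * p / (1 - p),
                   gam * p / (1 - p)^2,
                   gam * p * (1 + p) / (1 - p)^3)
all.equal(numeric_moments, table_moments)
```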

Exercise:

check these results using the cgf
find the first 3 cumulants of $S$ , as well as $ς_{S}$ for each member of the family

$✠$ `actuar` and the $(a, b, 1)$ class #

The package actuar extends the definition above to allow for zero-truncated and zero-modified distributions.
The Poisson, binomial and negative-binomial (and its special case, the geometric) distributions are all well supported in base R with the d, p, q and r functions.
If one takes the Panjer equation for granted, then we can think of $p_{0}$ as the mass that will make the pmf add up to one: $\text{given Panjer: } p_{0} \text{ is such that } \sum_{k = 0}^{\infty} p_{k} = 1.$
We introduce here the $(a, b, 1)$ class, which extends the idea above so that we have more freedom on the mass at 0.
The reference for this section is Section 4 of the vignette “distribution” of actuar.

$✠$ The $(a, b, 1)$ class of distributions #

A discrete random variable is a member of the **$(a, b, 1)$ class of distributions** if there exist constants $a$ and $b$ such that $\frac{p_{k}}{p_{k - 1}} = a + \frac{b}{k}, \quad k = 2, 3, \dots .$ Note:

The recursion starts at $k = 2$ for the $(a, b, 1)$ class.
The extra freedom allows the probability at zero to be set to any arbitrary number $0 \leq p_{0} \leq 1$ .
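To see how the extra freedom works, the sketch below builds an $(a, b, 1)$ pmf from a chosen $p_{0}$ : the recursion from $k = 2$ fixes the ratios between the masses on $k \geq 1$ , and $p_{1}$ then follows from requiring that these masses sum to $1 - p_{0}$ . The helper `ab1_pmf`, the parameter values and the truncation at `kmax` (assumed large enough for the tail to be negligible) are assumptions for this illustration.

```r
ab1_pmf <- function(a, b, p0, kmax) {
  stopifnot(kmax >= 2)
  # Unnormalised masses on k = 1..kmax: r_1 = 1, r_k = (a + b / k) * r_{k-1}
  r <- numeric(kmax)
  r[1] <- 1
  for (k in 2:kmax) r[k] <- (a + b / k) * r[k - 1]
  # Scale so that the masses on k >= 1 sum to 1 - p0
  c(p0, (1 - p0) * r / sum(r))
}

lambda <- 2.5
pm <- ab1_pmf(a = 0, b = lambda, p0 = 0.3, kmax = 60)   # Poisson-type recursion

# The masses on k >= 1 are a rescaled Poisson pmf
scaled_pois <- (1 - 0.3) / (1 - dpois(0, lambda)) * dpois(1:60, lambda)
all.equal(pm[-1], scaled_pois)
```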

$✠$ Zero-truncated distributions #

Setting $p_{0} = 0$ in the $(a, b, 1)$ class defines the subclass of zero-truncated distributions.
Members are the zero-truncated Poisson (actuar::ztpois), zero-truncated binomial (actuar::ztbinom), zero-truncated negative-binomial (actuar::ztnbinom), and the zero-truncated geometric (actuar::ztgeom).
Let $p_{k}^{T}$ denote the probability mass at $k$ for a zero-truncated distribution (“$T$” for truncated). We have $p_{k}^{T} = \begin{cases} 0, & k = 0; \\ \frac{p_{k}}{1 - p_{0}}, & k = 1, 2, \dots, \end{cases}$ where $p_{k}$ is the probability mass of the corresponding member of the $(a, b, 0)$ class, that is, the $(a, b)$ class.
actuar provides the d, p, q, and r functions of the zero-truncated distributions mentioned above.
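The truncation formula is easy to reproduce in base R. The sketch below does so for the Poisson case with an arbitrary $λ = 2.5$ and, if actuar happens to be installed, compares the result with `actuar::dztpois`.

```r
lambda <- 2.5
k <- 0:7

# Zero-truncated Poisson pmf: 0 at k = 0, p_k / (1 - p_0) otherwise
pt <- ifelse(k == 0, 0, dpois(k, lambda) / (1 - dpois(0, lambda)))

# Optional cross-check against actuar (skipped if the package is missing)
if (requireNamespace("actuar", quietly = TRUE)) {
  print(all.equal(pt, actuar::dztpois(k, lambda)))
}
```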

$✠$ Zero-modified distributions #

Setting $p_{0} \equiv p_{0}^{M}$ $(0 < p_{0}^{M} < 1)$ in the $(a, b, 1)$ class defines the subclass of zero-modified distributions (“$M$” for “modified”).
These distributions are discrete mixtures between a degenerate distribution at zero, and the corresponding distribution from the $(a, b, 0)$ class.
Let $p_{k}^{M}$ denote the probability mass at $k$ for a zero-modified distribution. We have then $p_{k}^{M} = (1 - \frac{1 - p_{0}^{M}}{1 - p_{0}}) 1_{\{k = 0\}} + \frac{1 - p_{0}^{M}}{1 - p_{0}} p_{k} .$ Alternatively, $p_{k}^{M} = \begin{cases} p_{0}^{M}, & k = 0; \\ \frac{1 - p_{0}^{M}}{1 - p_{0}} p_{k}, & k = 1, 2, \dots, \end{cases}$ where $p_{k}$ is the probability mass of the corresponding member of the $(a, b, 0)$ class.

Zero-truncated distributions are the boundary case $p_{0}^{M} = 0$ of zero-modified distributions, and in general $p_{k}^{M} = p_{0}^{M} 1_{\{k = 0\}} + (1 - p_{0}^{M}) p_{k}^{T} .$
Members are the zero-modified Poisson (actuar::zmpois), zero-modified binomial (actuar::zmbinom), zero-modified negative-binomial (actuar::zmnbinom), and the zero-modified geometric (actuar::zmgeom). actuar provides the d, p, q, and r functions of the zero-modified distributions mentioned above.

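library(actuar)  # provides dztpois() and dzmpois()
k <- 0:7         # plot against k itself rather than the vector index
# red: Poisson(2.5); blue: zero-truncated; green: zero-modified with p0 doubled
plot(k, dpois(k, 2.5), pch = 20, col = "red", ylim = c(0, 0.3),
  cex = 1.5, type = "b", xlab = "k", ylab = "probability mass")
points(k, dztpois(k, 2.5), pch = 20, col = "blue", type = "b")
points(k, dzmpois(k, 2.5, 2 * dpois(0, 2.5)), pch = 20, col = "green",
  type = "b")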
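The two expressions for $p_{k}^{M}$ (the mixture form and the form based on the zero-truncated pmf) can be checked against each other in base R. The sketch below uses the Poisson case with $λ = 2.5$ and a zero mass equal to twice the Poisson mass at zero; the variable names are chosen for this illustration only.

```r
lambda <- 2.5
k <- 0:7
p <- dpois(k, lambda)        # (a, b, 0) masses p_k
p0 <- dpois(0, lambda)       # original mass at zero
p0M <- 2 * p0                # modified mass at zero

# Mixture form: weight w on the Poisson, the rest on a point mass at 0
w <- (1 - p0M) / (1 - p0)
pm_mix <- (1 - w) * (k == 0) + w * p

# Form based on the zero-truncated pmf p_k^T
pT <- ifelse(k == 0, 0, p / (1 - p0))
pm_trunc <- p0M * (k == 0) + (1 - p0M) * pT

all.equal(pm_mix, pm_trunc)
```

Since $p_{0}^{M} > p_{0}$ here, the weight `w` is below one and every mass on $k \geq 1$ is scaled down relative to the Poisson.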

References #

Bowers, Newton L. Jr, Hans U. Gerber, James C. Hickman, Donald A. Jones, and Cecil J. Nesbitt. 1997. Actuarial Mathematics. 2nd ed. Schaumburg, Illinois: The Society of Actuaries.

Dutang, Christophe, Vincent Goulet, and Mathieu Pigeon. 2008. “actuar: An R Package for Actuarial Science.” Journal of Statistical Software 25 (7).

Werner, Geoff, and Claudine Modlin. 2010. Basic Ratemaking. Casualty Actuarial Society.

Wuthrich, Mario V. 2023. “Non-Life Insurance: Mathematics & Statistics.” Lecture notes. RiskLab, ETH Zurich; Swiss Finance Institute.
