
Generalized linear models (GLMs) and generalized linear mixed models (GLMMs) are fundamental tools for predictive analysis. These statistical tools have motivated many modern insurance-pricing techniques (Nelder and Verrall, 1997; Antonio and Beirlant, 2007; Wuthrich, 2019). Meanwhile, the credibility theory is a field within actuarial science that focuses on calculating risk premiums. It originated from Mowbray (1914) and Whitney (1918), who aimed to determine a premium that balanced the experience of an individual policyholder with that of a group of similar policyholders (Nelder and Verrall, 1997). While GLMMs and credibility theory are both widely used in risk classification in non-life insurance sectors, such as auto insurance, practitioners in these sectors tend to be more familiar with credibility theory.
GLMMs have several advantages over credibility theory. They offer a wider range of models and can be easily fitted and tested using standard statistical software. As a result, GLMMs provide additional modeling options for various actuarial problems, such as premium ratings and claims reserves. Furthermore, it is well known that credibility theory is closely related to the framework of GLMMs (Nelder and Verrall, 1997). While the insurance literature has some initial discussion on credibility theory based on GLMMs (Ohlsson, 2008; Lai and Sun, 2012), most assume above-average knowledge of the statistical aspects of GLMM, which makes its application inaccessible to traditional insurance practitioners.
This study aims to explain GLMs and GLMMs in plain actuarial language and demonstrate how to obtain a simple and fair premium. The remainder of the article is organized as follows. Section 2 introduces the notations used. Section 3 provides the general settings of risk classification procedures for insurance using the language of GLMs. Sections 4 and 5 explain the modeling and prediction methods for GLMs and GLMMs, respectively. Section 6 provides the theoretical link between GLMMs and credibility theory. Finally, the results are discussed in Section 7.
We denote ℕ, ℕ0, ℝ, and ℝ+ as the set of natural numbers, the set of non-negative integers, the set of real numbers, and the set of positive real numbers, respectively. We define
We differentiate a fixed effect from a random effect. The former is based on the policyholders’ observed risk characteristics, whereas the latter is based on the policyholders’ unobserved risk characteristics after controlling for fixed effects. Let the data matrix
where
We use Pois(
In terms of data, for each policyholder
where
First, we are interested in a fair premium, defined as , for the
should be understood as
The fair premium for each policy year is determined solely by the fixed effect. In insurance, a fair premium based on the fixed effect is called the a priori rate and the risk classification process based on the fixed effect is called the a priori risk classification procedure. The premium determined by the a priori risk classification procedure can be further updated based on the claim history as follows:
which is called the a posteriori rate. This premium adjustment mechanism based on claim history is called the a posteriori risk classification procedure. Typically, in insurance, the risk characteristics for a year are known at the beginning of each year. Hence, in predicting the premium in the
GLMs are a flexible method that extend the fixed-effect linear regression model to response variables, which are not normally distributed. The expected value of
where to a linear predictor.
In Model 1, the only unknown parameters are the coefficients
with the a priori rate for the .
We assume that we have a dataset in the form of (
where
where Λ
In this subsection, we predict the a posteriori rate in (
based on the assumption of independence in Model 1. The a posteriori rate in (
Throughout this study, we use the dataset for collision coverage on vehicles collected by the Wisconsin Local Government Property Insurance Fund (LGPIF) from 2006 to 2011 (Frees
To fit the GLM in Model 1, we can use the glm function in R, whose form is given by
> glm(formula , family=familytype (link=linkfunction ), data=dataset ,. . .).
The specific form of the function under the setting in Model 1 is given by
> glm(Frequency~factor(entityType)+avgCoverage, family=poisson(link=log), data=mydata).
If the categorical variable (entity type) is presented as a dummy variable (e.g., TypeCity, TypeCounty) in a dataset, the formula specifying the fixed effects part of the function is modified as follows:
> glm(Frequency~TypeCity+TypeCounty+· · · +avgCoverage, family=poisson(link=log), data=mydata).
The estimation results of this model are summarized in Table 3. The asterisk (*) signs indicate that the parameters are significant at the significance level of 0.05. Based on the table, the a posteriori rate, which is equivalent to the a priori rate, can be obtained as
Hence, the estimated parameter
Moreover, the third policyholder in the dataset, for example, is in a county and has an average coverage of 2.495, so that under Model 1, the estimated expectation is
GLMMs are a popular method to model correlated data; they extend GLMs by including random effects in the predictor. The conditional expected value of
where into a linear predictor
In Model 1, we assume that the selected explanatory variable
Now, we assume that only the first explanatory variable
If the unobserved explanatory variables are denoted as
where the explanatory variables are assumed invariant over time, we have the following conditional model:
By further defining
Regardless of whether observed or unobserved, the explanatory variables can also be assumed to be random variables. By placing an additional distributional assumption on the unobserved explanatory variables, Example 5.1 can be formally represented as follows:
In the case of a normal random effect, that is,
In Model 2, the mean of the random effect
with the mean of 1 and the dispersion parameter
However, in Model 1, we have
An important implication in the comparison of the covariance in (
are not independent under Model 2, the corresponding likelihood function under Model 2 cannot be written as the product of the likelihood function. Instead, we have the following log-likelihood function for the observations:
which requires one-dimensional integration. Here,
and
Next, for data in the form of (
For the estimation of unknown coefficients
The goal of the a posteriori risk classification is to predict the net premium for each policyholder. In Model 2, by definition, we have the following conditional expectation:
Because the random effect for the
the conditional expectation in (
Hence, the prediction of the a posteriori rate in Model 2 is reduced to the estimation of predicting the random effect in (
In Model 3, the likelihood function in (
where
Hence, the statistical estimation procedure is based on the simple optimization of (
from the fact that the conditional distribution
respectively. In the conjugate families, the posterior mean of the individual’s random effect in (
Because later acts as a multiplier of the expectation from the fixed effects alone (see the example at the end of Section 5, under Model 3),
suggests no modification in the premium. Additionally, the ratio
.
So far, we have analyzed the properties of GLMMs in Model 2, in which the random effect
We have
under the parameterization in Model 4. Unlike the gamma distributional assumption for the random effect as in Model 3, the lognormal distributional assumption for the random effect does not lead to a closed-form expression for the likelihood function in (
To fit the GLMMs in Model 2, we consider the hglm function in the hglm package (Rönnegård
> hglm(formula , family=familytype (link=linkfunction ),
random=var , rand.family=familytype (link=linkfunction ),
data=dataset ,. . .).
This is similar to the glm function in R but with additional terms for the random effect: random and rand.family. First, we specify the policyholder-specific random effect of interest as random=~1|Polic yNum. To easily specify the distribution of the random effect, we consider the conditional expectation of frequency as
and having a log link function. To use the gamma distribution for the random effect in the function, the random effect
> hglm(Frequency~TypeCity+· · · +avgCoverage,
family=poisson(link=log),
random=~1|PolicyNum,
rand.family=gamma(link=log),
data=mydata).
Under the settings in Model 4,
> hglm(Frequency~TypeCity+· · · +avgCoverage,
family=poisson(link=log),
random=~1|PolicyNum,
rand.family=gaussian(link=identity),
data=mydata).
Alternative functions to fit GLMM are the glmer function of the lme4 package and the glmmPQL function of the MASS package. The
glmer function is commonly used, and it fits the model via maximum likelihood. Meanwhile, glmmPQL uses the penalized quasi-likelihood approach to evaluate the likelihood of GLMM. The functions in Model 4 are
> glmer(Frequency~TypeCity+ · · · +avgCoverage+(1|PolicyNum),
family=poisson(link=log),
data=mydata),
and
> glmmPQL(Frequency~TypeCity+· · · +avgCoverage,
family=poisson(link=log),
random=~1|PolicyNum,
data=mydata).
The estimation results of these models for the parameters are summarized in Table 4. The estimates for the fixed effects differ slightly depending on the distribution of the random effect; however, we have the same interpretation for the tendency in risk characteristics. In this example, the function provides an estimate of the dispersion parameters for the random effect. Because the dispersion parameter of the gamma distribution is the reciprocal of the shape parameter and has a mean of 1,
Here,
and under Model 4, it is
Although GLMMs offer a unified approach for calculating the a posteriori insurance rate, they often lack closed-form expressions, except for a few distributional assumptions; for instance, see Model 3. In this section, we demonstrate the incorporation of the credibility theory into the framework of GLMMs by providing a closed-form approximation of the a posteriori rate.
For simplicity, we consider the settings in Model 2 with the further assumption that the risk characteristics are the same across years; that is,
where
Following the classical procedure in the Bühlmann premium (Bühlmann and Gisler, 2006), the optimal premium for the
where
Furthermore, , and
can be obtained as
from equations
where
In this study, we present an intuitive explanation of the structure of GLMs and GLMMs and their relationship with insurance pricing methods; and we explain their connection with the Bühlmann method in insurance. With recent advances in computing tools, such as deep neural networks, insurance pricing can become more accurate through the use of complex predictive expressions. However, the traditional credibility method, which seeks an intuitive representation of insurance prices, seems at odds with modern statistical techniques. An interesting future topic is to explore how to adapt modern statistical computing tools to the credibility method in order to improve accuracy while preserving its intuitive and simple form.
Here, we mainly focus on the Poisson distribution in GLM and GLMMs to model the frequency of insurance claims, which are commonly used for count data. However, based on the assumption that the variance and mean are the same in the distribution, there are limitations, such as incorrect inference when overdispersion is present in the dataset. To address overdispersion in the Poisson model, overdispersion adjustments can be made using the quasi-likelihood method (quasi-Poisson model) or an alternative distribution method, such as negative binomial distribution. We note that, with some limitation (Lee
Typical data example
ID | Calendar year | Salary ( | Home value ( | · · · | Age ( | Frequency ( |
---|---|---|---|---|---|---|
1 | 2017 | 1500 | 20500 | · · · | 37 | 0 |
1 | 2018 | 1470 | 20900 | · · · | 38 | 1 |
1 | 2019 | 1563 | 20500 | · · · | 39 | 0 |
⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ |
102 | 2017 | 1401 | 45000 | · · · | 52 | 2 |
102 | 2018 | 1573 | 47000 | · · · | 53 | 0 |
102 | 2019 | 1562 | 50000 | · · · | 54 | 0 |
Observable policy characteristics used as covariates
Continuous variable | Description | Proportions | ||
---|---|---|---|---|
Type of local government entity | ||||
Miscellaneous | 5.03% | |||
City | 9.66% | |||
Entity type | County | 11.47% | ||
School | 36.42% | |||
Town | 16.90% | |||
Village | 20.52% | |||
Continuous variable | Min | Mean | Max | |
avgCoverage | Collision coverage amount | 0.009 | 1.097 | 13.135 |
Estimation results under Model 1
Parameter | Est | Std.dev | z | |||
---|---|---|---|---|---|---|
−2.339 | 0.302 | −7.758 | <0.0001 | * | ||
Type = City | 1.618 | 0.314 | 5.158 | <0.0001 | * | |
Type = County | 2.633 | 0.307 | 8.572 | <0.0001 | * | |
Type = School | 1.021 | 0.309 | 3.310 | <0.0001 | * | |
Type = Town | −0.730 | 0.387 | −1.888 | 0.0591 | ||
Type = Village | 0.835 | 0.317 | 2.632 | 0.0085 | * | |
avgCoverage | 0.195 | 0.010 | 19.731 | <0.0001 | * |
Estimation results for the parameters under Model 3 and Model 4
Parameter | Model 3 | Model 4 | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Est | Std.dev | z | Est | Std.dev | z | ||||||
−2.356 | 0.319 | −7.373 | <0.0001 | * | −2.849 | 0.364 | −7.829 | <0.0001 | * | ||
Type = City | 1.048 | 0.370 | 2.833 | 0.0047 | * | 1.323 | 0.417 | 3.177 | 0.0015 | * | |
Type = County | 1.940 | 0.379 | 5.121 | <0.0001 | * | 2.229 | 0.422 | 5.284 | <0.0001 | * | |
Type = School | 0.920 | 0.333 | 2.766 | 0.0057 | * | 0.917 | 0.379 | 2.418 | 0.0157 | * | |
Type = Town | −0.909 | 0.390 | −2.331 | 0.0199 | * | −0.713 | 0.442 | −1.612 | 0.1072 | ||
Type = Village | 0.434 | 0.349 | 1.245 | 0.2131 | 0.613 | 0.396 | 1.549 | 0.1216 | |||
avgCoverage | 0.383 | 0.048 | 8.038 | <0.0001 | * | 0.337 | 0.040 | 8.430 | <0.0001 | * | |
1.072 | 1.191 |
Selected estimation results for random effects under Models 3 and Model 4
Model 3 | Model 4 | |||
---|---|---|---|---|
Estimate | Std.err | Estimate | Std.err | |
0.938 | 0.299 | 0.234 | 0.304 | |
0.647 | 0.247 | 0.004 | 0.238 | |
2.074 | 0.280 | 0.995 | 0.295 | |
0.875 | 0.238 | 0.262 | 0.238 | |
1.180 | 0.276 | 0.467 | 0.285 | |
0.921 | 0.314 | 0.302 | 0.313 | |
3.132 | 0.181 | 1.584 | 0.177 | |
0.336 | 0.345 | −0.451 | 0.286 | |
0.960 | 0.229 | 0.438 | 0.212 | |
0.264 | 0.360 | −0.906 | 0.341 |