Individual Claims Reserving with Stan

The problem

  • Desire for individual claim analysis - don’t throw away data.
  • We’re all pretty comfortable with GLMs now. Let’s go crazy with lots of variables.
  • Risk management and regulatory oversight mean that second moment estimate becomes more critical
  • Mama weer all Bayezee now!

More optimistically …

  • Actuaries are seen as vital elements in steering the claims department. Must have a laser focus on individual claims.
  • Actuaries are our go-to resource for fancy pants predictive models. Let’s use this in our claims department.
  • Managers have put up with the limitations of chain-ladder reserving for far too long. We need more technical solutions to old problems.

What I’ll show you

  • A detailed walk-through of an example first proposed by Guszcza and Lommele.
  • Comment on how this fits with aggregate methods
    • Bifurcated data
    • Hierarchical models
    • Bayesian
  • An easy Stan walk-through
  • Stan for individual claims

What I won’t talk about:

  • The stochastic simulation assumptions
  • IBNYR
  • Diagnosing a Stan fit

Guszcza and Lommele

Way back in 2006, Guszcza and Lommele (2006) presented a model to develop reserves based on individual claim data.

  • 5,000 claims per year
  • Value at the first evaluation period is drawn from the same lognormal distribution for every claim
  • Subsequent amounts are multiplicative chain ladder
    • Current period amount equals prior period times link ratio
    • Link ratios are random
    • Expected value and variance of link ratio depends on credit quality
  • Fit the model using a Poisson GLM
  • Aggregation of claims data misses the specific structure of the data

Regression based on individual claims looks pretty good. Axes are on a log scale.

However, things look different when we differentiate based on credit grouping.

GLM vs Chain Ladder

What if we had split our claims data?

  • We would have been fine.

  • Not surprising. The Poisson likelihood function aggregates naturally.

\[L=\prod_{i=1}^{N}\dfrac{\lambda^{x_i}e^{-\lambda}}{x_i!}\]

  • The subscripts don’t really matter. We can aggregate individual claims into accident years and then fit the model, or fit the model to the individual claims directly; the maximum likelihood estimate is the same either way (see the sketch just below).
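
Here's a quick sketch of that equivalence with simulated data (the accident years, claim counts, and Poisson means below are purely illustrative):

# Illustrative simulated data, not the Guszcza and Lommele set
set.seed(1234)
claims <- data.frame(
  accidentYear = rep(2010:2014, each = 1000)
  , amount = rpois(5000, lambda = rep(c(100, 110, 120, 130, 140), each = 1000))
)

# Fit to individual claims
fitIndividual <- glm(amount ~ factor(accidentYear), family = poisson, data = claims)

# Aggregate into accident years first, with the claim count as an offset
aggClaims <- aggregate(amount ~ accidentYear, data = claims, FUN = sum)
aggClaims$numClaims <- as.vector(table(claims$accidentYear))
fitAggregate <- glm(amount ~ factor(accidentYear) + offset(log(numClaims))
                    , family = poisson, data = aggClaims)

# The coefficients agree to within numerical precision
cbind(individual = coef(fitIndividual), aggregate = coef(fitAggregate))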

However

  • When we look at individual claims, we may construct a training and test data set. That’s not nearly as effective when we aggregate into accident years before fitting.
  • Issues like heteroskedasticity are easier to spot with more data.
  • Confidence intervals around link ratios are tighter with more sample points.
  • This won’t necessarily work for other distributional forms.

So we’re done?

So we can split our data into subsets and get results just about as good as those from fitting the individual data, without quite so much fuss.

  • What if we’re worried about credibility of the resulting segments?
  • Well, we can just use hierarchical models, right?

Hierarchical methods

Hierarchical models

  • Also referred to as fixed/random effects models
  • Common in social science research to control for “house effects” (classrooms in schools, schools in districts)
  • Presume that a model parameter (like a link ratio) is the result of a random draw from another distribution which has its own hyperparameters
  • Example: link ratios by state ought to be fairly similar. We may consider them to be random draws from a nationwide distribution (a rough sketch follows this list)
  • Very similar to the actuarial concept of credibility
  • Read more in (Guszcza 2008)
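
Here's a rough sketch of the idea in R using lme4 (this is not the growth-curve model in (Guszcza 2008); the states, claim counts, and link ratios are simulated purely for illustration):

library(lme4)

# Illustrative simulated data: 10 states, 50 claims each, with a "true"
# state-level link ratio drawn around a countrywide value of 1.5
set.seed(1234)
claims <- data.frame(
  state = rep(state.abb[1:10], each = 50)
  , prior = rlnorm(500, meanlog = 7, sdlog = 0.5)
)
trueLink <- rnorm(10, mean = 1.5, sd = 0.1)
claims$current <- rpois(500, claims$prior * trueLink[as.integer(factor(claims$state))])

# Poisson GLM with log(prior) as an offset and a random intercept by state
fitHier <- glmer(current ~ (1 | state) + offset(log(prior))
                 , family = poisson, data = claims)

exp(fixef(fitHier))                         # countrywide link ratio
exp(coef(fitHier)$state[, "(Intercept)"])   # credibility-weighted state link ratios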

Fit a hierarchical model

Hierarchical Models

  • Note that this is a bit different than simply splitting the data. We’re looking at all of the data at once.
  • In our example, we know that credit score affects loss development by construction. In the real world, that assumption is harder to make.
  • Good news: we’re using all of our data.
  • Bad(ish) news: estimates are based only on the data we have to hand. No exogenous information.

Is there an alternative?

What if we don’t want to look at all of our data at the same time?

What if we don’t have data?

Bayesian estimation

Bayesian estimation allows us to replace data with prior judgment.

                     Hierarchical models            Bayesian
Fit method           Maximum likelihood             Closed form if you’re lucky, numerical methods (like MCMC) if you’re not
Complementary data   Part of the fitting process    Use a prior distribution
Objectivity          Objective                      Subjective when the prior swamps the data

So let’s all be Bayesian!

  • But Bayesian estimation is hard!!
  • There are integrals!
  • Complicated integrals!
  • Metropolis? BUGS? JAGS? Shrinkage? What on earth do these things even mean?

Enter Stan

  • Named for Stanislaw Ulam, father of Monte Carlo methods (and, um, a major contributor to nuclear weapons design)
  • A project with Andrew Gelman and many others
  • Predecessors were BUGS and JAGS (the GS stands for Gibbs sampler)
  • Available for a number of platforms and languages, including R and Python
  • Uses MCMC as the estimation engine
  • Very simple syntax to describe models

Stan documentation (Stan Development Team 2015)

A simple Stan example

Imagine I’ve flipped a coin ten times and come up with two heads.

  • What is the distribution around p?
  • What is my prediction for the next five flips?
  • How does my answer change if I’m pretty sure that the coin is fair?

What do I need for this?

I need the following:

  • Data
  • Some prior distribution
  • Model
  • Predictions

All of this information is passed to Stan as a block of data. Gather the data with R, pass it to Stan, and let it go to work. I store my Stan model in a separate file.

Data

data {
  int<lower=0> sampleN;                  // number of observed flips
  int<lower=0, upper=1> heads[sampleN];  // the observed flips, 1 = heads
  int<lower=0> predN;                    // number of future flips to predict
  int<lower=0> betaA;                    // beta prior parameter
  int<lower=0> betaB;                    // beta prior parameter
}

Prior distribution and model

parameters {
  real<lower=0,upper=1> theta;  // probability of heads
}

model {
  theta ~ beta(betaA, betaB);   // prior
  heads ~ bernoulli(theta);     // likelihood
}

In this simple example, there’s just the one parameter. \(\theta\) is the beta-distributed variable that describes our uncertainty about the binomial probability, p.

Prediction

generated quantities{
  real heads_pred;
  heads_pred <- binomial_rng(predN, theta);  // number of heads in predN future flips
}

The number of predicted flips doesn’t need to match the sample size. I could use five years of data to predict the next two, or whatever.

Run the code within R

library(rstan)

sampleN <- 10
heads <- c(1, 1, 0, 0, 0, 0, 0, 0, 0, 0)
predN <- 5
fit1 <- stan(file = './stan/bernoulli.stan'
            , data = list(sampleN = sampleN
                          , heads = heads
                          , predN = predN
                          , betaA = 1
                          , betaB = 1)
            , iter = 1000
            , seed = 1234)
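
Once the sampler finishes, the draws can be summarized with standard rstan functions. The parameter names come straight from the Stan program above:

print(fit1, pars = c("theta", "heads_pred"))   # posterior summaries

thetaDraws <- rstan::extract(fit1)$theta       # posterior draws of theta
mean(thetaDraws)
hist(thetaDraws)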

Two priors

  • No idea if the coin is fair
  • Pretty sure the coin is fair

Priors

No idea if the coin is fair

Pretty sure the coin is fair

Both assumptions

Future predictions

Notice that the mean of the generated thetas is always between the sample mean of 0.2 and our prior belief of 0.5. In the first case, it’s 0.25. In the second it’s 0.4.
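
This shrinkage is easy to verify because the beta prior is conjugate to the Bernoulli likelihood, so the posterior is available in closed form:

\[\theta \mid x \sim \text{Beta}(\beta_A + x, \beta_B + n - x), \qquad E[\theta \mid x] = \dfrac{\beta_A + x}{\beta_A + \beta_B + n}\]

With the flat Beta(1, 1) prior and x = 2 heads in n = 10 flips, the posterior mean is (1 + 2)/(1 + 1 + 10) = 0.25, matching the simulation. A posterior mean of 0.4 would correspond to a much stronger prior, something like Beta(10, 10), since (10 + 2)/(10 + 10 + 10) = 0.4.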

Simple, but potentially practical example

Is this a contrived scenario?

Consider:

  • Ten years of a high excess treaty
  • One year of ten comparable treaties
  • For late run-off claims, the probability that a claim will close in the next year (sketched below)
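
The bernoulli.stan program above handles the last of these without modification. Here's a sketch with purely hypothetical numbers: suppose 3 of 12 comparable run-off claims closed last year, 9 remain open, and judgment says roughly a quarter of such claims close in any given year, expressed as a Beta(5, 15) prior:

closed <- c(rep(1, 3), rep(0, 9))                # 1 = claim closed during the year
fitClose <- stan(file = './stan/bernoulli.stan'
                 , data = list(sampleN = length(closed)
                               , heads = closed
                               , predN = 9        # open claims still to resolve
                               , betaA = 5
                               , betaB = 15)
                 , iter = 1000
                 , seed = 1234)

heads_pred then gives a posterior predictive distribution for how many of the nine open claims close next year.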

Bayesian Estimation of Individual Claims

Note

This model was based on an example first described in (Gelman and Hill 2006). The Stan model code may be found here: https://github.com/stan-dev/example-models/blob/master/ARM/Ch.8/roaches.stan

Data

data {
  // sample data
  int<lower=0> numClaims;
  vector[numClaims] Prior;
  int<lower=0, upper=1> BadCredit[numClaims];
  int Current[numClaims];
  
  // prior parameters
  real shape;
  real rate;
  
  // New predicted quantities
  int<lower=0> numNewClaims;
  vector[numNewClaims] NewPrior;
  int<lower=0, upper=1> NewBadCredit[numNewClaims];
}

Transformed Data

transformed data {
  vector[numClaims] logPrior;
  vector[numNewClaims] logNewPrior;
  
  logPrior <- log(Prior);
  logNewPrior <- log(NewPrior);
}

Because this is a Poisson GLM with a log link, we take the log of the prior amount so that it can serve as an offset.

Parameters

parameters {
  real credit;
  real linkRatio;
}

transformed parameters {
  real logLink;
  logLink <- log(linkRatio);
}

We’re using Poisson with an offset, so we need to transform the parameters.
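
Written out, the model that the next block implements is

\[\text{Current}_i \sim \text{Poisson}\!\left(\text{Prior}_i \times \text{linkRatio} \times e^{\,\text{credit} \cdot \text{BadCredit}_i}\right)\]

so on the log scale the linear predictor is \(\log(\text{Prior}_i) + \log(\text{linkRatio}) + \text{credit} \cdot \text{BadCredit}_i\), with \(\log(\text{Prior}_i)\) as the offset.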

Model

model {
  // prior on the link ratio
  linkRatio ~ gamma(shape, rate);

  // likelihood: one Poisson observation per claim
  for (i in 1:numClaims) {
    Current[i] ~ poisson_log(logPrior[i] 
                                + logLink + credit * BadCredit[i]);
  }
}

Predictions

generated quantities{
  int newCurrent[numNewClaims];
  for (i in 1:numNewClaims){
    newCurrent[i] <- poisson_log_rng(logNewPrior[i] 
                                      + logLink + credit * NewBadCredit[i]);
  }
}
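
For completeness, here's one way to run the model from R. Everything here is a sketch: the file name './stan/claims.stan', the simulated claim values, and the "true" link ratio of 1.5 are all assumptions, though the shape and rate match the \(\alpha\) = 30 and \(\beta\) = 20 shown next:

# Hypothetical inputs purely for illustration
set.seed(1234)
numClaims <- 100
Prior <- rlnorm(numClaims, meanlog = 8, sdlog = 0.5)
BadCredit <- rbinom(numClaims, 1, 0.3)
Current <- rpois(numClaims, Prior * 1.5 * exp(0.2 * BadCredit))

numNewClaims <- 10
NewPrior <- rlnorm(numNewClaims, meanlog = 8, sdlog = 0.5)
NewBadCredit <- rbinom(numNewClaims, 1, 0.3)

fit2 <- stan(file = './stan/claims.stan'       # assumed name for the model above
             , data = list(numClaims = numClaims, Prior = Prior
                           , BadCredit = BadCredit, Current = Current
                           , shape = 30, rate = 20
                           , numNewClaims = numNewClaims, NewPrior = NewPrior
                           , NewBadCredit = NewBadCredit)
             , iter = 1000
             , seed = 1234)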

Here it is with \(\alpha\) = 30 and \(\beta\) = 20 for the gamma prior on the link ratio.

The posterior for good credit

The posterior for bad credit

Predictions for individual claims!

Summary

  • Individual claims analysis presumes a model. In that respect, not so much different from aggregate techniques.
  • If categorical effects are the modelling problem, simply divide and conquer. But each segment then has less data!
  • Hierarchical models can help as segments get small, but they require complementary data.
  • Bayesian techniques allow us to bring judgment to bear on small samples.

Read everything here:

http://pirategrunt.com/CLRS2016

References

Gelman, Andrew, and Jennifer Hill. 2006. Data Analysis Using Regression and Multilevel/Hierarchical Models. http://www.stat.columbia.edu/~gelman/arm/.

Guszcza, James. 2008. “Hierarchical Growth Curve Models for Loss Reserving.” Forum Fall 2008. https://www.casact.org/pubs/forum/08fforum/7Guszcza.pdf.

Guszcza, James, and Jan Lommele. 2006. “Loss Reserving Using Claim-Level Data.” Forum Spring 2006. https://www.casact.org/pubs/forum/06fforum/115.pdf.

Stan Development Team. 2015. Stan Modeling Language User’s Guide and Reference Manual, Version 2.10.0. http://mc-stan.org/.