---
title: "Getting started with RBesT (binary)"
author: "Sebastian Weber"
date: "`r Sys.Date()`"
output:
  rmarkdown::html_vignette:
    toc: true
  html_vignette:
    toc: true
  html_document:
    toc: true
    number_sections: true
    toc_float:
      collapsed: false
      smooth_scroll: false
  pdf_document:
    toc: true
  word_document:
    toc: true
vignette: >
  %\VignetteIndexEntry{Getting started with RBesT (binary)}
  %\VignetteEncoding{UTF-8}
  %\VignetteEngine{knitr::rmarkdown}
---

```{r, child="settings-knitr.txt"}
```
```{r, child="settings-sampling.txt"}
```

# Introduction

The R Bayesian evidence synthesis Tools (RBesT) facilitate the use of
historical information in clinical trials. Once relevant historical
information has been identified, RBesT supports the derivation of
informative priors via the Meta-Analytic-Predictive (MAP) approach [1]
and the evaluation of the trial's operating characteristics. The MAP
approach performs a standard meta-analysis followed by a prediction
for the control group parameter of a future study while accounting for
the uncertainty in the population mean (the standard result from a
meta-analysis) and the between-trial heterogeneity. Therefore, RBesT
can also be used as a meta-analysis tool if one simply neglects the
prediction part.

This document demonstrates RBesT as it can be used to derive from
historical control data a prior for a binary endpoint. The [RBesT
package homepage](https://opensource.nibr.com/RBesT/) contains further
articles on introductory material:

- Probability of success with co-data
- Probability of success at an interim with a normal endpoint
- Customizing RBesT plots
- RBesT for a normal endpoint
- Meta-Analytic-Predictive priors for variances

# Binary responder analysis example

Let's consider a Novartis Phase II study in ankylosing spondylitis
comparing the Novartis test treatment secukinumab with placebo
[2]. The primary efficacy endpoint was percentage of patients with a
20% response according to the Assessment of SpondyloArthritis
international Society criteria for improvement (ASAS20) at week 6. For
the control group, the following historical data were used to derive
the MAP prior:

```{r,results="asis",echo=FALSE}
kable(AS)
```

This dataset is part of RBesT and available after loading
the package in the data frame `AS`.

RBesT supports all required steps to design a clinical trial with
historical information using the MAP approach.

# Prior Derivation

## Meta-Analytic-Predictive Analysis

The **`gMAP`** function performs the meta-analysis and the prediction,
which yields the MAP prior. The analysis is run using stochastic
Markov-Chain-Monte-Carlo with Stan. In order to make results exactly
reproducible, the `set.seed` function must be called prior to calling
**`gMAP`** .

A key parameter in a meta-analysis is the between-trial heterogeneity
parameter $\tau$ which controls the amount of borrowing from
historical information for the estimation of the population mean will
occur. As we often have only few historical trials, the prior is
important.  For binary endpoints with an expected response rate of
20%-80% we recommend a conservative `HalfNormal(0,1)` prior as a
default. Please refer to the help-page of **`gMAP`** for more
information.

The **`gMAP`** function returns an analysis object from which we can
extract information using the functions from RBesT. We do recommend to
look at the graphical model checks provided by RBesT as demonstrated
below. The most important one is the forest plot, with solid lines for
the MAP model predictions and dashed lines for the stratified
estimates. For a standard forest plot without the shrinkage estimates
please refer to the `forest_plot` function in RBesT.

```{r}
# load R packages
library(RBesT)
library(ggplot2)
theme_set(theme_bw()) # sets up plotting theme

set.seed(34563)
map_mcmc <- gMAP(cbind(r, n-r) ~ 1 | study,
                 data=AS,
                 tau.dist="HalfNormal",
                 tau.prior=1,
                 beta.prior=2,
                 family=binomial)
print(map_mcmc)

## a graphical representation of model checks is available
pl <- plot(map_mcmc)

## a number of plots are immediately defined
names(pl)

## forest plot with model estimates
print(pl$forest_model)
```

An often raised concern with a Bayesian analysis is the choice of the
prior. Hence sensitivity analyses may sometimes be necessary. They can
be quickly performed with the **`update`** function. Suppose we want
to evaluate a more optimistic scenario (with less between-trial
heterogeneity), expressed by a `HalfNormal(0,1/2)` prior on
$\tau$. Then we can rerun the original analysis, but with modified
arguments of **`gMAP`**:

```{r}
set.seed(36546)
map_mcmc_sens <- update(map_mcmc, tau.prior=1/2)
print(map_mcmc_sens)
```

## Parametric Approximation

As a next step, the MAP prior, represented numerically using a large
MCMC simulation sample, is converted to a parametric representation
with the **`automixfit`** function. This function fits a parametric
mixture representation using expectation-maximization (EM). The number
of mixture components to best describe the MAP is chosen
automatically. Again, the `plot` function produces a graphical
diagnostic which allows the user to assess whether the marginal
mixture density (shown in black) matches well with the histogram of
the MAP MCMC sample.
```{r}
map <- automixfit(map_mcmc)
print(map)
plot(map)$mix
```

## Effective Sample Size

The (usual) intended use of a (MAP) prior is to reduce the number of
control patients in the trial. The prior can be considered equivalent
to a number of experimental observations, which is called the
effective sample size (ESS) of the prior. It can be calculated in
RBesT with the **`ess`** function. It should be noted, however, that
the concept of ESS is somewhat elusive. In particular, the definition
of the ESS is not unique and multiple methods have therefore been
implemented in RBesT. The default method in RBesT is the elir approach
[5] which results in reasonable ESS estimates. The moment matching
approach leads to conservative (small) ESS estimates while the Morita
[3] method tends to estimates liberal (large) ESS estimates when used
with mixtures:

```{r}
round(ess(map, method="elir"))   ## default method
round(ess(map, method="moment"))
round(ess(map, method="morita"))
```
The Morita approach uses the curvature of the prior at the mode and
has been found to be sensitive to a large number of mixture
components. From experience, a realistic ESS estimate can be obtained
with the elir method which is the only method which is predictively
consistent, see [5] for details.

## Robustification of the MAP Prior

Finally, we recommend to **`robustify`** [4] the prior which protects
against type-I error inflation in presence of prior-data conflict,
i.e. if the future trial data strongly deviate from the historical
control information.
```{r}
## add a 20% non-informative mixture component
map_robust <- robustify(map, weight=0.2, mean=1/2)
print(map_robust)

round(ess(map_robust))
```

Adding a robust mixture component does reduce the ESS of the MAP prior
to an extent which depends on the weight of the robust
component. Selecting higher robust mixture weights leads to greater
discounting of the informative MAP prior and vice versa. As a
consequence the robust weight controls the degree of influence of the
MAP prior within the final analysis. In some circumstances it can be
helpful to graphically illustrate the relationship of the prior ESS as
a function of the robust mixture component weight:

```{r}
ess_weight <- data.frame(weight=seq(0.05, 0.95, by=0.05), ess=NA)
for(i in seq_along(ess_weight$weight)) {
    ess_weight$ess[i] <- ess(robustify(map, ess_weight$weight[i], 0.5))
}
ess_weight <- rbind(ess_weight,
                    data.frame(weight=c(0, 1),
                               ess=c(ess(map), ess(mixbeta(c(1,1,1))))))

ggplot(ess_weight, aes(weight, ess)) + geom_point() + geom_line() +
    ggtitle("ESS of robust MAP for varying weight of robust component") +
    scale_x_continuous(breaks=seq(0,  1, by=0.1)) +
    scale_y_continuous(breaks=seq(0, 40, by=5))
```

# Design Evaluation

Now we have a prior which can be specified in the protocol. The
advantage of using historical information is the possible reduction of
the placebo patient group. The sample size of the control group
 is supplemented by the historical information. The reduction
in placebo patients can be about as large as the ESS of the MAP prior.

In the following, we compare designs with different sample sizes and
priors for the control group. The comparisons are carried out by
evaluating standard Frequentist operating characteristics (type-I
error, power). The scenarios are not exhaustive, but rather specific
ones to demonstrate the use of RBesT for design evaluation.


## Operating Characteristics

We consider the 2-arm design of the actual Novartis trial in
ankylosing spondylitis [2]. This trial tested 6 patients on placebo as
control against 24 patients on an active experimental
treatment. Success was declared whenever the condition

$$\Pr(\theta_{active} - \theta_{control} > 0) > 0.95$$

was met for the response rates $\theta_{active}$ and
$\theta_{control}$. A MAP prior was used for the placebo response rate
parameter. Here we evaluate a few design options as an example.

The operating characteristics are setup in RBesT in a stepwise manner:

1. Definition of priors for each arm.
2. Definition of the decision criterion using the **`decision2S`** function.
3. Specification of design options with the **`oc2S`** function. This
   includes the overall decision function and per arm the prior and
   the sample size to use.
4. The object from step 3 is then used to calculate the operating
   characteristics.

Note that for a 1-sample situation the respective `decision1S` and
`oc1S` function are used instead.

### Type I Error

The type I can be increased compared to the nominal $\alpha$ level in
case of a conflict between the trial data and the prior. Note, that in
this example the MAP prior has a 95% interval of about 0.1 to 0.5.

```{r}
theta         <- seq(0.1,0.95,by=0.01)
uniform_prior <- mixbeta(c(1,1,1))
treat_prior   <- mixbeta(c(1,0.5,1)) # prior for treatment used in trial
lancet_prior  <- mixbeta(c(1,11,32)) # prior for control   used in trial
decision      <- decision2S(0.95, 0, lower.tail=FALSE)

design_uniform   <- oc2S(uniform_prior, uniform_prior, 24,  6, decision)
design_classic   <- oc2S(uniform_prior, uniform_prior, 24, 24, decision)
design_nonrobust <- oc2S(treat_prior,   map          , 24,  6, decision)
design_robust    <- oc2S(treat_prior,   map_robust   , 24,  6, decision)

typeI_uniform   <- design_uniform(  theta, theta)
typeI_classic   <- design_classic(  theta, theta)
typeI_nonrobust <- design_nonrobust(theta, theta)
typeI_robust    <- design_robust(   theta, theta)

ocI <- rbind(data.frame(theta=theta, typeI=typeI_robust,    prior="robust"),
             data.frame(theta=theta, typeI=typeI_nonrobust, prior="non-robust"),
             data.frame(theta=theta, typeI=typeI_uniform,   prior="uniform"),
             data.frame(theta=theta, typeI=typeI_classic,   prior="uniform 24:24")
             )

ggplot(ocI, aes(theta, typeI, colour=prior)) + geom_line() + ggtitle("Type I Error")
```

Note that observing response rates greater that 50% is highly
implausible based on the MAP analysis:

```{r}
summary(map)
```

Hence, it is resonable to restrict the response rates $\theta$ for
which we evaluate the type I error to a a range of plausible values:

```{r}
ggplot(ocI, aes(theta, typeI, colour=prior)) + geom_line() +
    ggtitle("Type I Error - response rate restricted to plausible range") +
    coord_cartesian(xlim=c(0, 0.5))
```

### Power

The power demonstrates the gain of using an informative prior;
i.e. 80% power is reached for smaller $\delta$ values in comparison to
a design with non-informative priors for both arms.

```{r}
delta <- seq(0,0.7,by=0.01)
mean_control <- summary(map)["mean"]
theta_active  <- mean_control +   delta
theta_control <- mean_control + 0*delta

power_uniform   <- design_uniform(  theta_active, theta_control)
power_classic   <- design_classic(  theta_active, theta_control)
power_nonrobust <- design_nonrobust(theta_active, theta_control)
power_robust    <- design_robust(   theta_active, theta_control)

ocP <- rbind(data.frame(theta_active, theta_control, delta=delta, power=power_robust,    prior="robust"),
             data.frame(theta_active, theta_control, delta=delta, power=power_nonrobust, prior="non-robust"),
             data.frame(theta_active, theta_control, delta=delta, power=power_uniform,   prior="uniform"),
             data.frame(theta_active, theta_control, delta=delta, power=power_classic,   prior="uniform 24:24")
             )

ggplot(ocP, aes(delta, power, colour=prior)) + geom_line() +
    ggtitle("Power")

```

We see that with the MAP prior one reaches greater power at smaller
differences $\delta$ in the response rate. For example, the $\delta$
for which 80% power is reached can be found with:

```{r}
find_delta <- function(design, theta_control, target_power) {
    uniroot(function(delta) { design(theta_control + delta, theta_control) - target_power },
            interval=c(0, 1-theta_control))$root
}

target_effect <- data.frame(delta=c(find_delta(design_nonrobust, mean_control, 0.8),
                                    find_delta(design_classic,   mean_control, 0.8),
                                    find_delta(design_robust,    mean_control, 0.8),                                    
                                    find_delta(design_uniform,   mean_control, 0.8)),
                            prior=c("non-robust", "uniform 24:24", "robust", "uniform"))

knitr::kable(target_effect, digits=3)
```

### Data Scenarios

An alternative approach to visualize the study design to
non-statisticians is by considering data scenarios. These show the
decisions based on potential trial outcomes. The information needed
are the critical values at which the decision criterion flips. In the
2-sample case this means to calculate the decision boundary, see the
**`decision2S_boundary`** help for more information.

```{r}
## Critical values at which the decision flips are given conditional
## on the outcome of the second read-out; as we like to have this as a
## function of the treatment group outcome, we flip label 1 and 2
decision_flipped <- decision2S(0.95, 0, lower.tail=TRUE)
crit_uniform     <- decision2S_boundary(uniform_prior, uniform_prior, 6, 24, decision_flipped)
crit_nonrobust   <- decision2S_boundary(map          , treat_prior  , 6, 24, decision_flipped)
crit_robust      <- decision2S_boundary(map_robust   , treat_prior  , 6, 24, decision_flipped)
treat_y2 <- 0:24
## Note that -1 is returned to indicated that the decision is never 1
ocC <- rbind(data.frame(y2=treat_y2, y1_crit=crit_robust(treat_y2),    prior="robust"),
             data.frame(y2=treat_y2, y1_crit=crit_nonrobust(treat_y2), prior="non-robust"),
             data.frame(y2=treat_y2, y1_crit=crit_uniform(treat_y2),   prior="uniform")
             )

ggplot(ocC, aes(y2, y1_crit, colour=prior)) + geom_step() + ggtitle("Critical values y1(y2)")
```

The graph shows that the decision will always be negative if there are
less than 10 events in the treatment group. On the other hand, under a
non-robust prior and assuming 15 events in the treatment group, three
(or less) placebo events would be needed for success. To check this
result, we can directly evaluate the decision function:

```{r}
## just positive
decision(postmix(treat_prior, n=24, r=15), postmix(map, n=6, r=3))
## negative
decision(postmix(treat_prior, n=24, r=14), postmix(map, n=6, r=4))
```


# Trial Analysis

Once the trial has completed and data is collected, the final analysis
can be run with RBesT using the **`postmix`** function. Calculations
are performed analytically as we are in the conjugate mixture
setting.

```{r}
r_placebo <- 1
r_treat   <- 14

## first obtain posterior distributions...
post_placebo <- postmix(map_robust,  r=r_placebo, n=6)
post_treat   <- postmix(treat_prior, r=r_treat  , n=24)

## ...then calculate probability that the difference is smaller than
## zero
prob_smaller <- pmixdiff(post_treat, post_placebo,  0, lower.tail=FALSE)

prob_smaller

prob_smaller > 0.95

## alternativley we can use the decision object
decision(post_treat, post_placebo)
```

### References

[1] Neuenschwander B. et al., _Clin Trials_. 2010; 7(1):5-18  
[2] Baeten D. et al., _The Lancet_, 2013, (382), 9906, p 1705  
[3] Morita S. et al., _Biometrics_ 2008;64(2):595-602  
[4] Schmidli H. et al., _Biometrics_ 2014;70(4):1023-1032  
[5] Neuenschwander B. et al., _Biometrics_ 2020;76(2):578-587  

### R Session Info

```{r}
sessionInfo()
```

```{r,include=FALSE}
options(.user_mc_options)
```