class: center, middle, inverse, title-slide

# Ordinal Regression

### Yue Jiang

### STA 210 / Duke University / Spring 2023

---

### Licorice and post-op sore throat

<img src="img/licorice.jpg" width="80%" style="display: block; margin: auto;" />

---

### Licorice and post-op sore throat

<img src="ordinal_files/figure-html/unnamed-chunk-2-1.png" style="display: block; margin: auto;" />

---

### Licorice and post-op sore throat

<img src="ordinal_files/figure-html/unnamed-chunk-3-1.png" style="display: block; margin: auto;" />

---

### Licorice and post-op sore throat

<img src="ordinal_files/figure-html/unnamed-chunk-4-1.png" style="display: block; margin: auto;" />

---

### Ordinal data

In this case, our outcome is *ordered* (and categorical). Although we have "numbers" on a pain scale from 0 to 10, these numbers don't have a meaningful numeric scale - a 4 isn't necessarily twice as painful as a 2; a 3 isn't three times as painful as a 1.

However, these data *are* ordered. We do know that a 4 is more painful than a 3, which is more painful than a 2, etc.

.question[
What are some potential pitfalls of using an ordinary least squares regression? How about using a logistic regression approach (how would we even do this)?
]

---

### A cumulative link model

We might consider an outcome `\(Y\)` that looks at all outcome categories at once. For `\(j\)` total ordered categories, we might model the **cumulative probability** for observation `\(i\)`:

`\begin{align*}
\gamma_{ij} &= P(Y_i \le j)\\
&= P(Y_i = 1) + P(Y_i = 2) + \cdots + P(Y_i = j).
\end{align*}`

Note that `\(\gamma_{ij}\)` is limited to values from 0 to 1, as it is a probability. We might consider the model

`\begin{align*}
logit(\gamma_{ij}) = \beta_{0;j} + \beta_1x_{i1} + \cdots + \beta_px_{ip}
\end{align*}`

.question[
Take a look at this model - what do the `\(\beta_{0;j}\)` terms mean/represent?
]

---

### A cumulative link model

`\begin{align*}
\gamma_{ij} &= P(Y_i \le j)\\
&= P(Y_i = 1) + P(Y_i = 2) + \cdots + P(Y_i = j).
\end{align*}`

The `\(\beta_{0;j}\)` terms are constants representing the "baseline" value for each category (on the logit scale).

`\begin{align*}
logit(\gamma_{ij}) = \beta_{0;j} + \beta_1x_{i1} + \cdots + \beta_px_{ip}
\end{align*}`

.question[
What is the interpretation of `\(\beta_1\)` through `\(\beta_p\)`? What are we implicitly assuming?
]

---

### A cumulative link model

In this case, we have the same covariate relationships `\(\beta_1, \cdots, \beta_p\)` across **all** of the categories. This means that `\(\beta_k\)` represents the conditional change in (transformed) cumulative probabilities given a one-unit difference in `\(X_{ik}\)`.

---

### Ordered logistic regression

The ordered logistic regression model is a cumulative link model that assumes a logit transformation of the cumulative probabilities:

`\begin{align*}
logit(\gamma_{ij}) &= \beta_{0;j} + \beta_1x_{i1} + \cdots + \beta_px_{ip}\\
log\left(\frac{\gamma_{ij}}{1 - \gamma_{ij}}\right) &= \beta_{0;j} + \beta_1x_{i1} + \cdots + \beta_px_{ip}\\
log\left(\frac{P(Y_i \le j)}{P(Y_i > j)}\right) &= \beta_{0;j} + \beta_1x_{i1} + \cdots + \beta_px_{ip}
\end{align*}`

Exponentiating, we have

`\begin{align*}
\frac{P(Y_i \le j)}{P(Y_i > j)} &= e^{\beta_{0;j} + \beta_1x_{i1} + \cdots + \beta_px_{ip}} = e^{\beta_{0;j}}e^{\beta_1x_{i1}} \cdots e^{\beta_px_{ip}}
\end{align*}`

.question[
What is the outcome here? How might we interpret `\(\exp(\beta_{0;j})\)`? How might we interpret the `\(\beta\)` terms here?
]

---

### The proportional odds assumption

Remember that we have only one `\(\beta_k\)` term for each predictor across *all* categories. This implies that changes in `\(X_k\)` have the same conditional relationship with the cumulative odds `\(P(Y \le j)/P(Y > j)\)` at every cutoff `\(j\)` - whether that cutoff separates 1 from 2, 6 from 7, or any `\(j\)` from `\(j + 1\)`.

.question[
When might this be a reasonable assumption? When might this assumption be violated? How might we modify the model in the case that this assumption does not hold? How might we gut-check this assumption using the data?
]

---

### Fitting an ordinal logistic model

One quirk of the `polr()` function in R's `MASS` package is that it parameterizes the model as

`\begin{align*}
logit(\gamma_{ij}) &= \beta_{0;j} - x_{i1}\eta_1 - \cdots - x_{ip}\eta_p
\end{align*}`

with `\(\eta_k = -\beta_k\)`.

---

### Fitting an ordinal logistic model

```r
library(MASS)
m1 <- polr(factor(postOp4hour_throatPain) ~ treat + preOp_calcBMI,
           data = dat)
summary(m1)
```

```
## Call:
## polr(formula = factor(postOp4hour_throatPain) ~ treat + preOp_calcBMI, 
##     data = dat)
## 
## Coefficients:
##                  Value Std. Error t value
## treat         -1.11716    0.28973 -3.8558
## preOp_calcBMI  0.01853    0.03272  0.5662
## 
## Intercepts:
##     Value  Std. Error t value
## 0|1 0.7007 0.8624     0.8125 
## 1|2 1.5582 0.8695     1.7921 
## 2|3 2.5771 0.8901     2.8952 
## 3|4 3.6860 0.9485     3.8862 
## 4|5 4.1043 0.9904     4.1439 
## 5|6 4.8097 1.1081     4.3406 
## 6|7 5.5089 1.3139     4.1928 
## 
## Residual Deviance: 476.6836 
## AIC: 494.6836 
## (2 observations deleted due to missingness)
```

---

### Fitting an ordinal logistic model

```r
exp(coef(m1))
```

```
##         treat preOp_calcBMI 
##     0.3272071     1.0187007
```

.question[
How might we interpret the `\(\exp(\eta)\)` term corresponding to treatment?
]

---

### Fitting an ordinal logistic model

```r
exp(coef(m1))
```

```
##         treat preOp_calcBMI 
##     0.3272071     1.0187007
```

Patients who receive the licorice treatment have approximately 1/3 the odds of being in a higher pain category (above any given cutoff, e.g., above 5 vs. 5 or below) compared to patients receiving placebo, while controlling for BMI.

For every `\(kg/m^2\)` increase in BMI, the odds of being in a higher pain category are multiplied by approx. 1.02, while controlling for treatment.
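---

### Fitting an ordinal logistic model

As a sanity check on the parameterization, we can reconstruct a patient's fitted category probabilities by hand. This is a quick sketch (it assumes the `m1` fit and `dat` data frame from the previous slides, with `treat` coded numerically): `polr` stores the `\(\beta_{0;j}\)` intercepts in `m1$zeta`, and `plogis()` is the inverse logit.

```r
# Linear predictor for the first patient (assumes m1 and dat exist,
# with treat coded 0/1 as in the summary output)
eta <- sum(coef(m1) * c(dat$treat[1], dat$preOp_calcBMI[1]))

# Cumulative probabilities P(Y <= j): inverse logit of beta_{0;j} - eta'x
gamma <- plogis(m1$zeta - eta)

# Successive differences recover the individual category probabilities,
# which should match the first row of m1$fitted.values
diff(c(0, gamma, 1))
```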
---

### Fitting an ordinal logistic model

```r
exp(confint(m1))
```

```
##                   2.5 %    97.5 %
## treat         0.1830655 0.5719223
## preOp_calcBMI 0.9554557 1.0866260
```

---

### Fitting an ordinal logistic model

```r
round(head(m1$fitted.values), 3)
```

```
##       0     1     2     3     4     5     6     7
## 1 0.770 0.118 0.069 0.029 0.005 0.005 0.002 0.002
## 2 0.799 0.105 0.059 0.025 0.004 0.004 0.002 0.002
## 3 0.789 0.109 0.062 0.026 0.005 0.004 0.002 0.002
## 4 0.784 0.111 0.064 0.027 0.005 0.005 0.002 0.002
## 5 0.778 0.114 0.066 0.028 0.005 0.005 0.002 0.002
## 6 0.761 0.121 0.072 0.030 0.005 0.005 0.003 0.003
```
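---

### Fitting an ordinal logistic model

The fitted probabilities can also be collapsed into a single predicted pain category per patient. A brief sketch (assuming the `m1` fit from the earlier slides): `predict()` on a `polr` object returns the highest-probability category with `type = "class"`, or the full probability matrix with `type = "probs"`.

```r
# Most likely pain category for each patient
head(predict(m1, type = "class"))

# Full matrix of category probabilities (same as m1$fitted.values)
head(predict(m1, type = "probs"))
```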