Causal Inference (1)

class: center, middle, inverse, title-slide

.title[
# Causal Inference (1)
]
.author[
### Yue Jiang
]
.date[
### Duke University
]

---

.question[
Who is depicted here?
]

---

### Potential outcomes

For now, we will consider a treatment `\(A\)` taking on values of 0 or 1, and an
outcome `\(Y\)` taking on values 0 or 1.

Define a .vocab[potential outcome] `\(Y^{a = 1}\)` as the outcome that *would be observed* if treatment `\(a = 1\)` was given (this might not have happened in reality). `\(Y^{a = 0}\)` is the potenetial outcome that would be observed if treatment `\(a = 0\)` was given.

These potential outcomes are known as .vocab[counterfactuals]. Before treatment `\(A\)` happens, there are two potential outcomes. After `\(A\)` happens, then a counterfactual exists.

---

### Counterfactuals

You've likely dealt with counterfactuals your whole life!

- What if I stayed in bed this morning?
- What if we shot better during the Elite 8 game?
- What if I didn't take that Tylenol?
- If this patient got a liver transplant 3 years ago, would they sitll have been alive now?

---

### Potential outcomes

(A famous (and very reproduced) table from Hernan and Robins)

---

### Individual vs. average causal effects

.question[
What does it mean to have an individual "causal effect"? What about an average causal effect?
]

---

### Individual causal effects

In this table, we see that some benefit, some are harmed, some are "immune," and some are "doomed."

Consider the individual potential outcomes `\(Y_i^{a = 1}\)` and `\(Y_i^{a = 0}\)` for the `\(i\)`th observation. For individual `\(i\)`, we say that `\(A\)` has an effect on `\(Y\)` if `\(Y_i^{a = 1} \neq Y_i^{a = 0}\)`.

.question[
The .vocab[sharp null hypothesis] states that there is no effect for *all* individuals. What does it mean for `\(A\)` to have no effect on `\(Y\)` in the language of potential outcomes?
]

---

### Individual causal effects

The sharp null hypothesis states that `\(Y_i^{a = 1} = Y_i^{a = 0}\)` for all `\(i\)`.

We will also introduce .vocab[causal consistency], which states that the observed outcome `\(Y\)` is related to the observed treatment and counterfactuals by `\(Y = Y^{a = 1}A + Y^{a = 0}(1 - A)\)`.

---

### Individual causal effects

We usually care about some difference in potential outcomes, for instance `\(Y_i^{a = 1} - Y_i^{a = 0}\)`.

.question[
Is it possible to identify individual causal effects from the observed data `\(Y\)` and `\(A\)` for observation `\(i\)`?
]

---

### Quick aside: variability

So far, we've assumed that potential outcomes are deterministic based on the treatment, but note that we could (somewhat) straightforwardly define potential outcomes probabilistically as well.

---

### Potential outcomes vs. observed data

---

### Some assumptions

.vocab[Infererence] occurs when one individual's treatment affects the outcome of another individual. Having no interference is an important part of SUTVA (next slide).

.question[
What are some examples of interference?
]

---

### Some assumptions

The .vocab[Stable Unit Treatment Value Assumption] (SUTVA) essentially states that there are only two potential outcomes: there is no interference, and that there is only one version of treatment and one version of non-treatment (or that multiple versions of each of these are irrelevant).

.vocab[
What is an example of having multiple "versions" of treatment? How about an example where this is irrelevant? Why would SUTVA be important philosophically?
]

---

### Average causal effects

Let's now move on to average causal effects, defined in terms of expectations. An .vocab[average causal effect] exists in the population if `\(E(Y^{a = 1}) \neq E(Y^{a = 0})\)`. We say there is no average causal effect if `\(E(Y^{a = 1}) = E(Y^{a = 0})\)`.

.question[
Does having no average causal effect imply the sharp null? Does the sharp null imply no average causal effect? Explain why or provide counterexamples.
]

---

### Causal vs. associational effects

We might care about various measures of causal effects, such as causal risk differences, odds ratios, etc. These are defined in terms of potential outcomes. *Associational* effects are based on observed data. E.g.:

- Causal risk difference: `\(E(Y^{a = 1}) \neq E(Y^{a = 0})\)`
- *Associational* risk difference: `\(E(Y | A = 1) - E(Y|A = 0)\)`

.question[
When we might we be able to claim an association is actually causation?
]

---

### Causal vs. associational effects

(that's the whole point of causal inference!)

---

### A quick calculation

---

### A quick calculation

.vocab[
What is `\(P(Y^{a = 0} = 1 | A = 1)\)`? What is `\(P(Y^{a = 0} = 1 | A = 0)\)`? What is the implication of this?
]

---

### A quick calculation

Those that received treatment were at higher risk of death than the untreated in the counterfactual world in which no one receives treatment.

.question[
But we don't get to observe the counterfactuals. What then? What can we do?
]

---

### Exchangeability

.vocab[Exchangeability] is the property that the counterfactual outcome and actual treatment received are independent for all `\(a\)`: `\(Y^a \perp A\)` (note that like "causal consistency," "exchangeability" does not have the usual probabilistic definition of exchangeability).

Be careful, this does *not* mean `\(Y \perp A\)` (in fact, we probably wouldn't want this at all for many studies!).
 
---

### Exchangeability

Here, exchangeability implies that we can "exchange" group assignment without altering the outcome. For instance, if in the treatment group you have sicker patients than another, we will not have exchangeability - if they instead got placebo instead, our results would be different.

> *When the groups of exposed and unexposed are exchangeable, the risk of the outcome in the treated group would have been the same as the risk of the outcome in the untreated group had the treatment been switched between the groups*

.right[--Hernan]

---

### Exchangeability

Similarly, .vocab[mean exchangeability] is a weaker, but analogous property on the expectations: `\(E(Y^a | A = 1) = E(Y^a | A = 0)\)` (exchangeability implies mean exchangeability).

.question[
Why is (mean) exchangeability important? Did we have exchangeability in the example earlier? Why or why not?
]

---

### Exchangeability

Mean exchangeability means that `\(E(Y^{a = 1}) = E(Y^{a = 1} | A = 1)\)`, which by causal consistency is equal to `\(E(Y | A = 1)\)` (which *is* identifiable).

Similarly, it means that `\(E(Y^{a = 0}) = E(Y | A = 0)\)`.

.vocab[
What is the implication of this fact?
]

---

### Exchangeability

As we saw before, under exchangeability the causal risk difference (and risk ratio and odds ratio) equal the associational risk difference (or risk ratio or odds ratio).

.question[
How might we introduce exchangeability to a study?
]