class: center, middle, inverse, title-slide .title[ # Causal Inference (1) ] .author[ ### Yue Jiang ] .date[ ### Duke University ] --- <br><br><br><br> <img src="img/olympus.jpg" width="100%" style="display: block; margin: auto;" /> .question[ Who is depicted here? ] --- ### Potential outcomes For now, we will consider a treatment `\(A\)` taking on values of 0 or 1, and an outcome `\(Y\)` taking on values 0 or 1. Define a .vocab[potential outcome] `\(Y^{a = 1}\)` as the outcome that *would be observed* if treatment `\(a = 1\)` was given (this might not have happened in reality). `\(Y^{a = 0}\)` is the potenetial outcome that would be observed if treatment `\(a = 0\)` was given. These potential outcomes are known as .vocab[counterfactuals]. Before treatment `\(A\)` happens, there are two potential outcomes. After `\(A\)` happens, then a counterfactual exists. --- ### Counterfactuals You've likely dealt with counterfactuals your whole life! - What if I stayed in bed this morning? - What if we shot better during the Elite 8 game? - What if I didn't take that Tylenol? - If this patient got a liver transplant 3 years ago, would they sitll have been alive now? --- ### Potential outcomes <img src="img/gods.png" width="30%" style="display: block; margin: auto;" /> (A famous (and very reproduced) table from Hernan and Robins) --- ### Individual vs. average causal effects <img src="img/gods.png" width="30%" style="display: block; margin: auto;" /> .question[ What does it mean to have an individual "causal effect"? What about an average causal effect? ] --- ### Individual causal effects In this table, we see that some benefit, some are harmed, some are "immune," and some are "doomed." Consider the individual potential outcomes `\(Y_i^{a = 1}\)` and `\(Y_i^{a = 0}\)` for the `\(i\)`th observation. For individual `\(i\)`, we say that `\(A\)` has an effect on `\(Y\)` if `\(Y_i^{a = 1} \neq Y_i^{a = 0}\)`. .question[ The .vocab[sharp null hypothesis] states that there is no effect for *all* individuals. What does it mean for `\(A\)` to have no effect on `\(Y\)` in the language of potential outcomes? ] --- ### Individual causal effects The sharp null hypothesis states that `\(Y_i^{a = 1} = Y_i^{a = 0}\)` for all `\(i\)`. We will also introduce .vocab[causal consistency], which states that the observed outcome `\(Y\)` is related to the observed treatment and counterfactuals by `\(Y = Y^{a = 1}A + Y^{a = 0}(1 - A)\)`. --- ### Individual causal effects We usually care about some difference in potential outcomes, for instance `\(Y_i^{a = 1} - Y_i^{a = 0}\)`. .question[ Is it possible to identify individual causal effects from the observed data `\(Y\)` and `\(A\)` for observation `\(i\)`? ] --- ### Quick aside: variability So far, we've assumed that potential outcomes are deterministic based on the treatment, but note that we could (somewhat) straightforwardly define potential outcomes probabilistically as well. --- ### Potential outcomes vs. observed data <img src="img/gods2.png" width="60%" style="display: block; margin: auto;" /> --- ### Some assumptions .vocab[Infererence] occurs when one individual's treatment affects the outcome of another individual. Having no interference is an important part of SUTVA (next slide). .question[ What are some examples of interference? ] --- ### Some assumptions The .vocab[Stable Unit Treatment Value Assumption] (SUTVA) essentially states that there are only two potential outcomes: there is no interference, and that there is only one version of treatment and one version of non-treatment (or that multiple versions of each of these are irrelevant). .vocab[ What is an example of having multiple "versions" of treatment? How about an example where this is irrelevant? Why would SUTVA be important philosophically? ] --- ### Average causal effects Let's now move on to average causal effects, defined in terms of expectations. An .vocab[average causal effect] exists in the population if `\(E(Y^{a = 1}) \neq E(Y^{a = 0})\)`. We say there is no average causal effect if `\(E(Y^{a = 1}) = E(Y^{a = 0})\)`. .question[ Does having no average causal effect imply the sharp null? Does the sharp null imply no average causal effect? Explain why or provide counterexamples. ] --- ### Causal vs. associational effects We might care about various measures of causal effects, such as causal risk differences, odds ratios, etc. These are defined in terms of potential outcomes. *Associational* effects are based on observed data. E.g.: - Causal risk difference: `\(E(Y^{a = 1}) \neq E(Y^{a = 0})\)` - *Associational* risk difference: `\(E(Y | A = 1) - E(Y|A = 0)\)` .question[ When we might we be able to claim an association is actually causation? ] --- ### Causal vs. associational effects (that's the whole point of causal inference!) --- ### A quick calculation <img src="img/gods3.png" width="40%" style="display: block; margin: auto;" /> --- ### A quick calculation .vocab[ What is `\(P(Y^{a = 0} = 1 | A = 1)\)`? What is `\(P(Y^{a = 0} = 1 | A = 0)\)`? What is the implication of this? ] <img src="img/gods.png" width="30%" style="display: block; margin: auto;" /> --- ### A quick calculation Those that received treatment were at higher risk of death than the untreated in the counterfactual world in which no one receives treatment. .question[ But we don't get to observe the counterfactuals. What then? What can we do? ] --- ### Exchangeability .vocab[Exchangeability] is the property that the counterfactual outcome and actual treatment received are independent for all `\(a\)`: `\(Y^a \perp A\)` (note that like "causal consistency," "exchangeability" does not have the usual probabilistic definition of exchangeability). Be careful, this does *not* mean `\(Y \perp A\)` (in fact, we probably wouldn't want this at all for many studies!). --- ### Exchangeability Here, exchangeability implies that we can "exchange" group assignment without altering the outcome. For instance, if in the treatment group you have sicker patients than another, we will not have exchangeability - if they instead got placebo instead, our results would be different. > *When the groups of exposed and unexposed are exchangeable, the risk of the outcome in the treated group would have been the same as the risk of the outcome in the untreated group had the treatment been switched between the groups* .right[--Hernan] --- ### Exchangeability Similarly, .vocab[mean exchangeability] is a weaker, but analogous property on the expectations: `\(E(Y^a | A = 1) = E(Y^a | A = 0)\)` (exchangeability implies mean exchangeability). .question[ Why is (mean) exchangeability important? Did we have exchangeability in the example earlier? Why or why not? ] --- ### Exchangeability Mean exchangeability means that `\(E(Y^{a = 1}) = E(Y^{a = 1} | A = 1)\)`, which by causal consistency is equal to `\(E(Y | A = 1)\)` (which *is* identifiable). Similarly, it means that `\(E(Y^{a = 0}) = E(Y | A = 0)\)`. .vocab[ What is the implication of this fact? ] --- ### Exchangeability As we saw before, under exchangeability the causal risk difference (and risk ratio and odds ratio) equal the associational risk difference (or risk ratio or odds ratio). .question[ How might we introduce exchangeability to a study? ]