MTH136/STA114: Statistics

Homework #5

Due: Wednesday, Feb 28, 2001


A criminal suspect undergoing a polygraph test is either guilty (G) or innocent (I). His answer to the question, ``Are you innocent?'' is ``Yes.'' As a result of the test, the expert polygraph examiner declares the suspect to by lying, denoted by X=1, or truth-telling, denoted by X=0. Let thetaI = Pr(X=0|I) and thetaG = Pr(X=1|G) be the probabilities of correct determinations for innocent and guilty suspects, respectively.
To provide estimates of thetaG and thetaI a pre-trial study is performed. Here the examiner administers the polygraph test to 20 ``guilty'' people and 18 ``innocent'' people, and each was asked ``are you innocent?'' All participants are instructed to say ``yes'' so that the guilty ones are lying, the innocent ones are not lying. The conditions of the test are otherwise exactly as used in testing a real suspect.
Let Y be the number out of the 20 guilty people that the examiner correctly identifies as lying, and let Z be the number out of the 18 innocent people that the examiner correctly identifies as truth-telling.

  1. State the distribution of Y given thetaG. State the distribution of Z given thetaI.
  2. Assuming uniform independent priors thetaG ~ U(0,1) and thetaI ~ U(0,1), what are the corresponding posteriors for thetaG and thetaI based on the observed pre-trial outcomes Y=17 and Z=18? What are the MLEs of thetaG and thetaI? What are the posterior means of thetaG and thetaI? Compare the posterior means with the MLEs as possible point estimates.
  3. Simulate large samples (say, k=10,000) from each of the posteriors and use these to compute samples for R = thetaG/thetaI. Summarize the posterior for R and use it to explore whether or not the examiner is in fact better at detecting truth-tellers than liars on the basis of this data. What do you conclude?
  4. Now return to the real suspect. We are really interested in the probability pG that he is in fact guilty when the polygraph examiner declares him to be lying. If we suppose the prior probability he is guilty to be 0.5, then Bayes' theorem gives us pG=thetaG/(thetaG+1-thetaI). Use the posterior simulations from (3.) to compute and summarize posterior samples for pG. How likely is it that the suspect is guilty?
  5. A non-Bayesian approach would be to simply estimate pG by its MLE, namely pG-hat = thetaG-hat / (thetaG-hat+1-thetaI-hat) based on the MLEs thetaG-hat and thetaI-hat. Is this a good idea given our data? Why?