FizzBuzz for Statistics

1 minute read

General

Explain why, when applicable, the paired t-test is usually preferable over an unpaired t-test. (credit)

Interpreting interaction term: The model is \(y = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \beta_3 X_3 + \epsilon\). What is the change in \(y\) given 1 unit increase in \(X_1\)?

I fire two missiles at a ship at the same time. Each independently has a 60% chance of hitting and sinking it. I turn to yell at somebody, and when I turn back around, the ship is sinking. what’s the probability both missiles hit it? (credit)
Sum of two random variables

p-value is the probability of getting observed, or more extreme, data given that the null hypothesis is true. Wrong: p-value is the probability that the null hypothesis is true.
If we repeat an experiment many many times, we would expect that X% of the times the true parameter will fall within the X% confidence interval of that parameter.

Paired t-test takes into account the paired structure, which reduces the variance of the difference. Consider two paired observations, whose outcomes are denoted \(X\) and \(Y\). We are interested in the difference \(Z = X - Y\).

\[\begin{align} Var(Z) &= Var(X - Y) = Var(X) + Var(Y) - 2Cov(X, Y) \\ \end{align}\]

Notice that if \(Cov(X, Y) > 0\), e.g. paired units tend to have similar outcomes, then \(Var(Z)\) is smaller.

\[\frac{\partial}{\partial X_1} y = \beta_1 + \beta_3 X_2\]

Key point: the marginal effect of \(X_1\) on \(y\) depends on the value of \(X_2\)

P(both hit | ship sinks) = P(both hit | at least one hit) = \(\frac{0.36}{0.36 + 0.24 + 0.24} = 0.43\)