Let \(P\) be an orthogonal projection matrix (symmetric and idempotent). Show the following:
Let \(E \in \mathbb R^{n\times p}\) and let \(S=E^\top E\).
Suppose the best one-dimensional affine approximation to the rows of \(Y\in \mathbb R^{n\times p}\) is \(\hat Y = 1 \bar y^\top + f_1 v_1^\top\), where \(v_1^\top v_1 = 1\). For \(i=1,\ldots,n\), let \(x_i = a + c B (y_i-\bar y)\), where \(y_i\) is the \(i\)th row of \(Y\), \(a\in \mathbb R^p\), \(B\) is an orthogonal matrix (\(B^\top B=I_p)\) and \(c\) is a scalar. Let \(\hat X = 1 m^\top + g w^\top\) be the best one-dimensional affine approximation to \(X\).
Let \(Y\) be the data matrix on Swiss head measurements discussed in class.