Homework Assignment #8
Problem 1
- Load Howell’s craniometric dataset
- Get rid of unwanted data. Select only the following columns: ID, Sex, Population, BNL, MDH, EKB, ZOR, BAA, NBA. Filter the dataset to only include individuals from the following populations: BUSHMAN, PERU, NORSE, ZULU. Use this filtered dataset for all remaining questions in this homework.
- Calculate the variance / covariance matrix for the 6 numeric variables.
Problem 2
- Perform a Principal Components Analysis on the data…make sure to use
scale=TRUE to scale your variables! - Which single variable has the strongest loading on PC1?
- What is the cumulative proportion of variance explained by PC1 and PC2?
- Make a plot of the PC1 scores against PC2 scores. Color code the point based on the population variable.