Don’t forget to use the correct markdown formatting as discussed in the “Workflow”. Specifically, be sure your document has bold headers for each of the sub-parts of the question (e.g. Problem 1A, Problem 1B, etc)
Problem 1
load this dataset on scapula morphology in Pan, Homo and Gorilla, and the Dikika fossil child. Details on this data can be found in Alemseged et al, 2006
Calculate the natural log of all the numeric variables. Save these in your existing dataframe, and use the logged variables in all future parts of this problem.
Make a subset of the data including only the extant species (G, H, or P, representing Gorilla, Homo and Pan), excluding the Dikika fossil (D).
Perform a discriminant function analysis (DFA) on the subset data (extant species only).
Which variable has the strongest loading on LD1?
Make a nice plot (using ggplot) of LD1 versus LD2. Hint: to get the LD scores, use the
predict()function on your discriminant function analysis model object, as shown in class.Bonus point: Add the Dikika fossil onto your extant species DFA plot.