For this assignment, you choose a dataset and perform EDA and exploratory data visualization to better understand the data, examine some initial questions / hypotheses that you have, and report on these questions and hypotheses at the close of your analysis. The deliverable for the assignment is a document containing visualizations that are captioned, and that conveys information and insights (our textbook author would say “pearls”) that you learned from analysis.
DO NOT USE A DATASET FROM KAGGLE OR OTHER ONLINE CO-WORKING PLATFORMS. You need to do your own analysis and chart your own path!
Choose a dataset that will have the depth and breadth to allow you to perform a rigorous analysis, and practice using the tools you’ve learned about in this course. Below are some suggestions to get you started, although you are free to use others. If you are wondering if a dataset is too simple or inappropriate for this assignment, send me a link to the dataset and we can talk about it.
Tools to Use for The Assignment
You will use R + Rstudio + ggplot2
The Assignment Deliverable
Already registered? Login
Not Account? Sign up
Enter your email address to reset your password
Back to Login? Click here