Requirements
This has to be done in a Jupyter Notebook.
You are required to use the dataset contained within the file “us-names-by-decade.csv”, which contains the following features:
Gender – Gender of Individual – (M) Male, (F) Female
Name – First Name of Individual
Decade – 10-Year Period – 1990 = 1990-1999
Count – Number of Individuals that were given the name detailed in the Name feature in the decade indicated in the Decade feature, e.g. “F”, “Olivia”, “2010”, “69799” = the number of Females named Olivia in the years 2010-2019 was 69799
and then perform the following analysis:
You are then required to explain what you plan on doing with the data, e.g. why you chose the specific visualizations. This must be detailed in the Markup of the Jupyter Notebook and include the rationale for your choice.
Generate a plot that details the top 5 Names for each of the Decades.
Plot a graph depicting the distribution of the Female Names in the 1980 decade.
Find out and visualize which decade had the MOST names.
Observe and visualize the average number of names per decade.
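One possible way to approach the four analysis tasks above is sketched below with pandas and matplotlib. It is a starting point under stated assumptions, not a model answer. The column names (Gender, Name, Decade, Count) are assumed from the feature list, the sample rows and their counts are invented placeholders standing in for the real file, and each task is read in one particular way: "top 5" as the five highest Counts per decade with both genders summed, "MOST names" as the largest number of distinct names, and "average" as the mean Count per row. Other readings are equally defensible and should be justified in the notebook.

```python
import pandas as pd
import matplotlib.pyplot as plt

# Invented placeholder rows; in the notebook this frame would come from
# pd.read_csv("us-names-by-decade.csv"). Column names are assumed.
df = pd.DataFrame(
    [
        ("F", "Jennifer", 1980, 500), ("F", "Jessica", 1980, 400),
        ("F", "Amanda", 1980, 300), ("M", "Michael", 1980, 600),
        ("M", "Jason", 1980, 200), ("F", "Sarah", 1980, 100),
        ("F", "Jordan", 1980, 50), ("M", "Jordan", 1980, 70),
        ("F", "Olivia", 2010, 900), ("M", "Noah", 2010, 800),
        ("M", "Liam", 2010, 700),
    ],
    columns=["Gender", "Name", "Decade", "Count"],
)

# Task 1: top 5 names per decade (counts summed across genders).
name_totals = df.groupby(["Decade", "Name"], as_index=False)["Count"].sum()
top5 = (
    name_totals.sort_values(["Decade", "Count"], ascending=[True, False])
    .groupby("Decade")
    .head(5)
)
decades = sorted(top5["Decade"].unique())
fig1, axes = plt.subplots(1, len(decades), figsize=(5 * len(decades), 4), squeeze=False)
for ax, decade in zip(axes[0], decades):
    subset = top5[top5["Decade"] == decade]
    ax.bar(subset["Name"], subset["Count"])
    ax.set_title(f"Top names, {decade}s")
    ax.set_ylabel("Count")

# Task 2: distribution of Count for female names in the 1980 decade.
female_1980 = df[(df["Gender"] == "F") & (df["Decade"] == 1980)]
fig2, ax2 = plt.subplots()
ax2.hist(female_1980["Count"], bins=20)
ax2.set_title("Female names, 1980s")
ax2.set_xlabel("Count per name")
ax2.set_ylabel("Number of names")

# Task 3: decade with the most distinct names.
names_per_decade = df.groupby("Decade")["Name"].nunique()
busiest_decade = names_per_decade.idxmax()
fig3, ax3 = plt.subplots()
ax3.bar(names_per_decade.index.astype(str), names_per_decade.values)
ax3.set_title(f"Distinct names per decade (most: {busiest_decade})")

# Task 4: average Count per row in each decade.
avg_count = df.groupby("Decade")["Count"].mean()
fig4, ax4 = plt.subplots()
ax4.bar(avg_count.index.astype(str), avg_count.values)
ax4.set_title("Average count per name, by decade")
```

With the real file, the Count distribution is likely to be heavily skewed, so a logarithmic axis (for example `ax2.set_xscale("log")`) may be worth considering and explaining in the notebook.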
No additional output will be graded.
You must complete ALL data exploration PROGRAMMATICALLY, without using any tool other than Python.
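As an illustration of what exploring the data programmatically might look like, the sketch below loads the data with pandas and runs a few basic checks. The helper name `load_names` is hypothetical, and the header row (Gender, Name, Decade, Count) is an assumption based on the feature list. The real file may label its columns differently, in which case the expected names would need adjusting. An in-memory string stands in for the file here. Its first data row is the example given in the requirements, and the second is an invented placeholder.

```python
import io

import pandas as pd

# In-memory stand-in for us-names-by-decade.csv (header row is assumed).
sample_csv = io.StringIO(
    "Gender,Name,Decade,Count\n"
    "F,Olivia,2010,69799\n"  # example row from the requirements
    "M,Noah,2010,100\n"      # invented placeholder row
)


def load_names(source):
    """Read the names data and check that the expected columns exist."""
    df = pd.read_csv(source)
    expected = {"Gender", "Name", "Decade", "Count"}
    missing = expected - set(df.columns)
    if missing:
        raise ValueError(f"Missing columns: {sorted(missing)}")
    return df


# In the notebook: names = load_names("us-names-by-decade.csv")
names = load_names(sample_csv)

# Basic checks done in code rather than in a spreadsheet or other tool.
print(names.dtypes)                     # column types
print(names.isna().sum())               # missing values per column
print(names["Gender"].value_counts())   # rows per gender
print(sorted(names["Decade"].unique())) # decades present
```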