You will use R to analyse data collected that contains information on the Means of Travel, Time and Distance to Work, School or College.
You will be assigned a county for the purposes of answering the questions listed below.
Steps you will complete include:
· Import data into R
· Using R; Explore the data, remove irrelevant columns and manipulate the remaining heading names to be more clear/intuitive.
· Check for missing, outlier or erroneous values and take appropriate action
· Write R code to perform data analysis to support your answers the questions listed below
· Include a brief reflection section (
· any challenges met, any assumptions made, and any potential issues/weaknesses associated
· with using the dataset for decision making purposes.
Questions
1. What is the most popular mode of transport nationally?
2. How does this (Answer to Q1) compare to the most popular mode of transport in your
assigned county?
3.What differences are evident between the choice of transportation in the cities compared to the other regions?
4.What proportion of commuters leave home outside of the 8-9am rush hour?
5.Are commuters in your assigned county likely to travel for longer than 45 minutes each
morning?
6.How does this (Answer to Q5) compare to other counties in the same NUTS III region?
7.The residents of which five counties experience the longest commute times?
8.What proportion of cars used in the morning commute contain only one person?
9.Which Electoral Division within each Planning Region do you propose should be prioritised
for investment in public transportation?
Use the assigned county of Carlow.
Some packages that can be used are:
· Dplyr
· Stringr
· lubridate
· tidyr