Assignment
For this assignment, you will use the dataset weights.csv, which records individual weight events for customers. The objective of this assignment is to carry out a two-sample t-test to find out:
1. Does the use of receptacle type 3 change the weights for receptacle type 1? If so, how?
2. Does the use of receptacle type 3 change the total weight produced? If so, how?
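As a rough illustration of the test itself, the sketch below runs a two-sample t-test in base R. The data are simulated purely for illustration and are not taken from weights.csv; the vector names type1_only and type1_and_3 are hypothetical. It uses the Welch variant, which is the default in t.test and does not assume equal variances; whether that is the right variant for this assignment is a judgement to make after exploring the data.

```r
# Illustrative data only: simulated weights, not values from weights.csv.
set.seed(42)
type1_only  <- rnorm(30, mean = 50, sd = 8)  # stand-in: clCodes using only receptType 1
type1_and_3 <- rnorm(30, mean = 45, sd = 8)  # stand-in: clCodes using receptType 1 and 3

# Welch two-sample t-test (var.equal = FALSE is the default in t.test)
result <- t.test(type1_only, type1_and_3, alternative = "two.sided")

result$p.value    # strength of evidence against "no difference in means"
result$estimate   # the two sample means, which show the direction of any change
result$conf.int   # 95% confidence interval for the difference in means
```

The p-value speaks to the "does it change" part of each question, while the sample means and the confidence interval speak to the "how" part.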
You will prepare and explore the dataset using R data cleaning and analysis techniques, discuss the trends you discover, and plot any points of interest.
Steps should include:
Transform the dataset to summarise the weights for each clCode by receptType. (Use dplyr)
Create a new variable for each clCode to identify whether it uses both receptType = 3 and receptType = 1, or only receptType = 1. This variable should allow you to create two independent samples. (Use dplyr)
Check for and handle missing values.
Any clCode beginning with B has been included in error and can be excluded from the analysis.
Identify any outlier accounts and produce a list of them. Any outlier clCodes can be excluded from the analysis.
Plot data and identify trends and/or points of interest.
Discuss your findings and prepare explanatory visualisations in an R Markdown (.Rmd) file.
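The data preparation steps above could be sketched with dplyr roughly as follows. This is a minimal sketch under stated assumptions, not a model answer: the toy data frame stands in for read.csv("weights.csv"), the column name weight is assumed (the brief only names clCode and receptType), dropping rows with missing weights is just one way to handle them, and the 1.5 * IQR rule is one common outlier convention among several.

```r
library(dplyr)

# Toy stand-in for read.csv("weights.csv"); the `weight` column name is an assumption.
weights <- data.frame(
  clCode     = c("A1", "A1", "A1", "A2", "A2", "A3", "B1", "A4"),
  receptType = c(1, 1, 3, 1, 1, 1, 1, 3),
  weight     = c(10, 12, 5, 20, NA, 15, 99, 7),
  stringsAsFactors = FALSE
)

# Summarise weights for each clCode by receptType
per_type <- weights %>%
  filter(!is.na(weight)) %>%             # assumed handling of missing values: drop them
  filter(!startsWith(clCode, "B")) %>%   # clCodes beginning with B were included in error
  group_by(clCode, receptType) %>%
  summarise(totalWeight = sum(weight), .groups = "drop")

# One row per clCode, with a grouping variable defining the two samples
per_client <- per_type %>%
  group_by(clCode) %>%
  summarise(
    type1Weight = sum(totalWeight[receptType == 1]),
    allWeight   = sum(totalWeight),
    usesType1   = any(receptType == 1),
    usesType3   = any(receptType == 3),
    .groups = "drop"
  ) %>%
  filter(usesType1) %>%                  # both samples require receptType 1
  mutate(group = if_else(usesType3, "type1_and_3", "type1_only"))

# List outlier accounts using the 1.5 * IQR rule (an assumed convention)
q <- quantile(per_client$allWeight, c(0.25, 0.75), names = FALSE)
fence <- 1.5 * (q[2] - q[1])
outliers <- per_client %>%
  filter(allWeight < q[1] - fence | allWeight > q[2] + fence)
```

With the real data, per_client would feed the two tests: a call such as t.test(type1Weight ~ group, data = per_client) relates to the first question and t.test(allWeight ~ group, data = per_client) to the second. The toy data here are too small for either call to be meaningful.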