Use the following data on median values of single detached houses of Canadian residents in 20 census metropolitan areas in British Columbia and Ontario in 2017 (source: Statistics Canada) to prepare a statistical report. The data are reported in units of a hundred thousand dollars rounded to the nearest ten thousand dollars (so, for example, 5.7 represents $570,000).
Data: 5.7, 5.2, 12.6, 6.4, 3.7, 2.1, 2.9, 2.4, 4.2, 4.3, 2.9, 3.6, 2.6, 4.2, 4.2, 2.7, 2.5, 2.2, 7.2, 1.9.
(Download the Assignment 5 Data xlsx file attached to this assignment)
The report should include (marks in brackets):
A dotplot or a histogram of the data. Note that you’ll have to group the data into suitable,equal-sizedintervals before drawing your graph. (3)
A pie graph of the data showing the percentages of the sample in the following categories: 1-3, 3-5, 5-7, 7-10, and 10 or higher. (3)
A brief discussion of what the data displays in Q1 and Q2 show for this dataset. (2)
The mean and the median, together with a brief discussion of which of these is the more appropriate measure of what is typical or representative for this dataset. (3)
The 5-number summary of the data (i.e., the minimum, lower quartile, median, upper quartile, and maximum). (5)[Hint: Look at the Lesson 14 Supplementary Material for details of how to calculate the quartiles.]
The range of the data and the inter-quartile range of the data, together with a brief discussion of exactly what the inter-quartile range represents for this dataset. (3)
A brief discussion of any outliers that are present in the data. (2)
The following probability calculations, including reasoning.
Suppose we select one census metropolitan area at random from the sample of 20. What is the probability that it has a single detached house median value greater than $500k? (2)
Suppose we select one census metropolitan area at random from the sample of 20. What is the probability that it has a single detached house median value greater than $500k or less than $200k? (2)
Suppose we select two census metropolitan areas at random from the sample of 20. What is the probability thatbothhave a single detached house median value greater than $500k? (2)
A brief discussion of whether it would be appropriate to use your findings to make inferences about the median values of single detached houses in all of Canada. (3)
Grading Criteria
30 marks total:
Q1-3: Data displays are constructed correctly, displayed clearly, and interpretations are correct and clearly explained. (3 + 3 + 2 = 8 marks)
Q4-7: Numerical summaries are calculated correctly, presented clearly, and interpretations are correct and clearly explained. (3 + 5 + 3 + 2 = 13 marks)
Q8: Probability calculations are correct, presented clearly, and reasoning is clearly explained. (2 + 2 + 2 = 6 marks)
Q9: Statistical inference conclusions are appropriate and carefully reasoned. (3 marks)