Hello, I am looking to have ONLY part II completed of the document I attached. I have written the data set here for the work as well. Let me know! Thanks
Data Set:
How tall are you? (inches)
76
73
73
73
72.5
72
72
71.5
71
71
71
71
71
70
70
70
70
70
69
69
69
69
69
69
68
68
68
68
68
68
67
67
67
67
67
67
67
67
67
66
66
66
66
66
66
66
66
66
66
66
66
66
66
66
66
65.7
65
65
65
65
64
64
64
64
63.8
63
63
63
63
63
63
63
63
63
62
62
62
62
62
62
61
61
61
61
61
61
61
61
61
61
60
60
60
60
Do you regularly go to a gym?
Female: 10yes, 37no
Male: 22yes, 27no
· Question #1: This will be an average / 1-population question · Question #2: This will be a proportion / 2–population question · The project must be typed. (Show all calculations. Calculations may be hand-written) Part I: Data Collection (10 total pts) · Submit your 2 questions on Canvas by May 31st How tall are you? (inches) Do you regularly go to a gym? Part II: One-Population, Average Question (80 total pts) · Question: · (1pt) What is the question you asked the student? How tall are you? (inches) Do you regularly go to a gym? · (2pt) Why are you interested in this question? For example, is it a related to a particular hobby, job, or general interest of yours, and how? (There is no right or wrong answer) · Confidence Interval: a) (4pt) What are the values of the following statistics / parameters (if you know them)? How did you calculate each value (explain with formulas or specific calculator commands used)? · and/or . (Note: some of these will not be applicable to your question) b) (1pt) What is the value of the point estimate of your population parameter? · (6pt) Describe your sample statistic in words. What is the distribution of the sample statistic? Explain why it has this distribution. · (4pt) For c = 0.9, 0.95, & 0.99, calculate the error bound. Show all work, using the error bound formula. · (4pt) Use the error bounds to construct a 90%, 95%, and 99% confidence interval for the population parameter. · (4pt) What calculator command did you use to check your work for the confidence interval? Explain why. · (2pt) Interpret the meaning of each confidence interval in the context of the problem. · (2pt) If you are interested in decreasing the error bound while keeping the confidence level constant, what will happen to the sample size? Explain. · (2pt) If you are interested in increasing the confidence level while keeping the sample size constant, what will happen to the error bound? Explain. · (2pt) If you are interested in increasing the sample size while keeping the error bound constant, what will happen to the confidence level? Explain. · (7pt) How many additional students must you sample for the error bound to be half its size? Do this for c = 0.95. Keep t-score or z-score the same as part d. Show all steps. · Hypothesis Testing: · (7pt) Decide on a claim to test about your 1-population average question, meaning construct Ho and HA. (Find data online about the U.S. or California parameter to construct Ho) Note that even though you are finding data about another population, you are just using this value to compare to your De Anza population, so this is still a 1-population test. Cite your source by providing the website & print-out of webpage citing the parameter. Also, explain in words the meaning of your population parameter. This part may be tough. Ask Danny if you need any help. For example, if your 1 population average question is, "What's the average commute time to campus for a DA student?" go online and find some statistic that mentions the average commute time. It could be to work or school. It could be for students or working professionals. Any statistic would helpful to use as a benchmark to compare your data such as: https://www.cnbc.com/2018/02/22/study-states-with-the-longest-and-shortest-commutes.html From here, you can choose a left-tail, right-tail, or 2-tail test (up to you). If you chose a 2-tail in this example, your null hypothesis would be that the DA commute average is equal to 29.4 minutes, and the alternative hypothesis would be that the DA commute average is not equal to 29.4 minutes. If you are having difficulty finding a statistic online, you can either (1) contact me, and I can help you with your search, or (2) set your own value from which you'd like to compare the De Anza population. For example, if your question is, "What's the average number of units you're taking this summer?" and you just can't find anything online, you could consider the number of units you're taking (perhaps, 8 units), and use that number so that you can determine whether the De Anza average is different from your value. · (2pt) Is your test a left tail, right tail, or 2-tail test? How can you tell? Why did you decide on this (Note: Many times, this will be up to you as a statistician & what you believe are the appropriate inequalities.) · (1pt) What is your significance level? (You can choose your own) · (4pt) What are the type I & type II errors? Explain them in the context of the problem. · (4pt) What is the distribution of the sample statistic? Explain why it has this distribution. · (4pt) What is the value of the test statistic? Show formula & calculations. · (6pt) Calculate is the p-value. Show work. Sketch a graph of the region whose area the p-value represents. Label the values of the horizontal axis with the appropriate values. · (3pt) Define the p-value in the context of the problem. · (3pt) What calculator command did you use to check your work for test statistic & p-value? Explain why. · (3pt) Do you reject or fail to reject the null hypothesis? Why or why not? · (2pt) Interpret your conclusion in the context of the problem. Part III: Two-Population, Proportion Question (50 total pts) · Question: · (1pt) What is the question you asked the student? · (3pt) Why are you interested in this question? For example, is it a related to a particular hobby, job, or general interest of yours, and how? (There is no right or wrong answer) 2.) Hypothesis Testing: · (7pt) Decide on a claim to test, meaning construct Ho and HA. Also, explain in words the meaning of your population parameters. · (3pt) Is your test a left tail, right tail, or 2 tail test? Why did you decide on this (Note: Many times, this will be up to you as a statistician & what you believe are the appropriate inequalities to use.) · (2pt) What is the value of your significance level? You can choose your own. · (4pt) What are the type I & type II errors? Explain them in the context of the problem. · (6pt) Describe the sample statistic in words. What is the distribution of the sample statistic? Explain why it has this distribution. · (8pt) Use your graphing calculator to calculate the test statistic & p-value. State which calculator command you used and explain why. · (6pt) Sketch a graph of the region whose area the p-value represents. Label the values of the horizontal axis with the appropriate values. · (3pt) Define the p-value in the context of the problem. · (4pt) Do you reject or fail to reject the null hypothesis? Why or why not? · (3pt) Interpret your conclusion in the context of the problem. Part IV: Histograms & Plots (Only for Average Question from Part II) (30 total pts) · Histogram · (7pt) Construct a histogram. You can do this by hand (please use a ruler) or on a computer. · (1pt) What does the horizontal axis represent? What does the vertical axis represent? · (1pt) How many bars are there? (Note: It is completely up to you) · (1pt) What is the width of each bar? · Box Plot · (6pt) Construct a box plot. You can do this by hand (please use a ruler) or on a computer. · (4pt) What are the 5 key values AND what do these values represent? · (3pt) What is the IQR? · (3pt) What is the range? · (4pt) Are there any outliers? If so, how many? Explain why or why not using the appropriate calculations.