Finish statistics project
Project submission expectations: Your can submit your project in your choice of format as long as the required items are included. A written “report” document is the standard format but there are two examples of more-creative formats in the Term Project Information folder of Laulima Resources. One of these is a YouTube video on superheroes and the other is in a magazine article format on sunscreen. Required elements of your project submission and their contribution toward your project score. ● Introduction and initial expectations and data collection (75 point maximum) ○ The project topic (even if a pre-selected one was done) needs to be stated. Why this topic was chosen is optional for a pre-selected topic but expected for a student-created project topic. ○ Describe what you expected to see at the start of the project, before looking at your data. You will later compare these initial thoughts to your actual data analysis and conclusions. ■ If you chose one of the pre-selected project topics, do you think you will see a difference between age groups, between genders, between frequency of dining out, and/or between engagement status/history? Why do you think you will see such differences? It is also fine to think you will not see any differences between these groups. ● A student who created their own project topic would provide analogous predictions for whatever potentially-explanatory variables they collected data on. ■ For those who plan to make a comparison based on age (as is the case for the pre-selected project topics), what age would you use to separate those who are “older” from those who are “younger”? Why did you choose that age? ● For example, you might say that anyone up to 19 is “younger” and anyone 20 or over is “older” because you think (as one of my former 19-year-old students did) that 20 years old is “old.” ● You might choose instead to treat age as a ratio-level numeric variable and calculate correlation and regression instead of comparing “older” and “younger.” In this case, you would not need to specify the cutoff between “older” and “younger.” ○ Describe how you first planned to collect data and, if appropriate, explain any changes to these initial plans that were made. ● Presentation of the data (100 point maximum). Note that the items listed below are tailored to the pre-selected project topics. Comparable items should be included for project topics that were student created. ○ Demographic summaries, statistics, and graphs are included. ■ Calculate and include the mean and sample standard deviation of the nonzero ages. ● If you will be comparing “older” and “younger” subgroups, also include a table and a pie chart or bar graph showing the percents of your survey participants who are “older,” participants who are “younger,” and participants who declined to reveal an age. ● If you will not be comparing “older” and “younger,” include instead a table and a histogram showing the distribution of ages. ■ Include a table and a pie chart or bar graph showing the gender breakdown of your survey participants (male, female, other/decline to reveal). ■ Calculate and include the mean and sample standard deviation of the nonzero ages of those who identified as male. ■ Calculate and include the mean and sample standard deviation of the nonzero ages of those who identified as female. ■ Calculate and include the mean and sample standard deviation of the nonzero ages of those who identified as other or declined to reveal a gender identity. ○ Overall descriptive statistics are calculated ■ Tip project: ● Calculate and include the mean and sample standard deviation of all of the responses to the appropriate tip percent for just-okay service. ● Calculate and include the mean and sample standard deviation of all of the responses to the appropriate tip percent for excellent service. ■ Engagement ring project: ● Calculate and include the mean and sample standard deviation of all of the responses to the dollar amount cost of an engagement ring. ○ Sub-group data. For these, it would be helpful to use the sort or filter (new tab/window) your data in a spreadsheet. ■ Tip project: ● Calculate and include the mean and sample standard deviation of the just-okay service tip percents only for those who identified as males. ● Calculate and include the mean and sample standard deviation of the just-okay service tip percents only for those who identified as females. ● Calculate and include the mean and sample standard deviation of the excellent service tip percents only for those who identified as males. ● Calculate and include the mean and sample standard deviation of the excellent service tip percents only for those who identified as females. ● If you are separating individuals into “older” and “younger” categories, do not include those who answered 0 for their age (prefer not to reveal) and complete the following: ○ Calculate and include the mean and sample standard deviation of the just-okay service tip percents only for those who are identified as older. ○ Calculate and include the mean and sample standard deviation of the just-okay service tip percents only for those who are identified as younger. ○ Calculate and include the mean and sample standard deviation of the excellent service tip percents only for those who are identified as older. ○ Calculate and include the mean and sample standard deviation of the excellent service tip percents only for those who are identified as younger. https://youtu.be/nSo_Ops5o2w https://youtu.be/nSo_Ops5o2w ● If you are treating age as a ratio-level variable then do not include those who answered 0 for their age (prefer not to reveal) and complete the following: ○ Create a scatter diagram that shows the relationship between the nonzero ages with the just-okay service tip percents. ○ Calculate the correlation coefficient for the nonzero ages and the just-okay service tip percents. ○ Create a scatter diagram that shows the relationship between the nonzero ages with the excellent service tip percents. ○ Calculate the correlation coefficient for the nonzero ages and the excellent service tip percents. ● Calculate and include the mean and sample standard deviation of the just-okay service tip percents only for those who dine out at a sit-down restaurant seldom or never. ● Calculate and include the mean and sample standard deviation of the just-okay service tip percents only for those who dine out at a sit-down restaurant sometimes. ● Calculate and include the mean and sample standard deviation of the just-okay service tip percents only for those who dine out at a sit-down restaurant more frequently. ● Calculate and include the mean and sample standard deviation of the excellent service tip percents only for those who dine out at a sit-down restaurant seldom or never. ● Calculate and include the mean and sample standard deviation of the excellent service tip percents only for those who dine out at a sit-down restaurant sometimes. ● Calculate and include the mean and sample standard deviation of the excellent service tip percents only for those who dine out at a sit-down restaurant more frequently ■ Engagement ring project: ● Calculate and include the mean and sample standard deviation of the engagement ring dollar costs only for those who identified as males. ● Calculate and include the mean and sample standard deviation of the engagement ring dollar costs only for those who identified as females. ● Create and include a contingency table comparing gender with the months’ salary engagement ring cost. ● If you are separating individuals into “older” and “younger” categories, do not include those who answered 0 for their age (prefer not to reveal) and complete the following: ○ Calculate and include the mean and sample standard deviation of the engagement ring dollar costs only for those who are identified as older. ○ Calculate and include the mean and sample standard deviation of the engagement ring dollar costs only for those who are identified as younger. ○ Create and include a contingency table comparing age group (older and younger) with the months’ salary engagement ring cost. ● If you are treating age as a ratio-level variable then do not include those who answered 0 for their age (prefer not to reveal) and complete the following: ○ Create a scatter diagram that shows the relationship between the nonzero ages with the engagement ring dollar costs. ○ Calculate the correlation coefficient for the nonzero ages and the engagement ring dollar costs. ● Calculate and include the mean and sample standard deviation of the engagement ring dollar costs only for those who are or have engaged. ● Calculate and include the mean and sample standard deviation of the engagement ring dollar costs only for those who have never been engaged. ● Create and include a contingency table comparing engagement status/history with the months’ salary engagement ring cost. ● Data-based conclusions (75 point maximum). As before, this section is rather tailored to the pre-selected projects. Comparable comparisons should be made and appropriate conclusions drawn that are supported by the comparisons. ○ Compare the sub-group data with the corresponding overall descriptive statistics. State any differences that you find notable. ■ For example, if the overall mean of the tip percents for excellent service was 22% and the mean of just the males was 23%, you might not find the difference notable. On the other hand, suppose the overall sample standard deviation for the tip percents was 3.6% but the sample standard deviation for the males was 1.8% (half as much), you might want to point that out. ○ State any conclusions that appear reasonable based on the comparisons you made. ■ For example, in the above example comparison, a conclusion might be that while the mean (average) tip percent for excellent service for the males appears to be similar to the mean tip percent for excellent service for everyone, the males had a much lower standard deviation so there appears to be less variability in the tipping percent for excellent service by males than for the general population. ○ State any limitations you feel might exist in your conclusions. ■ A common issue is when there are “too many” of one particular demographic group or another in the survey