Project 1 Example 1 Project 1 Example 1 Introduction The client, Pima County, requested that we analyze a set of water quality data to determine if water quality samples from 2 different water quality...

1 answer below »
r programming


Project 1 Example 1 Project 1 Example 1 Introduction The client, Pima County, requested that we analyze a set of water quality data to determine if water quality samples from 2 different water quality meters were significantly different from each other. One meter recently purchased by the county was cheaper than the other and appeared to be giving readings that were different.The county requested we determine if it was worth it to buy only the older more expensive water quality meters or if the cheap ones would suffice. 30 pH samples for each water meter were provided by Pima County then analyzed in R using a two-sample student t-test. All data and analyses are provided in Appendix I at the end of this document. Results and Discussion We found that the two different water quality meters were significantly different (p = 0.031). The old water quality data meter had averages that were higher than the new quality meter (0.28 units of pH on average) (See Figure 1). Recommendations We recommend testing both water quality meters against a known standard. The two meters are different in their measurements, but it is unclear which one is correct at this time. Given that the average difference was small between measurements it is possible that the change in accuracy between the two meters is unimportant for the use of the county and the significance of this difference may be irrelevant for their purposes. Without information on accuracy, or an idea of permissible level of inaccuracy from the county, we cannot determine whether one water meter is better than the other. Figure 1:Box plot of the data provided by the County Appendix 1: R Script and Simulated Data #Data Simulation oldmeter <- data.frame(x="rnorm(n" =="" 30,="" mean="6.5," sd="1)," y="Old Meter" )="" newmeter=""><- data.frame(x="rnorm(n" =="" 30,="" mean="6.2," sd="1)," y="New Meter" )="" meter=""><- rbind(oldmeter, newmeter) #p value round(t.test(oldmeter$x, newmeter$x, alternative = "two.sided")$p.value,3) #effect size round(mean(oldmeter$x) - mean(newmeter$x),2) #figure boxplot(x~y, data = meter, col = "blue", main = "boxplot of ph data", xlab = "", ylab = "measured ph") introduction results and discussion recommendations appendix 1: r script and simulated data learning outcomes this project is meant to demonstrate that you are capable of doing the following: · create a fake dataset (or find a real one) in r that follows the format required for a one or two-sample t-test. · conduct a one or two-sample t-test in r on a dataset · interpret p-values · create graphics in r of univariate data · describe results and implications of a two-sample t-test · identify scenarios where a two-sample t-test would be pertinent additionally, you will be reviewing the work of other students. in your reviews you are expected to demonstrate mastery over the following learning outcomes: · identify mistakes in r that could lead to code not running · identify misinterpretations of p values or effect sizes · identify misapplications of statistical tests project requirements the project itself is a mock data analysis project for a client of your choosing. you will make up the data (or find real data on the internet to use), make up the clients and their question, and then write a short paper as if it were a real analysis for the client. there are several examples available on d2l for you to model your analyses after. your paper should be written in third person past tense, as this is typical for analytics projects. for full credit you must have all of the following pieces: · data (either real or created by you) that we have not used in class · if the data is real, you need to provide a link to it as your appendix · if the data is fake, you need to provide your code for its creation · if the data is fake, but you made it by hand, you need to provide it as a nicely formatted table · a figure showing that data in a way that is meaningful for the test you are running with: · legible and meaningful titles and axis labels · a caption describing the figure · no issues with x axes not lining up etc. · histograms, density diagrams, box plots, dot plots – all are fine just make sure you use them correctly · an r script showing all of your steps for analyses and figure creation · which runs with only a file path change · which accurately replicates your results · one of the following analyses used and described correctly, with the correct alternative specified in your code: · a one sample student t-test · a two sample student t-test · a p value interpreted correctly, and rounded to no more than three decimal places · some measure of effect size rounded to no more than three decimal places, like differences between the averages of two samples for difference between the average of a sample and the expected population mean · a one page written report in size 12 font that contains the following sections: · introduction: a section introducing the clients requesting the project and a little bit of information about the project itself · results and description: discussing the method of analyses used, the results, and a graphic of the data · recommendations: a section dictating what the client should do next, depending on what their hypothetical question was · the one page paper does not include the data or the r script in the page length requirements. the r script and the data should be pasted on as consecutive pages in one document. peer review requirements in addition to creating a one page write up, you will also be required to review the project of two other students. the peer reviews of your classmates will not be used in the calculation of your project grade. instead your grade will come from your own project, and the two peer reviews that you provide. in your review you will evaluate each other’s projects for accuracy and clarity. you will be asked to evaluate each other’s projects on: · whether or not they did the analysis correctly · whether or not they made a recommendation that was in line with their data · whether or not their figure was appropriate for their data set all comments should be constructive and polite and areas of confusion should be pointed out kindly. the goal of the peer review process is to help you learn how to critique other people’s work and identify weaknesses or failures in their analyses. as such if the section leaders or myself determined that there are flaws in the data analysis of a project and your peer review does not pick it up your peer review will be graded down. rubrics the following rubrics are meant as guidelines but may change once papers have been submitted, if there is a common problem that i did not anticipate but that i feel is fair to grade students down on. project rubric those items indicated in the peer review column are ones that peer reviewers will be expected to look at. if not indicated, the peer reviewer is not expected to evaluate that item. if you have helpful feedback on that item that is acceptable, but you do not have to provide it. rubric item peer review? high score r script is provided as an appendix and runs without major modifications other than changing the file path name yes 5 code for the analyses and figures are provided in the r script yes 5 data, or a link to the data are provided and can be replicated easily yes 3 a figure is provided with an informative caption 2 the figure is legible and professional, and has meaningful labels 2 there is not a figure that would have been a significantly better choice (i.e. you put two histograms on top of each other and now we cannot see the data in the back, when you should’ve used two separate rows for each histogram) yes 2 the figure axes are adjusted for maximum legibility i.e. if you have two panels they should both have the same limits and breaks in a histogram yes 2 the figure is called out inline in the text (i.e. see distributions in figure 1) 1 the paper is only one page long, and has not obviously changed the margins or the font or the font size in order to meet requirements 2 the paper has the three sections required and the information in each section is reflective of the section heading 3 the introduction describes the client and the data and the question yes 3 the introduction specifically states what statistical test is being used yes 1 the statistical test is correct for the data used (i.e. when you are comparing a sample to a population mean, you use a one sample t-test) yes 3 the statistical test is run correctly in the r script, and is the statistical test listed in the introduction yes 2 the statistical test code specifies what alternative the test is, and this is accurate according to the question in the paper (i.e. if the question is whether sample two is bigger than sample one, the alternative should be greater and the samples should be in the right order) yes 2 the results and discussion provide a p value with no more than three decimal places (i.e. 0.001, not 0.001259) 2 the results and discussions provide a measure of effect size with no more than three decimal places (i.e. sample one is .567 cm longer than sample two on average) 2 the results and discussion section interprets the p value correctly for the statistical test run yes 3 the recommendation section correctly advocates for action or inaction depending on the question asked and the p value received yes 3 the overall report is written in a professional tone in third person past tense 2 if there are grammar or spelling mistakes they do not interfere with the legibility of the paper 3 total: 50 note: because the peer review process is important, papers that are turned in late will be docked up to 15 points, five points for every day it is delayed. peer review rubric rubric item points mistakes in the analyses are identified and described if present (i.e. if the paper ran a one sample t-test but should have done a two sample t-test) 3 mistakes in the interpretation of the p value or the effect size are identified and described if present 2 errors in the recommendations are identified and described if present 1 issues in the figure like legible axes or inappropriate uses of histograms are identified and described if present 2 issues in the r script running correctly or the data set being easy to use are identified and described if present rbind(oldmeter,="" newmeter)="" #p="" value="" round(t.test(oldmeter$x,="" newmeter$x,="" alternative="two.sided" )$p.value,3)="" #effect="" size="" round(mean(oldmeter$x)="" -="" mean(newmeter$x),2)="" #figure="" boxplot(x~y,="" data="meter," col="blue" ,="" main="Boxplot of pH Data" ,="" xlab="" ,="" ylab="Measured pH" )="" introduction="" results="" and="" discussion="" recommendations="" appendix="" 1:="" r="" script="" and="" simulated="" data="" learning="" outcomes="" this="" project="" is="" meant="" to="" demonstrate="" that="" you="" are="" capable="" of="" doing="" the="" following:="" ·="" create="" a="" fake="" dataset="" (or="" find="" a="" real="" one)="" in="" r="" that="" follows="" the="" format="" required="" for="" a="" one="" or="" two-sample="" t-test.="" ·="" conduct="" a="" one="" or="" two-sample="" t-test="" in="" r="" on="" a="" dataset="" ·="" interpret="" p-values="" ·="" create="" graphics="" in="" r="" of="" univariate="" data="" ·="" describe="" results="" and="" implications="" of="" a="" two-sample="" t-test="" ·="" identify="" scenarios="" where="" a="" two-sample="" t-test="" would="" be="" pertinent="" additionally,="" you="" will="" be="" reviewing="" the="" work="" of="" other="" students.="" in="" your="" reviews="" you="" are="" expected="" to="" demonstrate="" mastery="" over="" the="" following="" learning="" outcomes:="" ·="" identify="" mistakes="" in="" r="" that="" could="" lead="" to="" code="" not="" running="" ·="" identify="" misinterpretations="" of="" p="" values="" or="" effect="" sizes="" ·="" identify="" misapplications="" of="" statistical="" tests="" project="" requirements="" the="" project="" itself="" is="" a="" mock="" data="" analysis="" project="" for="" a="" client="" of="" your="" choosing.="" you="" will="" make="" up="" the="" data="" (or="" find="" real="" data="" on="" the="" internet="" to="" use),="" make="" up="" the="" clients="" and="" their="" question,="" and="" then="" write="" a="" short="" paper="" as="" if="" it="" were="" a="" real="" analysis="" for="" the="" client.="" there="" are="" several="" examples="" available="" on="" d2l="" for="" you="" to="" model="" your="" analyses="" after.="" your="" paper="" should="" be="" written="" in="" third="" person="" past="" tense,="" as="" this="" is="" typical="" for="" analytics="" projects.="" for="" full="" credit="" you="" must="" have="" all="" of="" the="" following="" pieces:="" ·="" data="" (either="" real="" or="" created="" by="" you)="" that="" we="" have="" not="" used="" in="" class="" ·="" if="" the="" data="" is="" real,="" you="" need="" to="" provide="" a="" link="" to="" it="" as="" your="" appendix="" ·="" if="" the="" data="" is="" fake,="" you="" need="" to="" provide="" your="" code="" for="" its="" creation="" ·="" if="" the="" data="" is="" fake,="" but="" you="" made="" it="" by="" hand,="" you="" need="" to="" provide="" it="" as="" a="" nicely="" formatted="" table="" ·="" a="" figure="" showing="" that="" data="" in="" a="" way="" that="" is="" meaningful="" for="" the="" test="" you="" are="" running="" with:="" ·="" legible="" and="" meaningful="" titles="" and="" axis="" labels="" ·="" a="" caption="" describing="" the="" figure="" ·="" no="" issues="" with="" x="" axes="" not="" lining="" up="" etc.="" ·="" histograms,="" density="" diagrams,="" box="" plots,="" dot="" plots="" –="" all="" are="" fine="" just="" make="" sure="" you="" use="" them="" correctly="" ·="" an="" r="" script="" showing="" all="" of="" your="" steps="" for="" analyses="" and="" figure="" creation="" ·="" which="" runs="" with="" only="" a="" file="" path="" change="" ·="" which="" accurately="" replicates="" your="" results="" ·="" one="" of="" the="" following="" analyses="" used="" and="" described="" correctly,="" with="" the="" correct="" alternative="" specified="" in="" your="" code:="" ·="" a="" one="" sample="" student="" t-test="" ·="" a="" two="" sample="" student="" t-test="" ·="" a="" p="" value="" interpreted="" correctly,="" and="" rounded="" to="" no="" more="" than="" three="" decimal="" places="" ·="" some="" measure="" of="" effect="" size="" rounded="" to="" no="" more="" than="" three="" decimal="" places,="" like="" differences="" between="" the="" averages="" of="" two="" samples="" for="" difference="" between="" the="" average="" of="" a="" sample="" and="" the="" expected="" population="" mean="" ·="" a="" one="" page="" written="" report="" in="" size="" 12="" font="" that="" contains="" the="" following="" sections:="" ·="" introduction:="" a="" section="" introducing="" the="" clients="" requesting="" the="" project="" and="" a="" little="" bit="" of="" information="" about="" the="" project="" itself="" ·="" results="" and="" description:="" discussing="" the="" method="" of="" analyses="" used,="" the="" results,="" and="" a="" graphic="" of="" the="" data="" ·="" recommendations:="" a="" section="" dictating="" what="" the="" client="" should="" do="" next,="" depending="" on="" what="" their="" hypothetical="" question="" was="" ·="" the="" one="" page="" paper="" does="" not="" include="" the="" data="" or="" the="" r="" script="" in="" the="" page="" length="" requirements.="" the="" r="" script="" and="" the="" data="" should="" be="" pasted="" on="" as="" consecutive="" pages="" in="" one="" document.="" peer="" review="" requirements="" in="" addition="" to="" creating="" a="" one="" page="" write="" up,="" you="" will="" also="" be="" required="" to="" review="" the="" project="" of="" two="" other="" students.="" the="" peer="" reviews="" of="" your="" classmates="" will="" not="" be="" used="" in="" the="" calculation="" of="" your="" project="" grade.="" instead="" your="" grade="" will="" come="" from="" your="" own="" project,="" and="" the="" two="" peer="" reviews="" that="" you="" provide.="" in="" your="" review="" you="" will="" evaluate="" each="" other’s="" projects="" for="" accuracy="" and="" clarity.="" you="" will="" be="" asked="" to="" evaluate="" each="" other’s="" projects="" on:="" ·="" whether="" or="" not="" they="" did="" the="" analysis="" correctly="" ·="" whether="" or="" not="" they="" made="" a="" recommendation="" that="" was="" in="" line="" with="" their="" data="" ·="" whether="" or="" not="" their="" figure="" was="" appropriate="" for="" their="" data="" set="" all="" comments="" should="" be="" constructive="" and="" polite="" and="" areas="" of="" confusion="" should="" be="" pointed="" out="" kindly.="" the="" goal="" of="" the="" peer="" review="" process="" is="" to="" help="" you="" learn="" how="" to="" critique="" other="" people’s="" work="" and="" identify="" weaknesses="" or="" failures="" in="" their="" analyses.="" as="" such="" if="" the="" section="" leaders="" or="" myself="" determined="" that="" there="" are="" flaws="" in="" the="" data="" analysis="" of="" a="" project="" and="" your="" peer="" review="" does="" not="" pick="" it="" up="" your="" peer="" review="" will="" be="" graded="" down.="" rubrics="" the="" following="" rubrics="" are="" meant="" as="" guidelines="" but="" may="" change="" once="" papers="" have="" been="" submitted,="" if="" there="" is="" a="" common="" problem="" that="" i="" did="" not="" anticipate="" but="" that="" i="" feel="" is="" fair="" to="" grade="" students="" down="" on.="" project="" rubric="" those="" items="" indicated="" in="" the="" peer="" review="" column="" are="" ones="" that="" peer="" reviewers="" will="" be="" expected="" to="" look="" at.="" if="" not="" indicated,="" the="" peer="" reviewer="" is="" not="" expected="" to="" evaluate="" that="" item.="" if="" you="" have="" helpful="" feedback="" on="" that="" item="" that="" is="" acceptable,="" but="" you="" do="" not="" have="" to="" provide="" it.="" rubric="" item="" peer="" review?="" high="" score="" r="" script="" is="" provided="" as="" an="" appendix="" and="" runs="" without="" major="" modifications="" other="" than="" changing="" the="" file="" path="" name="" yes="" 5="" code="" for="" the="" analyses="" and="" figures="" are="" provided="" in="" the="" r="" script="" yes="" 5="" data,="" or="" a="" link="" to="" the="" data="" are="" provided="" and="" can="" be="" replicated="" easily="" yes="" 3="" a="" figure="" is="" provided="" with="" an="" informative="" caption="" 2="" the="" figure="" is="" legible="" and="" professional,="" and="" has="" meaningful="" labels="" 2="" there="" is="" not="" a="" figure="" that="" would="" have="" been="" a="" significantly="" better="" choice="" (i.e.="" you="" put="" two="" histograms="" on="" top="" of="" each="" other="" and="" now="" we="" cannot="" see="" the="" data="" in="" the="" back,="" when="" you="" should’ve="" used="" two="" separate="" rows="" for="" each="" histogram)="" yes="" 2="" the="" figure="" axes="" are="" adjusted="" for="" maximum="" legibility="" i.e.="" if="" you="" have="" two="" panels="" they="" should="" both="" have="" the="" same="" limits="" and="" breaks="" in="" a="" histogram="" yes="" 2="" the="" figure="" is="" called="" out="" inline="" in="" the="" text="" (i.e.="" see="" distributions="" in="" figure="" 1)="" 1="" the="" paper="" is="" only="" one="" page="" long,="" and="" has="" not="" obviously="" changed="" the="" margins="" or="" the="" font="" or="" the="" font="" size="" in="" order="" to="" meet="" requirements="" 2="" the="" paper="" has="" the="" three="" sections="" required="" and="" the="" information="" in="" each="" section="" is="" reflective="" of="" the="" section="" heading="" 3="" the="" introduction="" describes="" the="" client="" and="" the="" data="" and="" the="" question="" yes="" 3="" the="" introduction="" specifically="" states="" what="" statistical="" test="" is="" being="" used="" yes="" 1="" the="" statistical="" test="" is="" correct="" for="" the="" data="" used="" (i.e.="" when="" you="" are="" comparing="" a="" sample="" to="" a="" population="" mean,="" you="" use="" a="" one="" sample="" t-test)="" yes="" 3="" the="" statistical="" test="" is="" run="" correctly="" in="" the="" r="" script,="" and="" is="" the="" statistical="" test="" listed="" in="" the="" introduction="" yes="" 2="" the="" statistical="" test="" code="" specifies="" what="" alternative="" the="" test="" is,="" and="" this="" is="" accurate="" according="" to="" the="" question="" in="" the="" paper="" (i.e.="" if="" the="" question="" is="" whether="" sample="" two="" is="" bigger="" than="" sample="" one,="" the="" alternative="" should="" be="" greater="" and="" the="" samples="" should="" be="" in="" the="" right="" order)="" yes="" 2="" the="" results="" and="" discussion="" provide="" a="" p="" value="" with="" no="" more="" than="" three="" decimal="" places="" (i.e.="" 0.001,="" not="" 0.001259)="" 2="" the="" results="" and="" discussions="" provide="" a="" measure="" of="" effect="" size="" with="" no="" more="" than="" three="" decimal="" places="" (i.e.="" sample="" one="" is="" .567="" cm="" longer="" than="" sample="" two="" on="" average)="" 2="" the="" results="" and="" discussion="" section="" interprets="" the="" p="" value="" correctly="" for="" the="" statistical="" test="" run="" yes="" 3="" the="" recommendation="" section="" correctly="" advocates="" for="" action="" or="" inaction="" depending="" on="" the="" question="" asked="" and="" the="" p="" value="" received="" yes="" 3="" the="" overall="" report="" is="" written="" in="" a="" professional="" tone="" in="" third="" person="" past="" tense="" 2="" if="" there="" are="" grammar="" or="" spelling="" mistakes="" they="" do="" not="" interfere="" with="" the="" legibility="" of="" the="" paper="" 3="" total:="" 50="" note:="" because="" the="" peer="" review="" process="" is="" important,="" papers="" that="" are="" turned="" in="" late="" will="" be="" docked="" up="" to="" 15="" points,="" five="" points="" for="" every="" day="" it="" is="" delayed.="" peer="" review="" rubric="" rubric="" item="" points="" mistakes="" in="" the="" analyses="" are="" identified="" and="" described="" if="" present="" (i.e.="" if="" the="" paper="" ran="" a="" one="" sample="" t-test="" but="" should="" have="" done="" a="" two="" sample="" t-test)="" 3="" mistakes="" in="" the="" interpretation="" of="" the="" p="" value="" or="" the="" effect="" size="" are="" identified="" and="" described="" if="" present="" 2="" errors="" in="" the="" recommendations="" are="" identified="" and="" described="" if="" present="" 1="" issues="" in="" the="" figure="" like="" legible="" axes="" or="" inappropriate="" uses="" of="" histograms="" are="" identified="" and="" described="" if="" present="" 2="" issues="" in="" the="" r="" script="" running="" correctly="" or="" the="" data="" set="" being="" easy="" to="" use="" are="" identified="" and="" described="" if="">
Answered Same DayOct 01, 2021

Answer To: Project 1 Example 1 Project 1 Example 1 Introduction The client, Pima County, requested that we...

Sudharsan.J answered on Oct 02 2021
150 Votes
Project-1
Introduction:
A dataset is based on the factors on forest fires in Northeast Portugal. W
hich includes weather factors and categorical variables months, haif year and weeks days. 14 variables, 517 observations. Here we determine whether there is high swing of wind in 1st half year when compared with swing of wind in 2nd half year. 1st half year indicates month of January-June, whereas 2nd half year indicated month of July- Dec.
The data and code used for analysis are provided in appendix-1 at the end of the document.
Test of Hypothesis:
Ho: ρ=0
HA: ρ>0, level of significance=0.05
Null hypothesis: there was no significant...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here