Problem 1) (Helpful rubric and hints:REVIEW THE INSTRUCTORS VIDEOS FOR WEEK 2 & REVIEW THE RUBRICS provided for HW Week 4) Use the data set “Prostate” which you can extract from the package “lasso2”...

Problem 1) (Helpful rubric and hints:REVIEW THE INSTRUCTORS VIDEOS FOR WEEK 2 & REVIEW THE RUBRICS provided for HW Week 4) Use the data set “Prostate” which you can extract from the package “lasso2” in R. The linkhttps://rafalab.github.io/pages/649/prostate.html (Links to an external site.) describes the variables in this data set. install the package “lasso2” (10%) use the command library(lasso2) to invoke using the data set “Prostate” (5%) summarize all the variables in this data set (use package “summarytools” and write your results on an external csv file (45%) Plot histograms or boxplots of the distributions of ALL variables in the data set and place them on an external file (40%) Problem 2) (Helpful rubric and hints:REVIEW THE INSTRUCTORS VIDEOS FOR WEEK 4 & REVIEW THE RUBRICS provided for HW Week 4) Use the data set on problem 1 above to perform correlation analyses for Age and each of lpsa and lcavol (35%) Lpsa and each of pgg45 and lweight (35%) For results 1) and 2) extract only a) the variables used for correlation, b) method of correlation, c) estimate of correlation and d) p-value for testing the correlation is zero (15%) Write your results in an external csv file (15%) Problem 3) (Helpful rubric and hints:REVIEW THE INSTRUCTORS VIDEOS FOR WEEK 4-5 & REVIEW THE RUBRICS provided for HW Week 4) Use the data set on problem 1 above to 1) Build a 95% Confidence interval for each ofage, lbph, lcavol and lpsa (20%) perform a ttest of hypothesis using the “t.test” function in R, for the following hypothesis whether age came from a normal distribution with mean =55 versus > 55 (one sample ttest) (25%) lpsa came from a normal distribution with mean =2.5 versus < 2.5="" (one="" sample="" ttest)="" (25%)="" for="" the="" subgroups="" defined="" by="" the="" “gleason="" scores”="">< 7="">= 7, test if lpsa came from a normal distributions with similar mean values (use –two-sided test) (30%) (this is an example on how to use the two sample ttest for testing mean age for the two independent samples defined by subsetting your data according to the gleason score of = 7: > t.test(subset(Prostate$age, Prostate$gleason < 7),="" subset(="" prostate$age,prostate$gleason="">= 7)) Write your results in an external csv file (15%) Question 1-3 (Helpful rubric and hints: REVIEW THE INSTRUCTORS VIDEOS FOR WEEK 2 & REVIEW THE RUBRICS provided for HW Week 4) Use the data set “Prostate” which you can extract from the package “lasso2” in R. The linkhttps://rafalab.github.io/pages/649/prostate.html Links to an external site. (Links to an external site.) describes the variables in this data set. Question 1 install the package “lasso2” (10%) use the command library(lasso2) to invoke using the data set “Prostate” (5%) summarize all the variables in this data set (use package “summarytools” and write your results on an external csv file (45%) Plot histograms or boxplots of the distributions of ALL variables in the data set and place them on an external file (40%) Question 2 (Helpful rubric and hints: REVIEW THE INSTRUCTORS VIDEOS FOR WEEK 4 & REVIEW THE RUBRICS provided for HW Week 4) Use the data set on problem 1 above to perform correlation analyses for Age and each of lpsa and lcavol (35%) Lpsa and each of pgg45 and lweight (35%) For results 1) and 2) extract only a) the variables used for correlation, b) method of correlation, c) estimate of correlation and d) p-value for testing the correlation is zero (15%) Write your results in an external csv file (15%) Question 3 (Helpful rubric and hints: REVIEW THE INSTRUCTORS VIDEOS FOR WEEK 4-5 & REVIEW THE RUBRICS provided for HW Week 4) Use the data set on problem 1 above to 1) Build a 95% Confidence interval for each of age, lbph, lcavol and lpsa (20%) perform a ttest of hypothesis using the “t.test” function in R, for the following hypothesis whether age came from a normal distribution with mean =55 versus > 55 (one sample ttest) (25%) lpsa came from a normal distribution with mean =2.5 versus < 2.5="" (one="" sample="" ttest)="" (25%)="" for="" the="" subgroups="" defined="" by="" the="" “gleason="" scores”="">< 7="">= 7, test if lpsa came from a normal distributions with similar mean values (use –two-sided test) (30%) (this is an example on how to use the two sample ttest for testing mean age for the two independent samples defined by subsetting your data according to the gleason score of = 7: show quoted text> t.test(subset(Prostate$age, Prostate$gleason < 7),="" subset(="" prostate$age,prostate$gleason="">= 7)) Write your results in an external csv file (15%)
Apr 03, 2021
SOLUTION.PDF

Get Answer To This Question

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here