R Markdown
--- title: "Cars" author: "Samia Challal" date: "15/06/2021" output: pdf_document: default html_document: default toc: yes --- ```{r setup, include=FALSE} knitr::opts_chunk$set(echo = TRUE) ``` ``` ``` _You can see a list of data sets in R by taping : data()_ _We will explore here the "cars" data. Tape " cars " and " run " to see the data.frame._ _Organize your questions, answers, and comments on an Rmd file that you will submit._ ``` ``` **1. Do a small research in the internet about this kind of measurements.** **Write a small introduction on the purpose of the study. Illustrate using images, videos, or, .... ** **Provide references for the resources that you use.** ``` ``` ```{r } ``` ``` ``` **2. How many variables are described ?** **Are there categorical or numerical? Specify the type of each variable.** **How many measurements do you have for each variable.** ``` ``` **3. Construct a stem and leaf plot for each variable. How do you describe the shape of each plot? ** (skewed, symmetric, unimodal, outlier, variability…) ``` ``` ```{r } ``` ``` ``` ``` ``` **4. Construct a dotplot for each variable. How do you describe the shape of each plot? ** ``` ``` ```{r } ``` ``` ``` ``` ``` **5. Construct a histogram for each variable. How do you describe the shape of each plot? ** ``` ``` ```{r } ``` ``` ``` ``` ``` **6.For each variable,** **a. Find the five numbers summary and the IQR. ** **b. Locate the lower and upper fences.** **c. Construct a boxplot and identify any outliers.** ``` ``` ```{r } ``` ``` ``` ``` ``` **7. Construct manually a boxplot for each variable. Include all the details as we do it in class. Take a photo and insert it in your Rmd. file. ** ``` ``` Example: the file must be in the same folder as the Rmd.file ```{r } ``` ``` ``` ``` ``` **8. Which plot describes better the data? a stem and leaf plot, a dotplot, a histogram, or a boxplot. ** ``` ``` ```{r } ``` ``` ``` ``` ``` **9. Use a suitable graph that describes a relationship between the two variables. ** **Interpret your results.** ``` ``` ```{r } ``` ``` ``` ``` ``` **10. Discuss about the decisions that should be made.** ``` ``` ```{r } ``` ``` ``` ``` --- title: "Iris" author: "Samia Challal" date: "15/06/2021" output: pdf_document: default word_document: default html_document: default toc: true --- ```{r setup, include=FALSE} knitr::opts_chunk$set(echo = TRUE) ``` ``` ``` _You can see a list of data sets in R by taping : data()_ _We will explore here the "iris" data. Tape " iris " and " run " to see the data.frame._ _Organize your questions, answers, and comments on an Rmd file that you will submit._ ``` ``` **1. Do a small research in the internet about this kind of measurements.** **Write a small introduction on the purpose of the study. Illustrate using images, videos, or, .... ** **Provide references for the resources that you use.** ``` ``` ```{r } ``` ``` ``` **2. How many variables are described ?** **Are there categorical or numerical? Specify the type of each variable.** **How many measurements do you have for each variable.** ``` ``` **3. Take a random sample of size around the third of the total size of the population from iris file.** **Example:** If n is the size of the larger sample, then for forming a sample of size 40, I will first choose randomly 40 numbers from the rows 1 to n. ``` ``` ```{r } #sample40 = sample(1:n, size =40, replace = FALSE) #mydata40= iris[sample40,] ``` ``` ``` ``` ``` **4. Construct a bar chart and a pie chart to describe the variation of a categorical variable from your sample. ** ``` ``` ```{r } ``` ``` ``` ``` ``` **5. Use your own words to describe the data through the two graphs obtained. ** ``` ``` ```{r } ``` ``` ``` ``` ``` **6. Repeat 4 and 5 with samples with larger sizes : half size and total size. ** **How do you describe these data compared to the first sample ? ** ``` ``` ```{r } ``` ``` ``` ``` ``` **7. Which sample describes more accurately the data: the first, the second, or the third ?** ``` ``` ```{r } ``` ``` ``` ``` ``` **8. Construct a stem and leaf plot and a histogram for each quantitative variable. How do you describe the shape of each plot? ** **(skewed, symmetric, unimodal, outlier, variability…) ** ``` ``` ```{r } ``` ``` ``` ``` ``` **9. For each quantitative variable, ** **a. Find the five numbers summary and the IQR. ** **b. Locate the lower and upper fences.** **c. Construct a boxplot and identify any outliers.** ``` ``` ```{r } ``` ``` ``` ``` ``` **10. Use a suitable graph that describes a relationship between the two variables length and width for sepal and petal. ** **Interpret your results. ** ``` ``` ```{r } ``` ``` ```