Statistics 151 STAT151X
Instructor: Dr. Robert Changirwa
Lab1: Displaying and Describing Distributions
Assigned: Sep 14, 2021 at 3:00 PM
Due: Sep 21, 2021 at 8:30 PM
Name:
Date:
Score:

Lab Exercise 1 – Displaying and describing distributions

In this lab assignment, you will study the Top 100 Most Deadly Natural Disasters of the 20th century. The data set you will be using has been uploaded to MOODLE and is called “natural disasters 20th century”. It is an EXCEL file. Although the data in the file are generally accurate, I have adjusted a few entries to simplify our lab exercise. Consequently, please verify any entries before using the information for other classes or purposes; this data set should not be considered a reliable source of information.

The first objective of this lab is to acquaint you with EXCEL so that you develop the skills and confidence you need to display and analyze data independently. The second objective is to give you an opportunity to explore topics we covered in lecture. Both of these objectives are best accomplished by giving you “hands-on” experience with data sets and EXCEL. Specific topics covered by today’s lab exercise include displaying data using bar graphs and histograms, and calculating some summary statistics.

I strongly encourage you to examine the data in the EXCEL file and familiarize yourself with it in that format. This will help you catch problems with the way your data are coded and therefore prevent problems when you work with the data in EXCEL.

Report Submission: Upload the completed lab WORD and EXCEL files into MOODLE. Ensure the filenames are formatted as Lab#-FirstNameInitial-Jan28-2021 before you upload them into MOODLE. Documents should be properly prepared, labeled, and spellchecked. Do NOT delete questions; instead, press Enter and start typing. EXCEL files will have the extension .XLSX.
To earn maximum points, you must submit the lab by the due date, keep the questions and answers on the same page, and make sure both files are uploaded to MOODLE.

Lab 1 Questions

Use the data from the “Natural Disasters” data set to answer the following questions. Be sure to examine and “clean” your dataset before uploading it to EXCEL. Once in EXCEL, inspect your variables to be sure they are coded properly.

1. a) Obtain and properly label a bar graph of the frequencies (counts) of each type of natural disaster in the 20th century. [2 marks]
   b) What is the most common type of natural disaster in this “Top 100” list? What is the frequency of this type of natural disaster? [2 marks]

2. a) Obtain and properly label a bar graph that shows the average number of people killed by each type of natural disaster. [2 marks]
   b) Which type(s) of natural disaster tend to be deadliest when they occur? [2 marks]

3. a) Obtain and properly label a histogram of the variable “killed”. Adjust the intervals as you think best. [2 marks]
   b) Obtain and properly label a boxplot of the variable “killed”. [2 marks]
   c) How do the figures in parts a) and b) compare in terms of displaying the distribution of the variable “killed”? [2 marks]
   d) Are there any potential outliers in the dataset for “killed”? Explain your answer using the figures produced in parts a) and b), and by using the objective criteria we’ve learned about in lecture. [2 marks]
   e) Create a separate EXCEL data file that removes any outliers identified in parts a) - d). Obtain and properly label a new histogram of the variable “killed” in this separate data set. [2 marks]
   f) Based on (e), obtain and properly label a new boxplot of the variable “killed” in this separate data set. [2 marks]
   g) Compare the histograms in part a) and part e). Comment on the scale of the X-axis in each figure and on whether any data points “become” outliers in the second histogram. [2 marks]

4. a) Obtain and properly label a histogram of the variable “killed” for drought events only. Adjust the intervals as you think best. [2 marks]
   b) Describe the histogram in part a). Consult your notes from class to be sure you have addressed all the features of a histogram that need to be described. [2 marks]
   c) Obtain and tabulate the 5-Number summary of the variable “killed” for drought events. [2 marks]
   d) Use information from parts a) - c) to explain whether the data for drought events have a skewed distribution. Explain your answer fully. [2 marks]

5. Use the information from Questions 3 and 4 to compare the distribution of deaths due to droughts with the distribution of deaths due to all causes. [2 marks]

rawdata

country             year  Disaster    Region     Killed
multiple            1968  Epidemic    Global     700000
multiple            1957  Epidemic    Global     1250000
multiple            1918  Epidemic    Global     50000000
Martinique          1902  Volcano     Caribbean  40000
Guatemala           1976  Earthquake  CentAmer   23000
Guatemala           1949  Flooding    CentAmer   40000
Ethiopia            1973  Drought     EAfrica    100000
Ethiopia            1974  Drought     EAfrica    200000
Ethiopia            1984  Drought     EAfrica    300000
Ethiopia            1972  Drought     EAfrica    600000
Mozambique          1985  Drought     EAfrica    100000
multiple            1943  Drought     EAfrica    35000
Somalia             1974  Drought     EAfrica    19000
Uganda              1901  Epidemic    EAfrica    220000
China               1912  CycHurTyph  EAsia      50000
China               1922  CycHurTyph  EAsia      100000
Hong Kong (China)   1937  CycHurTyph  EAsia      11000
China               1920  Drought     EAsia      501000
China               1928  Drought     EAsia      3000000
China               1907  Earthquake  EAsia      12000
China               1974  Earthquake  EAsia      20000
China               1932  Earthquake  EAsia      70000
China               1920  Earthquake  EAsia      180000
China               1927  Earthquake  EAsia      200000
China               1976  Earthquake  EAsia      242000
Japan               1923  Earthquake  EAsia      143000
China               1910  Epidemic    EAsia      60000
China               1909  Epidemic    EAsia      1500000
China               1933  Flood       EAsia      18000
China               1949  Flood       EAsia      57000
China               1908  Flood       EAsia      100000
China               1911  Flood       EAsia      100000
China               1935  Flood       EAsia      142000
China               1939  Flood       EAsia      500000
China               1959  Flood       EAsia      2000000
China               1931  Flood       EAsia      3700000
China               1954  Flooding    EAsia      30000
China               1938  Flooding    EAsia      500000
China               1930  Storm       EAsia      15000
Soviet Union        1921  Drought     EEurope    1200000
Soviet Union        1932  Drought     EEurope    5000000
Soviet Union        1907  Earthquake  EEurope    12000
Soviet Union        1988  Earthquake  EEurope    25000
Soviet Union        1948  Earthquake  EEurope    110000
Turkey              1939  Earthquake  EEurope    32962
Soviet Union        1955  Epidemic    EEurope    25000
Soviet Union        1949  Landslide   EEurope    12000
Italy               1915  Earthquake  EU         30000
Italy               1908  Earthquake  EU         75000
multiple            1914  Epidemic    Europe     3000000
Sudan               1984  Drought     NAfrica    150000
Morocco             1960  Earthquake  NAfrica    12000
Bangladesh          1961  CycHurTyph  SAsia      11000
Bangladesh          1963  CycHurTyph  SAsia      11500
Bangladesh          1965  CycHurTyph  SAsia      12047
Bangladesh          1965  CycHurTyph  SAsia      36000
Bangladesh          1942  CycHurTyph  SAsia      61000
Bangladesh          1991  CycHurTyph  SAsia      138866
Bangladesh          1970  CycHurTyph  SAsia      300000
Bangladesh          1926  CycHurTyph  SAsia      393000
India               1977  CycHurTyph  SAsia      14204
India               1942  CycHurTyph  SAsia      40000
India               1935  CycHurTyph  SAsia      60000
Bangladesh          1943  Drought     SAsia      1900000
India               1965  Drought     SAsia      487000
India               1966  Drought     SAsia      420000
India               1967  Drought     SAsia      500000
India               1900  Drought     SAsia      1250000
India               1942  Drought     SAsia      1500000
India               1905  Earthquake  SAsia      20000
India               1935  Earthquake  SAsia      56000
Indonesia           1917  Earthquake  SAsia      15000
Iran                1962  Earthquake  SAsia      12000
Iran                1978  Earthquake  SAsia      20000
Iran                1939  Earthquake  SAsia      23000
Iran                1990  Earthquake  SAsia      36000
Pakistan            1935  Earthquake  SAsia      60000
India               1924  Epidemic    SAsia      300000
India               1926  Epidemic    SAsia      423000
India               1920  Epidemic    SAsia      500000
India               1907  Epidemic    SAsia      1300000
India               1920  Epidemic    SAsia      2000000
Bangladesh          1974  Flooding    SAsia      28700
Chile               1939  Earthquake  SouAmer    30000
Peru                1970  Earthquake  SouAmer    66794
Colombia            1985  Volcano     SouAmer    21800
CapeVerde           1900  Drought     WAfrica    11000
CapeVerde           1920  Drought     WAfrica    24000
CapeVerde           1946  Drought     WAfrica    30000
multiple            1972  Drought     WAfrica    62500
multiple            1973  Drought     WAfrica    62500
multiple            1974  Drought     WAfrica    62500
Niger               1910  Drought     WAfrica    11500
Niger               1911  Drought     WAfrica    18050
Niger               1912  Drought     WAfrica    21250
Niger               1913  Drought     WAfrica    17000
Niger               1931  Drought     WAfrica    26000
CapeVerde           1902  Epidemic    WAfrica    50000
Niger               1923  Epidemic    WAfrica    90000
Nigeria             1991  Epidemic    WAfrica    10400
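
If you would like to cross-check your EXCEL results, the sketch below shows one way to compute a 5-Number summary and flag potential outliers in Python. It is only an illustration and makes two assumptions that are not stated in this handout: that the "objective criteria" for outliers is the 1.5 × IQR rule, and that quartiles are computed by linear interpolation (the "inclusive" method in Python's statistics module). Other quartile conventions give slightly different values. The function names are hypothetical, and the seven values are the East Africa drought rows from the table above, used as a small worked example rather than as an answer to any question.

```python
from statistics import quantiles


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max).

    Assumption: quartiles use the 'inclusive' interpolation method;
    your lecture notes may define quartiles differently.
    """
    data = sorted(values)
    q1, median, q3 = quantiles(data, n=4, method="inclusive")
    return data[0], q1, median, q3, data[-1]


def iqr_outliers(values, k=1.5):
    """Flag values outside the fences Q1 - k*IQR and Q3 + k*IQR.

    Assumption: the 1.5 * IQR rule is the outlier criterion in use.
    """
    _, q1, _, q3, _ = five_number_summary(values)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    flagged = [v for v in values if v < lower or v > upper]
    return flagged, (lower, upper)


# "Killed" values for the seven East Africa drought rows in the table above.
east_africa_droughts = [100000, 200000, 300000, 600000, 100000, 35000, 19000]

summary = five_number_summary(east_africa_droughts)
flagged, fences = iqr_outliers(east_africa_droughts)

print("5-Number summary:", summary)
print("Fences (lower, upper):", fences)
print("Potential outliers:", flagged)
```

For this small sample, the only value above the upper fence is 600000. The same two functions can be applied to any column of counts you copy out of the spreadsheet.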