First, the researcher categorised the data into six location groups and six occupation groups, and calculated the frequencies given below.
Frequency tables
Location category | Frequency | Occupation category | Frequency
Location group A | 5 | Occupation group 1 | 4
Location group B | 7 | Occupation group 2 | 26
Location group C | 12 | Occupation group 3 | 15
Location group D | 25 | Occupation group 4 | 12
Location group E | 10 | Occupation group 5 | 5
Location group F | 6 | Occupation group 6 | 3

Using Excel and the data in the frequency tables above, answer the following questions.
a) Which graphical technique […] graphical chart. Construct and display the chart, and briefly describe what you can observe about the number of individuals belonging to each location category. (3 marks)
b) Which graphical technique or chart should be used if the researcher is interested in comparing the proportion of individuals in each occupation group? Explain the reason for the selection of this graphical chart. Construct and display the chart, and briefly describe what you can observe about the proportion of individuals belonging to each occupation category. (3 marks)
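As a cross-check on the Excel work, the proportions that part (b) asks about can be computed directly from the occupation frequencies. The following is a minimal sketch in Python rather than Excel; the names `occupation_freq` and `relative_frequencies` are illustrative and not part of the assignment.

```python
# Frequencies copied from the occupation frequency table.
occupation_freq = {
    "Occupation group 1": 4,
    "Occupation group 2": 26,
    "Occupation group 3": 15,
    "Occupation group 4": 12,
    "Occupation group 5": 5,
    "Occupation group 6": 3,
}


def relative_frequencies(freq):
    """Return each category's share of the total, as a percentage."""
    total = sum(freq.values())
    return {category: 100 * count / total for category, count in freq.items()}


shares = relative_frequencies(occupation_freq)
for category, share in shares.items():
    print(f"{category}: {share:.1f}%")
```

The frequencies sum to 65, which matches the number of observations stated in Question 4.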
Question 4 (7 marks)
Second, the researcher wishes to use graphical descriptive methods to present summaries of the data on each of the two variables: hours worked per week and yearly income, as stored in file HOURSWORKED.xls.
a) The number of observations (n) is 65 individuals. The researcher suggests using 7 class intervals to construct a histogram for each variable. Explain how the researcher would have decided on the number of class intervals (K) as 7. (2 marks)
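One common rule of thumb that yields K = 7 for n = 65 is Sturges' formula, K = 1 + 3.3 log10(n). The question does not name the rule the researcher used, so treating it as Sturges' formula is an assumption. A short sketch of the arithmetic (the function name `sturges_classes` is illustrative):

```python
import math


def sturges_classes(n):
    """Approximate number of class intervals via Sturges' formula."""
    return 1 + 3.3 * math.log10(n)


n = 65  # number of observations given in the question
k_exact = sturges_classes(n)
k = round(k_exact)  # round to the nearest whole number of classes
print(f"K = {k_exact:.2f}, rounded to {k}")
```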
b) The researcher suggests using class intervals as 10 […] income variable. Explain how the researcher would have decided the width of the above class intervals (or class width). (2 marks)
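A usual approach to class width is to divide the range (largest value minus smallest value) by the number of classes K and round the result up to a convenient number. The sketch below assumes that approach. The minimum and maximum used are hypothetical placeholders, not values from HOURSWORKED.xls, and `class_width` is an illustrative helper.

```python
import math


def class_width(smallest, largest, k, round_to=1):
    """Range divided by K, rounded up to the next multiple of round_to."""
    raw = (largest - smallest) / k
    return math.ceil(raw / round_to) * round_to


# Hypothetical minimum and maximum, for illustration only.
width = class_width(smallest=11, largest=44, k=7, round_to=5)
print(width)
```

Rounding up rather than down keeps K classes wide enough to cover every observation.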
c) Draw and display a histogram for each of the two variables using appropriate BIN values from part (b) and comment on the shape of the two distributions. (3 marks)
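Excel's Histogram tool treats each BIN value as an inclusive upper limit: a value is counted in a bin if it is less than or equal to that bin value and greater than the previous one, with anything above the last bin going to "More". The sketch below mimics that counting rule so the Excel output can be checked. The sample data and bin values are hypothetical, not taken from HOURSWORKED.xls.

```python
from bisect import bisect_left


def histogram_counts(values, bin_upper_limits):
    """Count values per bin, each bin value being an inclusive upper limit.

    The final slot holds values above the last bin (Excel's "More" row).
    bin_upper_limits must be sorted in ascending order.
    """
    counts = [0] * (len(bin_upper_limits) + 1)
    for value in values:
        # Index of the first bin limit that is >= value.
        counts[bisect_left(bin_upper_limits, value)] += 1
    return counts


# Hypothetical data and BIN values, for illustration only.
hours = [12, 15, 20, 20, 27, 33, 41, 46]
bins = [15, 20, 25, 30, 35, 40, 45]
counts = histogram_counts(hours, bins)
print(counts)
```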