BUS708 Statistics and Data Analysis Inferential Statistics Report Assignment 2 (Assessment 4) – Individual Word Report – Trimester 1, 2020 1 OVERVIEW OF THE ASSIGNMENT This assignment will test your...

1 answer below »
I need a no mistake assessment pls do it with full focus pls


BUS708 Statistics and Data Analysis Inferential Statistics Report Assignment 2 (Assessment 4) – Individual Word Report – Trimester 1, 2020 1 OVERVIEW OF THE ASSIGNMENT This assignment will test your skill to present and summarise data as well as to make basic statistical inferences in a business context. You will use the results and any feedback given in Assignment 1 (Assessment 3, Excel Report) and produce a single report in a word document. You will need to construct interval estimates, perform suitable hypothesis tests and regression analysis and make conclusion and suggestion for management action. Your report should be written in a word document and should be submitted to Turnitin following the requirement explained below. 2 TASK DESCRIPTION There are two datasets involved in this assignment: Dataset 1 and Dataset 2, which are the same datasets used in Assignment 1 (Excel Report). All data processing should be performed in Excel or Statkey (http://www.lock5stat.com/StatKey). Specific instruction as to which tools should be used for each section will be given during tutorials. Your tasks are to answer the following research questions given in Section 2 to Section 6 below using dataset 1 or dataset 2 as indicated in each section. To answer each question, you will need to first present the relevant numerical summary (summary statistics) and graphical display and perform suitable statistical analysis to provide a conclusion. Your tasks are described below. 1. Section 1: Introduction Provide a brief and clear introduction about the report (e.g. the objective of the report, the datasets involved, etc.). Find relevant articles (minimum one article, maximum 3 articles) and write a proper literature review which includes in-text citation. 2. Section 2: Is Flat makes up about 50% of Dwelling Type? Using Dataset 1, first provide both numerical summary as well as graphical display that easily shows the proportions of dwelling type. Then construct a 95% confidence interval of the population proportion of dwelling type. Finally, answer the research question using the confidence interval. http://www.lock5stat.com/StatKey 3. Section 3: Is the Average Weekly Rent of Flats in Sydney More than $800? Using Dataset 1, first describe the weekly rent distribution of Flats in Sydney (postcode 2000). You need to provide numerical summary (sample size, mean, standard deviation and median) as well as graphical display which shows any outliers. Then perform a suitable hypothesis test to answer the research question above at 5% level of significance. 4. Section 4: Is there a difference in Weekly Rent among five different postcodes? Using Dataset 1, describe the distribution of Weekly Rent for each of the following postcodes: 2000 (Sydney), 2017 (Waterloo), 2145 (Westmead), 2150 (Parramatta), and 2170 (Liverpool). You need to provide both numerical summary as well as graphical display which shows any outliers. Then perform a suitable hypothesis test to answer the research question above. Use a 5% significance level. 5. Section 5: Can we predict the Weekly Rent for flats in Sydney using the Number of Bedrooms? Using Dataset 1, first describe the relationship between the weekly rent and the number of bedrooms for flats in Sydney. You need to provide both numerical summary as well as graphical display. Then interpret the correlation coefficient, coefficient determination and the relevant p-values and use them to answer the research question. 6. Section 6: Is there any relationship between country of origin and suburb where international students live? Using Dataset 2, describe the relationship between the country of origin of an international student and the suburb they currently live in. You need to provide both numerical summary and graphical display. Then perform a suitable hypothesis test to answer the research question above. Use a 5% significance level. 7. Section 7: Conclusion Write a summary of all the findings in the previous sections and then write concluding statements that would benefit a stake holder (e.g. an investor or a renter) to take management action. Finally, suggest further research by discussing an interesting topic or a research question that can be further explored related to the datasets. 3 SUBMISSION REQUIREMENT Deadline to submit the report: Monday, 1st June 2020, 23:59 (11:59pm) You need to submit a word document file to Turnitin which shows all computer outputs and discussion. You do not need to submit the dataset. 4 MARKING CRITERIA Students are advised to read the marking rubric provided on Moodle as well as detailed marking criteria based on this rubric. 5 DEDUCTION, LATE SUBMISSION AND EXTENSION Late submission penalty: - 5% of the total available marks per calendar day unless an extension is approved. This means 0.75 marks (out of 15 marks) per day. For extension application procedure, please refer to Section 3.3 of the Subject Outline. Please do NOT email the lecturer or tutor to seek an extension, you need to follow the procedure described in the Subject Outline. 6 PLAGIARISM Please read Section 3.4 Plagiarism and Referencing, from the Subject Outline. Below is part of the statement: “Students plagiarising run the risk of severe penalties ranging from a reduction through to 0 marks for a first offence for a single assessment task, to exclusion from KOI in the most serious repeat cases. Exclusion has serious visa implications.” “Authorship is also an issue under Plagiarism – KOI expects students to submit their own original work in both assessment and exams, or the original work of their group in the case of a group project. All students agree to a statement of authorship when submitting assessments online via Moodle, stating that the work submitted is their own original work. The following are examples of academic misconduct and can attract severe penalties: • Handing in work created by someone else (without acknowledgement), whether copied from another student, written by someone else, or from any published or electronic source, is fraud, and falls under the general Plagiarism guidelines. • Students who willingly allow another student to copy their work in any assessment may be considered to assisting in copying/cheating, and similar penalties may be applied. ” Section 1 DATA SET 1 DESCRIPTION The data is a secodary data since it is obtained from an already existing source. It was collected from Fair Trading Website (https://www.fairtrading.nsw.gov.au/about-fair-trading/data-and-statistics/rental-bond-data) and it is a subset of "Rental bond lodgement data 2019." The data has got four variables both categorical and numeric. Postal code and number of bedrooms are categorical varibles of the nominal type since values in them are labels with no significant value. They indicate the postal code of the suburb and number of bedrooms in a dwelling type respectively. Dwelling type is a categorical variable indicating the type of housing and lastly weekly rent is a numeric variable of ratio type (Hinton, 2014). It represent the weekly rent paid for any type of dwelling. DATA SET 2 DESCRIPTION The data is primary data since it was collected directly from an online survey. It was collected by radomly surveying responses from international students in an online poll regarding the suburbs where they dwell. Only 30 responses were randomly picked. The sample has got four variables all of which are categorical. The variables are gender indicating whether the individual was male or female, origin indicating the country of origin, postal code and suburb indicating the place where the student resided. Despite the sample being sufficiently large for statistical analysis, there was a possibility of bias, this is because the online poll had no way to check who participated and it would be easy for a local to impersonate an international and participate in the poll (Freund, 2014). Section 2 Dwelling TypePart 1: Summary Statistics and Pie Chart FlatDwelling TypeFrequencyProportion HouseFlat504150% FlatHouse375138% FlatOthers3363% HouseTerrace6777% FlatUnknown1952% TerraceTotal10000100% Terrace House Flat Flat House FlatPart 2 FlatHypothesis Test House HouseData FlatNull Hypothesis p =0.5 FlatLevel of Significance0.05 FlatNumber of Items of Interest5041 HouseSample Size10000 Flat HouseIntermediate Calculations FlatSample Proportion0.5041 FlatStandard Error0.0050 UnknownZ Test Statistic0.8200 Flat UnknownUpper-Tail Test FlatUpper Critical Value1.6449 Flatp-Value0.2061 TerraceDo not reject the null hypothesis Flat House House Flat Flat Flat Flat Flat House Flat Terrace House Flat Terrace Flat Flat Flat House Flat Flat Flat Flat Flat Others Flat House Flat House Flat House Flat Terrace Flat Flat House House House Flat Flat House Flat House Flat Flat Flat House Terrace Flat Flat Others House Flat House House House House Flat Flat House House Flat Flat Flat Flat Flat Unknown Flat Flat House Flat House Flat Flat Terrace Flat House Flat Others House Flat House House Flat House Terrace House House Unknown Flat Flat Flat Flat Flat House House Flat Terrace Others Flat Unknown Terrace House Flat Flat Flat Flat Flat Flat House Flat Flat House Flat House Flat Flat House Flat Flat House House House House Flat Flat House House House Flat House Terrace Flat Flat Flat House House House Flat House Flat House Flat House Flat House House Flat House House Flat House House House House Flat House House House Flat Flat Flat House House Flat Others House Flat Unknown House Flat House Flat House House House House Terrace House House Flat Flat Flat Flat Flat House Flat House Flat Flat House House House Flat Flat House House Flat Flat Flat House Flat Flat Flat Terrace Flat Flat Flat House Flat Others Flat House House Others House Flat Others House House Flat Flat Flat Flat Flat Flat Flat House House Flat Flat House House Others Terrace House Flat House House House Flat House Flat House Flat Flat House House Unknown Flat Terrace House House House Flat House Terrace House Flat House House Flat Flat House House Flat House House Flat House House Flat Flat House Flat Flat House Flat House Flat Flat Flat House Flat Flat Flat House Flat House Flat Others Flat House House Others Flat House Unknown Flat Flat Flat Unknown Flat House Flat House Unknown House Flat House House Flat House Flat Flat House Flat House House Flat House Flat House Terrace House Flat Flat Flat Flat House Flat Flat House Flat Flat Flat House House Flat Flat House House Flat House House Flat Terrace Flat House Flat Flat Terrace House House House Flat Flat Terrace Flat House House House House Flat Terrace House Flat House House Flat House Flat Flat Terrace Flat House Flat Flat Flat Flat Flat Flat Flat Flat Terrace Flat House House House House House House Flat House House Flat House House Flat House House House Flat House House House House House House Terrace Flat Flat Flat Flat House House Flat Flat House Flat House House House Terrace Flat House House House Flat Flat House Flat Others House House Flat House
Answered Same DayMay 22, 2021

Answer To: BUS708 Statistics and Data Analysis Inferential Statistics Report Assignment 2 (Assessment 4) –...

Rajeswari answered on May 28 2021
145 Votes
58602 assignment
DATA SET 1 DESCRIPTION
The data is a secodary data since it is obtained from an already existing source. It was collected from Fair Trading Website (https://www.fairtrading.nsw.gov.au/about-fair-trading/data-and-statistics/rental-bond-data) and it is a subset of "Rental bond lodgement data 2019." The data has got four variables both ca
tegorical and numeric. Postal code and number of bedrooms are categorical varibles of the nominal type since values in them are labels with no significant value. They indicate the postal code of the suburb and number of bedrooms in a dwelling type respectively. Dwelling type is a categorical variable indicating the type of housing and lastly weekly rent is a numeric variable of ratio type (Hinton, 2014). It represent the weekly rent paid for any type of dwelling.
Inferential statistics
The following is just an example.
Sample size (n) = 10000
Sample proportion () = 0.5043
Standard Error (SE) = = 0.0050
Critical value = 1.96
95% Confidence Interval = 0.5043 (1.96)(0.0050)=0.5043 = (0.4945, 0.5141)
Since 0.50 i.e. 50% lie within this confidence interval, it confirms acceptance of null hypothesis that 50% people dwell in flats.
Next part is checking of hypothesis that mean weekly rent is 800
H0: \mu =800 vs Ha: mu >800
(Right tailed test)
The table one above shows the summary statistics for for the weekly rent distribution of flats in Sydney (Postal code 2000). The mean weekly rent is 838.07, the median is 800, the standard deviation is 291.45 and the sample size is 127. The distribution is visualized with the aid of a box plot (part 2) that also indicates the outliers as stars on the right hand side. The boxplot indicates that the distribution of rent in Sydney is slightly skwed to the right (Freund, 2014).
To determine whether the average weekly rent of flats in Sydney is more than $800, a hypothesis for mean has been conducted. The assumption was collected randomly and from central limit theorem, the assumption is that data is almost normally ditributed since the sample size is sufficienctly large (greater than 30. The hypothesis test is shown in part 3 above. The result indicate that at 95% confidence level, the p-value of the test is greater than the significance level (0.05) and hence there is sufficicent amount of evidence to prove that the average weekly rent of flats in sydeney is more than $800 (Fowler, 2009)
    Weekly Rent
     
     
    Mean
    838.07
    Standard Error
    25.86
    Median
    800
    Mode
    750
    Standard Deviation
    291.45
    Sample Variance
    84942.48
    Kurtosis
    11.20
    Skewness
    2.45
    Range
    2180
    Minimum
    420
    Maximum
    2600
    Sum
    106435
    Count
    127
Box plot is shown above.
    Part 3: Hypothesis Test
     
    
    
    
    
    
    
    
    
    Hypothesis Test
     
    
    
    
     
     
    
    
    
    Data
    
    
    
    Null Hypothesis =
    800
    
    
    
    Level of Significance
    0.05
    
    
    
    Sample Size
    127
    
    
    
    Sample Mean
    838.07
    
    
    
    Sample Standard Deviation
    291.45
    
    
    
     
     
    
    
    
    Intermediate Calculations
    
    
    
    Standard Error of the Mean
    25.8620
    
    
    
    Degrees of Freedom
    126
    
    
    
    t Test Statistic
    1.4721
    
    
    
     
     
    
    
    
    Upper-Tail Test
     
    
    Calculations Area
    Upper Critical Value
    1.6570
    
    For one-tailed tests:
    p-Value
    0.0717
    
    T.DIST.RT value
    0.0717
    Do not reject the null hypothesis
     
    
    1-T.DIST.RT value
    0.9283
So we get do not reject H0 since p value > 0.05 our significance level.
Section 4, is to check whether means of 2000, 2017… are...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here