all we need just to make sure u read the instructions, like excel and word. Size and font, references.
And should be 2 report, one word with everything and excel report for whatever u did on excel
Microsoft Word - BUS708-201801 Assignment Description draft 3 BUS708 Statistics and Data Analysis Statistical Modelling Assignment Trimester 1, 2018 1 OVERVIEW OF THE ASSIGNMENT This assignment will test your skill to collect and analyse data to answer a specific business problem. It will also test your understanding and skill to use statistical methods to make inferences about business data and solve business problems, including constructing hypotheses, test them and interpret the findings. Gender gap is the difference between the salary of men and the salary of women. The reasons of gender gap are not only because of discrimination in hiring, but also includes the different industries that women and men are working, as well as many other reasons. By using an edited subset of the sample file from the Australian Taxation Office (ATO), your task is to summarise and analyse several aspects of the salary and occupation of the different gender. In addition, you are also asked to suggest one relevant research question and then collect and analyse a dataset that will answer your research question. 2 TASK DESCRIPTION: WRITTEN REPORT There are two datasets involved in this assignment: Dataset 1 and Dataset 2, detailed below. Dataset 1: You will receive an email that contains a dataset that is specifically allocated to you. This dataset is a subset of 2013-2014 individual sample file, provided by the ATO and has been edited to only include a subset of the cases and variables. The original dataset can be obtained from https://data.gov.au/dataset/taxation-statistics-individual-sample-files, and it is under the license of Creative Commons Attribution 3.0 Australia. Data dictionary of the edited dataset is given in the following table. Variable Description Values Gender Gender (sex) Female or Male Occ_code Salary/wage occupation code 0 = Occupation not listed/ Occupation not specified 1 = Managers 2 = Professionals 3 = Technicians and Trades Workers 4 = Community and Personal Service Workers 5 = Clerical and Administrative Workers 6 = Sales workers 7 = Machinery operators and drivers 8 = Labourers 9 = Consultants, apprentices and type not specified or not listed Sw_amt Salary/wage amount All numeric Gift_amt Gifts or donation deductions All numeric Dataset 2: Collect data (e.g. via a survey) that will answer your research question. There is no requirement about the number of variables, sampling methods and sample size, but you need to justify your approaches in Section 1 (see below). Both datasets should be saved in an Excel file (one file, separate worksheets). All data processing should be performed in Excel or Statkey (http://www.lock5stat.com/StatKey). Prepare a report in a document file (.doc or .docx) which includes all relevant tables and figures, using the following structure: 1. Section 1: Introduction a. Give a brief introduction about the assignment, including your research question. Include a short summary of a related article with a proper citation. b. Dataset 1: Give a short description about this dataset. Is this primary or secondary data? What types of variable(s) is involved? Display the first 5 cases of your dataset. c. Dataset 2: Explain how you collect the data and discuss its limitation (e.g. whether your sample is biased). Is this primary or secondary data? What type of variable(s) is/are involved? You don’t need to display your data in this section. 2. Section 2: Descriptive Statistics Use Dataset 1 a. Using suitable graphical display, describe the relationship between the variables Gender and Occ_code for Dataset 1. Make sure your graph shows the distribution of Gender for each Occ_code. b. Using suitable graphical display, describe the relationship between the variables Gender and Sw_amt. c. Using suitable numerical summary, describe the relationship between the variables Gender and Sw_amt. d. Using suitable graphical display, describe the relationship between the variables Sw_amt and Gift_amt. 3. Section 3: Inferential Statistics Use Dataset 1 a. List top 4 occupation based on median salary and find the proportion of the gender of those top 4 occupation. b. Perform a suitable hypothesis test at a 5% level of significance to test whether the proportion of machinery operators and drivers who are male is more than 80%. c. Perform a suitable hypothesis test at a 5% level of significance to test whether there is a difference in salary amount between gender. Use Dataset 2 d. Perform a suitable statistical analysis on dataset 2 (the one you collected) that will answer your research question. 4. Section 4: Discussion & Conclusion a. What can you conclude from your findings in the previous sections? b. Give a suggestion for further research 3 TASK DESCRIPTION: PRESENTATION/INTERVIEW A presentation/interview for the assignment is scheduled on Week 11, in your allocated tutorial. You do NOT need to prepare a presentation material (e.g. power-point slides), instead, you will be asked to demonstrate and/or explain how you summarised the data and how you performed the analysis. You may be asked to reproduce what you have made in your written report (e.g. generate a chart or numerical summary using Excel or Statkey). 4 SUBMISSION REQUIREMENT Deadline to submit written report: Week 10 Wednesday (23 May 2018), 5pm You need to submit 2 files to Turnitin: 1. Main report, in a Microsoft Word document file (this is the file that will be marked, it should contain all necessary tables and figures) 2. Dataset, in a Microsoft Excel file (this is just a supporting file) Main report (word document): 1. Size: A4 2. Use Assignment Cover Page (download from Moodle) with your details and signature 3. Single space 4. Font: Calibri, 11pt Dataset (excel document): 1. Dataset 1 in Sheet 1 2. Dataset 2 in Sheet 2 3. Data processing for each section in other sheets (rename the sheet appropriately) 5 DEDUCTION, LATE SUBMISSION AND EXTENSION Late submission penalty: - 5% of the total available marks per calendar day unless an extension is approved. For extension application procedure, please refer to Section 3.3 of the Subject Outline. 6 PLAGIARISM Please read Section 3.4 Plagiarism and Referencing, from the Subject Outline. Below is part of the statement: “Students plagiarising run the risk of severe penalties ranging from a reduction through to 0 marks for a first offence for a single assessment task, to exclusion from KOI in the most serious repeat cases. Exclusion has serious visa implications.” “Authorship is also an issue under Plagiarism – KOI expects students to submit their own original work in both assessment and exams, or the original work of their group in the case of a group project. All students agree to a statement of authorship when submitting assessments online via Moodle, stating that the work submitted is their own original work. The following are examples of academic misconduct and can attract severe penalties: Handing in work created by someone else (without acknowledgement), whether copied from another student, written by someone else, or from any published or electronic source, is fraud, and falls under the general Plagiarism guidelines. Students who willingly allow another student to copy their work in any assessment may be considered to assisting in copying/cheating, and similar penalties may be applied. ” BUS708 2018 T1 Ass Marking Scheme.xlsx Section Mark Criteria Question Section 1: Introduction 1.a. 5 Clear and concise intro: 2 Research questions: 1 Proper citation: 1 A summary of a related article: 1 a. Give a brief introduction about the assignment, including your research question. Include a short summary of a related article with a proper citation. 1.b. 5 Clear description: 2 Primary/secondary: 1 Types of variables: 1 Display first 5 cases: 1 b. Dataset 1: Give a short description about this dataset. Is this primary or secondary data? What types of variable(s) is involved? Display the first 5 cases of your dataset. 1.c. 5 Clear data collection description: 2 Limitation: 1 Primary/secondary: 1 Types of variables: 1 c. Dataset 2: Explain how you collect the data and discuss its limitation (e.g. whether your sample is biased). Is this primary or secondary data? What type of variable(s) is/are involved? You don’t need to display your data in this section. Section 2: Descriptive Statistics 2.a. 5 Correct choice of graph: 1 Correct graph based on data: 1 Title/label/legends: 1 Comments: 2 a. Using suitable graphical display, describe the relationship between the variables Gender and Occ_code for Dataset 1. Make sure your graph shows the distribution of Gender for each Occ_code. 2.b. 5 Correct choice of graph: 1 Correct graph based on data: 1 Title/label/legends: 1 Comments: 2 b. Using suitable graphical display, describe the relationship between the variables Gender and Sw_amt for Dataset 1. 2.c. 5 Correct centre (mean/median): 2 Correct spread (stddev/IQR): 2 Comments: 1 c. Using suitable numerical summary, describe the relationship between the variables Gender and Sw_amt for Dataset 1. 2.d. 5 Correct choice of graph: 1 Correct graph based on data: 1 Title/label/legends: 1 Comments: 2 d. Using suitable graphical display, describe the relationship between the variables Sw_amt and Gift_amt. Section 3: Inferential Statistics 3.a. 5 Correct list: 1 Correct proportions: 2 Comments: 2 a. List top 4 occupation based on median salary and find the proportion of the gender of those top 4 occupation. 3.b. 5 Correct hypothesis: 1 Correct test-stat: 1 Correct p-value: 1 Correct conclusion: 2 b. Perform a suitable hypothesis test at a 5% level of significance to test whether the proportion of machinery operators and drivers who are male is more than 80%. 3.c. 5 Correct hypothesis: 1 Correct test-stat: 1 Correct p-value: 1 Correct conclusion: 2 c. Perform a suitable hypothesis test at a 5% level of significance to test whether there is a difference in salary amount between gender. 3.d 5 Suitable choice of statistical method: 1 Correct step of statistical method: 2 Comments: 2 d. Perform a suitable statistical analysis