Microsoft Word - FIN10002_ Sem 2 2020_Major assignment.docx FIN10002 S2 2020 Major Assignment 1 FIN10002: Financial Statistics Semester 2, 2020 Assessment 2: Report Weighting: 25% Due date: 11.59pm...

1 answer below »

Attached are the requirements for the major assignment.


There are two files; one contains the task to be performed and the other is the data file to be used.


Please read the instructions carefully.




Microsoft Word - FIN10002_ Sem 2 2020_Major assignment.docx FIN10002 S2 2020 Major Assignment 1 FIN10002: Financial Statistics Semester 2, 2020 Assessment 2: Report Weighting: 25% Due date: 11.59pm Friday, 16th October 2020 (AEST) Word limit: 1500 (+/- 10%; not including Appendix or Executive Summary) Assessment overview For this assessment, you need to produce a report by generating responses to six tasks presented below. For each of these sections, you will need to use Excel to generate statistical output (statistical analyses and graphs). The report should be presented in the form of a business report to a senior manager who cannot be assumed to have any knowledge of statistical methods. Microsoft Word and Excel should be used to complete this assessment. Your statistical calculations should be carried out using Excel only. For this part of the assignment, you will submit the Excel file and a Word version of the report as soft copies via the submission folder in Canvas. Assessment details You will need to download the Excel dataset ‘Major Assignment data Taxation Sem 2 2020.xlsx' from Canvas. The data set contains taxation data for a location in Australia and is supplied by the ATO. The data is based upon the taxations returns of the individuals in a geographical area. There are 4600 lodgments for a particular year in this data set and eight variables as follows: 1. Id No: is the identifier number set by the ATO in no order 2. Gender: 0 = male, 1 = female 3. Age- range: age of person lodging return in an interval of 5 years. See table 1 in the file. 4. Occ_Code: Occupation code – see table 2 in file. 5. Lodgment_Method: Return ledged either by agent or self 6. Tot_inc_amt: Total income or loss – salary/wages or another source ($) 7. Tot_ded_amt: Total allowable deductions ($) 8. Taxable_Income: Taxable income ($) FIN10002 S2 2020 Major Assignment 2 You will use this data set to generate responses to the following six tasks. 1. Select a random sample Select a random sample of size 90 from the given 4600 lodgments in the Taxation lodgment data set. You will use this sample data to complete tasks 2 to 6. 2. Descriptive statistics Use data summary methods to describe the returns in your sample using seven variables – items 2 to 8 above. (Note: Do NOT do Id No) Use an appropriate graphical and summary statistical technique, chosen according to the type of variable (note that less appropriate/inappropriate techniques will receive fewer/no marks). Choose your techniques from: Tabular Techniques: frequency tables and grouped frequency tables Summary Statistics: mode, median, mean, standard deviation, range, coefficient of variation and interquartile range Graphical Techniques: pie chart, bar graph, histogram, frequency polygon. (See topics 1 and 2) Do not draw an ogive curve, stem plot, or a box plot in this assignment and do not draw 3-D graphs. NOTE: for each variable there should be one graph(chart) and at least one table. This should be presented in the body of the report. 1. For a nominal or an ordinal variable draw a graph and present a frequency table in percentages. 2. For a ratio or an interval, variable draw a graph and a summary statistics table, including summary statistics appropriate to the type of distribution only. 3. Try to use variation in drawing graphs e.g. pie chart/bar chart or histogram/polygon. 4. Do not draw two different graphs for the same variable. You can draw the same type of graph for two variables. 5. Do not include any information that you will not include in your discussion such as kurtosis. 6. Display and describe one variable at a time. 3. Confidence intervals Estimate the following quantities, using 95% confidence intervals. Explain the meaning of your confidence intervals. 1. The average taxable income for Technicians and Trades Workers only 2. The average Total allowable deductions for all sample lodgments. Compare both intervals with their respective true means by calculating the actual population mean for the full 4600 lodgments, and comparing the true population mean to the sample mean and confidence interval (note: it FIN10002 S2 2020 Major Assignment 3 is not usual to do this, so you are asked to do this for the purpose of this assignment). Your confidence interval should start with ‘We are 95% confident that…”. This section should take half a page or less. NB: Please make sure you provide enough information in the appendix for your confidence interval calculations to be replicated, so they can be checked. 4. Hypothesis testing 1. It is often felt that tax agent lodged returns claim more total allowable deductions on average than self-prepared lodged returns. Investigate this contention by carrying out an appropriate hypothesis test. 2. It is often felt that the average taxable income, differs for males and females. Use gender for this test. Only report a non-technical explanation of your methodology and your findings in the main section of the report. The computations and output should be placed in an appendix, including the test statistic, p-value and degrees of freedom. This should take half a page or less. 5. Correlation and regression In this section, you will investigate the relationship between the Total income amount and the age range. Using these two variables (total income amount (Tot_inc_amt) and age range) develop a regression model to predict average total income amount from the age of the lodger. Make sure that you undertake a full regression analysis, with appropriate discussion and include: 1. a scattergram and a brief discussion 2. an estimate of the linear regression model 3. the coefficients of correlation and determination 4. a test of the hypothesis that there is no linear relationship between age range and total income amount for each lodgment. Ensure your scattergram includes a line of best fit. Also, make sure you describe the relationship between the variables using R and R-square and interpret the slope, coefficients and the results of the hypothesis test. Use a significance level of 0.05. 6. Conclusion Provide a brief, concise summary of all your findings and briefly mention any limitations in your findings. Make sure you do not give tables or graphs here. FIN10002 S2 2020 Major Assignment 4 Presentation The report should be presented in the form of a business report to a senior manager who cannot be assumed to have any knowledge of statistical methods. Make your report informative but concise and use a non-technical style. Do not just quote statistics or analysis results but explain what they mean. In general, do not include in the report formulae, calculations, definitions of statistical terms or discussions on how graphs are constructed. Where appropriate these may be included in the appendices. It is important that the values which have been calculated are correctly analysed, discussed and interpreted, and that a written description of the main features of the tables and graphs that have been constructed is included. The emphasis in this assignment is on interpretation and analysis, not just the computation of statistics and construction of graphs. It will be assumed that all computations have been correctly performed and that graphs have been properly constructed. Nevertheless, marks will be deducted if these are inaccurate or incorrect. The presentation is an important feature of a business report. The guide to the presentation that follows gives a general outline to report writing. Executive Summary 1. Report only the highlights of the findings. 2. Entice an Executive to read on. 3. Essentially a lively summary of the main conclusions. 4. No longer than one page; this is not counted in the word count and must be on a separate page from the rest of the report. Introduction 1. State the purpose of the report i.e. what you will discuss in the report 2. Outline the contents of the Report 3. Provide a brief description of the methodology 4. Describe the source of the data and state its location in the report. 5. This should contain information about what we expect to read in the project. This should take about half a page. Analysis 1. Contains a thorough yet non-technical description of all the findings (graphs and tables will be included only where they help this discussion). 2. Details the results that were highlighted in the Executive Summary 3. Do not include any calculations here but include appropriate graphs and tables which are needed to support your discussions. FIN10002 S2 2020 Major Assignment 5 Conclusion 1. Report the findings and results of your work. 2. Essentially an expansion of the executive summary written from the point of view that the Executive Summary has not been read. 3. End with a discussion of the limitations of your analysis (e.g. reference to sample size if small, or comment on the data if it is old). Appendices 1. Must be referred to in the main body of the report. 2. Must contain your selection of random numbers and related random data. 3. Include the sample raw data, charts and tables that are not essential, but support the ANALYSIS section. 4. Include your EXCEL output for descriptive statistics, confidence intervals, hypothesis testing and regression. 5. Include any other relevant calculations. Please make sure the information in the appendix is enough for all calculations to be replicated, so they can be checked e.g. if you include the output for your confidence intervals, please show how this output was used to calculate the confidence intervals. Graphs must be in the Main body along with relevant tables and discussion. Graphs kept only in the appendices will not score any marks for graphs. Keep the appendices to a moderate size. Marks are
Answered Same DayOct 03, 2021FIN10002Swinburne University of Technology

Answer To: Microsoft Word - FIN10002_ Sem 2 2020_Major assignment.docx FIN10002 S2 2020 Major Assignment 1...

Biswajit answered on Oct 05 2021
161 Votes
Report
Executive report :
We did a random sampling of 4600 lodgments of tax data & selected 90 random data points.On the basis of analysis of these 90 random sample points,we have drawn conclusion about the entire population of data.The dataset has 8 different variables namely gender,age_range,occ_code,lodgment_method,total_inc_amount,total_ded_amount & taxable income.Out of these,gender,age_range,occ_code & lodgment_method are categorical variables & rest all are continuous variables.The sample contains 53 males (59%) & 37 females (41%).54 % of the people in the sample data fall in the age
bracket of 25-39 years age.60% of people in the sample data have occupation either as not listed,managers or professionals.79 % people filed returns through tax agents while 21% filed self.with 95% confidence,the average taxable income of technician & traders falls between Rs 32031 & Rs 115603.Also with 95% confidence,average total deduction falls between Rs 1492 & Rs 3262.There is no statistical difference in total deduction amount between self prepared returns & through tax agents.But the total taxable income of males is statistically higher for males than that of females.There is no statistically significant correlation between total income & age range (coefficient of correlation 0.19 ).
Introduction :
The purpose of the report is to analyse the tax returns data of certain year for a location in Australia & share the findings & insights of the analysis with the top management for better data driven decision making.The dataset has been supplied by ATO & is available in the canvas.The dataset consists of around 4600 tax return files but we have selected a random sample of 90 returns for analysis through sampling tool of data anlaysis function of Excel.After selecting 90 IDs randomly,we fetched all related to each ID by using Vlookup function of Excel.The details of the sample is there is sample sheet in the excel file.The report contains both descriptive & inferential statistics about the entire tax return database.The data set consists of 8 variables out of which 4 are categorical variables & rest 3 are continuous variables leaving aside the identity number.The different variables are gender,age range,occupation code,Lodgement method,total income,total deduction amount & taxable amount.we will be doing the following anlysis subsequently in the report
· Descriptive statistics of all the variables
· Visual analytics of different variables
· Confidence interval determination for certain measures
· Hypothesis testing
· Correlation & regression
Analysis :
· Descriptive statistics :
    Gender
    Count of Gender
    Percentage
    0
    53
    58.89%
    1
    37
    41.11%
    Grand Total
    90
    100.00%
As shown in table above,the sample consists of 59 % male & 41 % female.
The above is a graphical depiction of the genderwise distribution of male & female tax payers who are 53 & 37 in numbers.
    Age range
    Count of age_range
    Percentage
    0
    3
    3.33%
    1
    1
    1.11%
    2
    3
    3.33%
    3
    7
    7.78%
    4
    7
    7.78%
    5
    7
    7.78%
    6
    6
    6.67%
    7
    15
    16.67%
    8
    17
    18.89%
    9
    16
    17.78%
    10
    7
    7.78%
    11
    1
    1.11%
    Grand Total
    90
    100.00%
The table for age range shows that 30-34 age group contributes most to the sample of tax payers chosen followed by 25-29 years old.The age group 25-39 constitutes around 54 % of the total tax payers.
The barplot is a graphical depiction of the same information
    Occupation code
    Count of Occ_code
    Count of Occ_code2
    0
    16
    17.78%
    1
    16
    17.78%
    2
    22
    24.44%
    3
    9
    10.00%
    4
    6
    6.67%
    5
    10
    11.11%
    7
    1
    1.11%
    8
    6
    6.67%
    9
    4
    4.44%
    Grand Total
    90
    100.00%
As shown in above table,not specified,managers,professionals & technicians & trade workers constitute 70% of the total tax payers in the sample.The below barplot is a graphical depiction of the same information.
Around 80% people filed tax returns through agents and around 20% filed self.Below is the graphical depiction of the same information.
    lodgment method
    Count of Lodgment_method
    Count of Lodgment_method2
    A
    71
    78.89%
    S
    19
    21.11%
    Grand Total
    90
    100.00%
The below table contains different statistics about the continuous variables such Total income,total deduction & total taxable income for the sample.All the distributions are right skewed.
     
    Tot_inc_amt
    Tot_ded_amt
    Taxable_Income
     
     
     
     
    Mean
    66535
    2377.222222
    64288.24444
    Standard Error
    6014
    451.4413371
    5927.077166
    Median
    51510
    707.5
    48626.5
    Mode
    #N/A
    0
    #N/A
    Standard Deviation
    57052
    4282.748566
    56229.19114
    Sample Variance
    3254921258
    18341935.28
    3161721936
    Kurtosis
    1
    14.41860395
    0.777452693
    Skewness
    1
    3.395114019
    1.191840033
    Range
    237759
    27477
    234100
    Minimum
    -3308
    0
    0
    Maximum
    234451
    27477
    234100
    Sum
    5988112
    213950
    5785942
    Count
    90
    90
    90
    Largest(1)
    234451
    27477
    234100
    Smallest(1)
    -3308
    0
    0
    Confidence Level(95.0%)
    11949
    897.0043209
    11776.97608
    Gender wise
    Average of Tot_inc_amt
    Average of Tot_ded_amt
    Average of Taxable_Income
    0
    78138
    3091
    75269
    1
    49913
    1354
    48559
    Grand Total
    66535
    2377
    64288
The above table shows that on average male taxpayers have higher total income,higher total deduction & also higher total taxable income than that of female taxpayers.
    Age range
    Average of Tot_inc_amt
    Average of Tot_ded_amt
    Average of Taxable_Income
    0
    43199
    510
    42690
    1
    45780
    0
    45780
    2
    99477
    2950
    96527
    3
    67642
    2494
    65147
    4
    112364
    2163
    110201
    5
    90014
    2590
    89300
    6
    63407
    1777
    61631
    7
    74869
    3261
    71518
    8
    56872
    4304
    52569
    9
    59154
    1081
    58073
    10
    27597
    360
    27237
    11
    14210
    282
    13928
    Grand Total
    66535
    2377
    64288
The above table shows that the age range 4 has highest average total income,age range 8 has highest total...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here