ITECH1103- Big Data and Analytics
Group Assignment – Semester 2, 2018
Worth – 30%
ANALYTIC REPORT (20%- Due Week 11 Sunday 11:55pm) and PRESENTATION (10% - Due Week 10 in Tutorial Time)
Analytic Report:
Learning Outcomes Assessed: A3, K3, K6, and S2
Purpose:
The purpose of this task is to give students practical experience in working in teams to write a data analytics report that provides useful insights, patterns and trends from the chosen/given dataset. This activity gives students the opportunity to show innovation and creativity in applying Watson Analytics and in designing useful visualization and predictive solutions for various analytics problems.
Group Presentation: Week 10 (Scheduled Laboratory)
Learning Outcomes Assessed: K4, A1, A2, V1, V2
Purpose: The purpose of the oral presentation is to give students an opportunity to present the results of their data analysis and to share this knowledge while practicing their verbal communication skills.
Project Details:
Your task for this analytical project is to use an analytical tool (i.e., Watson Analytics) to explore, analyze and visualize one of the two given datasets. Your tutor will assign you the dataset. This dataset reflects reported incidents of crime (with the exception of murders where data exists for each victim) that occurred in the City of Chicago from 2012. Data is extracted from the Chicago Police Department's CLEAR (Citizen Law Enforcement Analysis and Reporting) system. In order to protect the privacy of crime victims, addresses are shown at the block level only and specific locations are not identified. Your intended audience is a law enforcement agency’s middle and top management. Your primary goal is to provide different and interesting insights in light of the 20 questions listed below. The datasets can be downloaded from the following links:
Data Sets:
Dataset 1 - https://data.world/mchadhar/chicagocrime-dataset
Dataset 2 - https://data.world/mchadhar/dataset-2-chicago-crime
Data Dictionary:
ID - Unique identifier for the record.
Case Number - The Chicago Police Department RD Number (Records Division Number), which is unique to the incident.
Date - Date when the incident occurred. This is sometimes a best estimate.
Block - The partially redacted address where the incident occurred, placing it on the same block as the actual address.
IUCR - The Illinois Uniform Crime Reporting code. This is directly linked to the Primary Type and Description. See the list of IUCR codes at https://data.cityofchicago.org/d/c7ck-438e.
Primary Type - The primary description of the IUCR code.
Description - The secondary description of the IUCR code, a subcategory of the primary description.
Location Description - Description of the location where the incident occurred.
Arrest - Indicates whether an arrest was made.
Domestic - Indicates whether the incident was domestic-related as defined by the Illinois Domestic Violence Act.
Beat - Indicates the beat where the incident occurred. A beat is the smallest police geographic area – each beat has a dedicated police beat car. Three to five beats make up a police sector, and three sectors make up a police district. The Chicago Police Department has 22 police districts. See the beats at https://data.cityofchicago.org/d/aerh-rz74.
District - Indicates the police district where the incident occurred. See the districts at https://data.cityofchicago.org/d/fthy-xz3r.
Ward - The ward (City Council district) where the incident occurred. See the wards at https://data.cityofchicago.org/d/sp34-6z76.
Community Area - Indicates the community area where the incident occurred. Chicago has 77 community areas. See the community areas at https://data.cityofchicago.org/d/cauq-8yn6.
FBI Code - Indicates the crime classification as outlined in the FBI's National Incident-Based Reporting System (NIBRS). See the Chicago Police Department listing of these classifications at http://gis.chicagopolice.org/clearmap_crime_sums/crime_types.html.
X Coordinate - The x coordinate of the location where the incident occurred in State Plane Illinois East NAD 1983 projection. This location is shifted from the actual location for partial redaction but falls on the same block.
Y Coordinate - The y coordinate of the location where the incident occurred in State Plane Illinois East NAD 1983 projection. This location is shifted from the actual location for partial redaction but falls on the same block.
Year - Year the incident occurred.
Month - Month the incident occurred.
Day - Day the incident occurred.
Updated On - Date and time the record was last updated.
Latitude - The latitude of the location where the incident occurred. This location is shifted from the actual location for partial redaction but falls on the same block.
Longitude - The longitude of the location where the incident occurred. This location is shifted from the actual location for partial redaction but falls on the same block.
Location - The location where the incident occurred in a format that allows for creation of maps and other geographic operations on this data portal. This location is shifted from the actual location for partial redaction but falls on the same block.
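As an optional cross-check outside Watson Analytics, the data dictionary above can be mirrored as a typed record. The following is a minimal Python sketch and not part of the assignment requirements: the `Incident` class, the `load_incidents` and `parse_bool` helpers, the sample rows, and the timestamp layout in `DATE_FORMAT` are all assumptions made for illustration, and only a subset of the columns is modelled.

```python
import csv
import io
from dataclasses import dataclass
from datetime import datetime

# Hypothetical sample rows; every value here is invented for illustration.
SAMPLE_CSV = """ID,Case Number,Date,Primary Type,Location Description,Arrest,Domestic,District,Year
1,HX000001,01/05/2014 10:30:00 PM,THEFT,STREET,false,false,8,2014
2,HX000002,03/15/2014 08:00:00 AM,BATTERY,APARTMENT,true,true,11,2014
"""

# Assumed timestamp layout; adjust it if the downloaded file differs.
DATE_FORMAT = "%m/%d/%Y %I:%M:%S %p"


@dataclass
class Incident:
    """A subset of the data dictionary fields (hypothetical record type)."""
    id: int
    case_number: str
    date: datetime
    primary_type: str
    location_description: str
    arrest: bool
    domestic: bool
    district: str
    year: int


def parse_bool(text: str) -> bool:
    """Treat 'true' (any casing) as True and everything else as False."""
    return text.strip().lower() == "true"


def load_incidents(handle) -> list:
    """Read CSV rows from an open text handle into Incident records."""
    incidents = []
    for row in csv.DictReader(handle):
        incidents.append(Incident(
            id=int(row["ID"]),
            case_number=row["Case Number"],
            date=datetime.strptime(row["Date"], DATE_FORMAT),
            primary_type=row["Primary Type"],
            location_description=row["Location Description"],
            arrest=parse_bool(row["Arrest"]),
            domestic=parse_bool(row["Domestic"]),
            district=row["District"],  # kept as text: it is a label, not a quantity
            year=int(row["Year"]),
        ))
    return incidents


incidents = load_incidents(io.StringIO(SAMPLE_CSV))
print(len(incidents), incidents[0].primary_type, incidents[1].domestic)
```

To try this on a downloaded file, the `io.StringIO(SAMPLE_CSV)` argument could be swapped for an opened file handle, provided the column names and timestamp layout match the assumptions above.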
You are expected to present the data findings in visual form (i.e., charts and graphs). This is a group assignment. You will complete it with your team (max 3 members enrolled in the same laboratory). Each team member is expected to contribute equally to the project. Each team will turn in one joint document and give a joint presentation in the timetabled laboratory class in Week 10. In addition, each individual team member will write a short reflection as part of the report. You will receive feedback on the draft about presentation choices, content, analysis, and style.
The Questions
Your job is to examine one of the available datasets and present it in a set of informative graphs and text by answering the following questions.
Guided Questions for Dataset 1
1. What is the total number of reported crimes?
2. How many different types of reported crimes are there? (Primary type)
3. Provide a list of the top 21 location descriptions with respect to crimes.
4. Provide a list of the bottom 10 location descriptions with respect to crimes.
5. What are the three most common primary types?
6. What are the three least common primary types?
7. How many years of reported crimes are in the data file?
8. How many reported crimes were logged every year in December?
9. Which year generated the most reported crimes in Chicago?
10. Which month generated the most reported crimes in Chicago?
11. How many reported crimes are there by whether an arrest was made? (Arrest)
12. How many districts are in this dataset?
13. What are the top 3 districts in terms of reported crimes?
14. What are the bottom 3 districts in terms of reported crimes?
15. Which primary type had the most reported crimes in district “8” in 2014?
16. How many domestic reported crimes were made in Chicago?
17. How many domestic reported crimes were made from 2012 to 2014?
18. Which day of the week is the busiest in terms of committed crimes?
19. Which location description has the most crimes reported on weekends?
20. Which location description has the fewest crimes reported on weekends?
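Most of the guided questions above reduce to counting records per category and ranking the counts. As an optional way to sanity-check figures obtained in Watson Analytics, the sketch below shows that pattern in plain Python. The records, the helper names (`top_n`, `least_n`, `busiest_weekday`, `top_type_in`) and the timestamp layout are hypothetical and chosen only for illustration.

```python
from collections import Counter
from datetime import datetime

# Assumed timestamp layout; adjust it if the downloaded file differs.
DATE_FORMAT = "%m/%d/%Y %I:%M:%S %p"
WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
            "Friday", "Saturday", "Sunday"]

# Hypothetical records; every value here is invented for illustration.
records = [
    {"Primary Type": "THEFT", "Date": "01/04/2014 10:30:00 PM", "Arrest": "false", "District": "8"},
    {"Primary Type": "THEFT", "Date": "01/11/2014 09:00:00 AM", "Arrest": "true", "District": "8"},
    {"Primary Type": "BATTERY", "Date": "01/06/2014 01:15:00 PM", "Arrest": "true", "District": "11"},
    {"Primary Type": "NARCOTICS", "Date": "12/25/2013 11:45:00 PM", "Arrest": "true", "District": "8"},
]


def when(record):
    """Parse the record's Date field into a datetime."""
    return datetime.strptime(record["Date"], DATE_FORMAT)


def top_n(rows, column, n):
    """Return the n most frequent values of a column with their counts."""
    return Counter(r[column] for r in rows).most_common(n)


def least_n(rows, column, n):
    """Return the n least frequent values, ties broken alphabetically."""
    counts = Counter(r[column] for r in rows)
    return sorted(counts.items(), key=lambda item: (item[1], item[0]))[:n]


def busiest_weekday(rows):
    """Return the weekday name with the most records."""
    days = Counter(when(r).weekday() for r in rows)
    return WEEKDAYS[days.most_common(1)[0][0]]


def top_type_in(rows, district, year):
    """Return the most common primary type for one district and year."""
    subset = [r for r in rows
              if r["District"] == district and when(r).year == year]
    return top_n(subset, "Primary Type", 1)[0][0]


per_year = Counter(when(r).year for r in records)
arrests = sum(1 for r in records if r["Arrest"].lower() == "true")

print(top_n(records, "Primary Type", 1))
print(least_n(records, "Primary Type", 2))
print(per_year, arrests, busiest_weekday(records), top_type_in(records, "8", 2014))
```

Breaking ties alphabetically in `least_n` is a design choice made here so the ranking is reproducible; the report should state whichever tie rule is actually used.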
Guided Questions for Dataset 2
1. What is the total number of reported crimes?
2. How many different types of reported crimes are there? (primary type)
3. How many location descriptions of reported crimes are there? (location description)
4. What are the three most common primary types of reported crimes?
5. What are the three least common primary types?
6. How many years of reported crimes are in the data file?
7. How many reported crimes were logged in the last week of the dataset? (Consider 11 January 2017 to 18 January 2017.)
8. Which year generated the most reported crimes in Chicago?
9. Which month generated the most reported crimes in Chicago?
10. How many reported crimes are there by whether an arrest was made? (arrest)
11. How many districts are in this dataset?
12. Which district in Chicago reported the most crimes in the last year? (last year of dataset)
13. Which district in Chicago reported the fewest crimes in the last year? (last year of dataset)
14. Which primary type had the most reported crimes in district “8” last year?
15. How many domestic reported crimes were made in Chicago?
16. How many domestic reported crimes were made over the past month? (last year of dataset)
17. Which location description has the most crimes reported in Chicago?
18. Which location description has the fewest crimes reported in Chicago?
19. Which location description has the most crimes reported on weekends?
20. Which location description has the fewest crimes reported on weekends?
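The date-based questions above (the last week of the dataset, and weekend counts by location description) depend on how the date window and the weekend are defined. The sketch below is one possible reading, assuming the window from 11 to 18 January 2017 is inclusive at both ends and that a weekend means Saturday and Sunday. The rows and the helper names `in_window` and `is_weekend` are hypothetical.

```python
from collections import Counter
from datetime import date

# Assumed inclusive bounds for the "last week" named in the question.
WINDOW_START = date(2017, 1, 11)
WINDOW_END = date(2017, 1, 18)

# Hypothetical (incident date, location description) rows for illustration.
rows = [
    (date(2017, 1, 14), "STREET"),     # Saturday
    (date(2017, 1, 15), "STREET"),     # Sunday
    (date(2017, 1, 15), "RESIDENCE"),  # Sunday
    (date(2017, 1, 16), "APARTMENT"),  # Monday
    (date(2017, 1, 9), "STREET"),      # Monday, before the window
]


def in_window(day):
    """True when the day falls inside the inclusive last-week window."""
    return WINDOW_START <= day <= WINDOW_END


def is_weekend(day):
    """True for Saturday (weekday 5) and Sunday (weekday 6)."""
    return day.weekday() >= 5


last_week_total = sum(1 for day, _ in rows if in_window(day))

weekend_counts = Counter(loc for day, loc in rows if is_weekend(day))
# Sort by count, then name, so ties are resolved the same way every run.
ranked = sorted(weekend_counts.items(), key=lambda item: (item[1], item[0]))
fewest_weekend, most_weekend = ranked[0][0], ranked[-1][0]

print(last_week_total, most_weekend, fewest_weekend)
```

Note that a location with no weekend incidents never appears in `weekend_counts`, so the "fewest" answer here only ranks locations with at least one weekend incident; that assumption is worth stating in the report.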
Task 1 – Background information
Write a description of the selected dataset and project, and its importance for the firm. Information must be appropriately referenced. [1 Page]
Task 2 – Reporting / Dashboards
For your project, perform the relevant data analysis tasks by answering the above questions, and identify the visualizations and dashboards you need to develop for the operational manager of the indicated firm. [2-3 Pages]
Task 3 – Advanced Insights:
In addition to answering the guided questions, you are expected to provide at least five (5) insights into the data. These insights will be judged in terms of quality and complexity.
Task 4 – Research
Justify why these BI reporting solutions/dashboards were chosen in Task 2 (Reporting / Dashboards) and why those dataset attributes are presented and laid out in the fashion you proposed (feel free to include all other relevant justifications).
Note: To ensure that you discuss this task properly, you must include visual samples of the reports you produce (i.e., the screenshots of the BI report/dashboard must be presented and explained in the written report; use ‘Snipping tool’), and also include any assumptions that you may have made about the analysis in your Task 2 (i.e., the report to the operational team of the company). [1-2 Pages]
Task 5 – Recommendations for POLICE CHIEF
The POLICE CHIEF would like to improve the operations. Based on your BI analysis and the insights gained from the “Data Set” in light of the analysis performed in previous tasks, make some logical recommendations to the POLICE CHIEF, and justify why/how your proposal could enhance company operations and could assist in achieving operational/strategic objectives with the help of appropriate references from peer-reviewed sources. [1-2 Pages]
Task 6 – Cover letter
Write a cover letter to the POLICE CHIEF with the important data insights and recommendations to achieve operational/strategic objectives. [1 page]
Task 7 – The Reflection:
Each team member is expected to write a brief reflection about this project in terms of challenges, learning and contribution.
Other Tasks –
Please refer to the marking scheme at the end of the assignment for other tasks and expectations.
Report Submission:
• Hard copy to the tutor's/lecturer's assignment box in week 10. Double-sided printing for the hard copy is encouraged in order to save paper.
• You will also submit a 7-8 page report (about 1500 words, not counting cover page and references) of this project. At least 15 references in your report must be from peer-reviewed sources. Include any and all sources of information, including any person(s) you interviewed for this project.
• Please note that all references must adhere to APA style. See http://owl.english.purdue.edu/owl/resource/560/01 and http://www.apastyle.org/ for details on how to format a report and how to cite references. Make sure you follow a formal report structure with cover page, introduction, headings, subheadings, conclusions and reference section.
• You are reminded to read the “Plagiarism” section of the course description. Your essay should be a synthesis of ideas from a variety of sources expressed in your own words. All reports must use the APA referencing style. University Referencing/Citation Style Guide: The University has published a style guide to help students correctly reference and cite information they use in assignments (American Psychological Association (APA) citation style, http://www.ballarat.edu.au/aasp/student/learning_support/generalguide/print/ch06s04.shtml or Australian citation style).
• Reports are to be presented in hard copy in size 12 Arial font and double spaced. Your report should include a list of references used in the essay and a bibliography of the wider reading you have done to familiarize yourself with the topic.
• A passing grade will be awarded to assignments adequately addressing all assessment criteria. Higher grades require better quality and more effort. For example, a minimum is set on the wider reading required. A student reading vastly more than this minimum will be better prepared to discuss the issues in depth and consequently their report is likely to be of a higher quality. So before submitting, please read through the assessment criteria very carefully.
Assignment 1- Data Analysis-
Marking Scheme Percentage 20%
Due Week 11 (Sunday 11:55pm) – Hard and Soft Copies
Tasks | Max Marks | Marks Awarded | Comments |
1- Background of the Project: Description of project, datasets and firm. The importance of the project for the firm [1+1+1+2] | 5 | | |
2- Dashboard/Reports: What are the BI reporting solutions/dashboards you will need to develop for operational manager of chosen in the light of Questions of your Data analysis - [Quality and complexity of the analysis – | 30 | | |
3- Advanced Work: The quality and complexity of the additional five (5) insights provided other than the guided questions. | 10 | | |
4- Research - Justify why these BI reporting solutions/dashboards were chosen and why those attributes are presented and laid out in the fashion you proposed (feel free to include all other relevant justifications). Note: To ensure that you discuss this task properly, you must include visual samples of the reports you produce (i.e., the screenshots of the BI report/dashboard must be presented and explained in the written report; use ‘Snipping tool’), and also include any assumptions that you may have made about the analysis in your assignment report (i.e., the report to the operational team of the company). [Each analysis/dashboard and report explanation with relevant research papers, complexity and depth of the justification, use of peer-reviewed sources] | 15 | | |
5- Recommendations - The POLICE CHIEF would like to improve the operations. Based on your BI analysis and the insights gained from the “Data Set” in light of the analysis performed in previous tasks, make some logical recommendations to the POLICE CHIEF, and justify why/how your proposal could assist in achieving operational/strategic objectives. | 15 | | |