Ten-Minute Analytics Challenge
Context and Dataset
· The context for the Analytics Challenge project is the Household Pulse Survey (HPS) undertaken by the U.S. Census Bureau during the coronavirus pandemic. HPS was conducted in multiple phases, surveying U.S. households to measure the social and economic impacts of the pandemic. Publicly available data files contain individual (household) responses to survey questions and are released on a weekly basis on the survey website.
· Students can use the CSV version of the weekly public use files posted here. Each CSV version of the data is a zip file (e.g., “HPS week 1 PUF CSV”) that contains the following files:
o Data dictionary, which is an Excel spreadsheet with survey questions and explanations (e.g., “pulse2020_data.dictionary_CSV_01.xlsx”)
o Weekly household pulse survey data (e.g., “pulse2020_puf_01.csv”)
o Survey replicate weights (e.g., “pulse2020_repwgt_puf_01.csv”), which convey information about the sample design. For the purposes of the Analytics Challenge project, students can ignore this file.
· Since weekly data is available from multiple phases and U.S. states, a wide range of analysis designs and storytelling options is possible (e.g., using a specific subsample such as Pennsylvania data only, a cross-sectional approach using only a specific time period, or longitudinal comparisons across time periods and regions). Depending on their analysis goals and storytelling intent, students are welcome to construct (i.e., select, aggregate, and collate) their own sample from the above-referenced publicly available HPS files. If desired, students may also augment the HPS data with any other relevant data of their choice.
· The HPS data is widely used in both academic and industry research, and relevant published articles can be found through our library and search engines such as Google Scholar.
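For students assembling their own sample from the weekly zip files described above, the following is a minimal sketch of one way to read a weekly survey file straight from its zip archive and keep a single state's responses, using only the Python standard library. The function names are hypothetical, and the column name EST_ST (state code) and the value 42 for Pennsylvania are assumptions that should be checked against the data dictionary included in each zip file.

```python
import csv
import io
import zipfile


def read_puf_rows(zip_source, member_name):
    """Return the rows of one CSV file inside an HPS zip archive as dicts.

    zip_source may be a path to the downloaded zip file or a file-like object.
    member_name is the CSV inside it, e.g. "pulse2020_puf_01.csv".
    """
    with zipfile.ZipFile(zip_source) as archive:
        with archive.open(member_name) as raw:
            # Wrap the binary stream so the csv module receives text
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            return list(csv.DictReader(text))


def select_state(rows, state_code, state_column="EST_ST"):
    """Keep only the rows whose state column matches state_code.

    The default column name is an assumption; confirm it in the data dictionary.
    """
    # Compare as integers so that "42" and 42 are treated alike
    return [row for row in rows if int(row[state_column]) == int(state_code)]
```

For example, calling read_puf_rows on a downloaded week 1 zip file with the member name "pulse2020_puf_01.csv", and then select_state on the result with the code 42, would give a Pennsylvania-only subsample for that week. Repeating this for several weeks and concatenating the resulting lists is one way to build a longitudinal sample. The same steps can also be done with a data analysis library if the course has covered one.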
Instructions
· Create a PowerPoint presentation with insights from examining the above-referenced dataset using the techniques learned in the course.
· The Analytics Challenge project will be used to evaluate both data analysis and presentation skills. In addition to the general criteria used in similar competitions, the submissions for the course project will be evaluated for: (1) rigor and thoroughness of data analysis, (2) creative representation of analysis results, (3) communication and engagement in the video, (4) the level of challenge in the analysis, and (5) organization of the project deliverables.
Output
· Code in a Jupyter notebook
· PowerPoint presentation with detailed research