Background
W. Avy likes your analysis, but he wants more information about temperature trends before opening the surf shop. Specifically, he wants temperature data for the months of June and December in Oahu, in order to determine if the surf and ice cream shop business is sustainable year-round.
What You're Creating
This new assignment consists of two technical analysis deliverables and a written report. You will submit the following:
- Deliverable 1: Determine the Summary Statistics for June
- Deliverable 2: Determine the Summary Statistics for December
- Deliverable 3: A written report for the statistical analysis (README.md)
Files
Use the following link to download the Challenge starter code.
Download challenge starter code(Links to an external site.)
Deliverable 1: Determine the Summary Statistics for June (40 points)
Deliverable 1 Instructions
Using Python, Pandas functions and methods, and SQLAlchemy, you’ll filter the date column of the Measurements table in thehawaii.sqlite
database to retrieve all the temperatures for the month of June. You’ll then convert those temperatures to a list, create a DataFrame from the list, and generate the summary statistics.
REWINDFor this deliverable, you’ve already done the following in this module:
Follow the instructions below to complete Deliverable 1.
- Download the
SurfsUp_Challenge_starter_code.ipynb
file into your surfs_up folder, then rename itSurfsUp_Challenge.ipynb
.
- Use the instructions below to add code where indicated by the numbered comments in the starter code file. The starter code file includes all dependencies needed for this Challenge.
- In Step 1, write a query that filters the
date
column from theMeasurement
table to retrieve all the temperatures for the month of June.
- In Step 2, convert the June temperatures to a list.
- In Step 3, create a DataFrame from the list of temperatures for the month of June.
- In Step 4, generate the summary statistics for the June temperatures DataFrame.
- After you run Step 4 in your
SurfsUp_Challenge.ipynb
file, confirm that the summary statistics match the image below.
Deliverable 1 Requirements
You will earn a perfect score for Deliverable 1 by completing all requirements below:
- A working query is written to retrieve the June temperatures from the
date
column of theMeasurement
table.(10 pt)
- The temperatures are added to a list.(10 pt)
- The list of temperatures is converted to a Pandas DataFrame.(10 pt)
- Summary statistics are generated for the DataFrame.(10 pt)
Deliverable 2: Determine the Summary Statistics for December (40 points)
Deliverable 2 Instructions
Using Python, Pandas functions and methods, and SQLAlchemy, you’ll filter the date column of the Measurements table in thehawaii.sqlite
database to retrieve all the temperatures for the month of December. You’ll then convert those temperatures to a list, create a DataFrame from the list, and generate the summary statistics.
REWINDFor this deliverable, you’ve already done the following in this module:
Follow the instructions below to complete Deliverable 2.
- Use the instructions below to add code where indicated by the numbered comments in your
SurfsUp_Challenge.ipynb
file.
- In Step 6, write a query that filters the
date
column from theMeasurement
table to retrieve all the temperatures for the month of December.
- In Step 7, convert the December temperatures to a list.
- In Step 8, create a DataFrame from the list of temperatures for the month of December.
- In Step 9, generate the summary statistics for the December temperatures DataFrame.
- After you run Step 9 in your
SurfsUp_Challenge.ipynb
file, confirm that the summary statistics match the image below.
Deliverable 2 Requirements
You will earn a perfect score for Deliverable 2 by completing all requirements below:
- A working query is written to retrieve the December temperatures from the
date
column of theMeasurement
table(10 pt)
- The temperatures are added to a list.(10 pt)
- The list of temperatures is converted to a Pandas DataFrame.(10 pt)
- Summary statistics are generated for the DataFrame.(10 pt)
Deliverable 3: A written report for the statistical analysis (20 points)
Deliverable 3 Instructions
For this part of the Challenge, write a report that describes the key differences in weather between June and December and two recommendations for further analysis.
The analysis should contain the following:
Overview of the analysis:Explain the purpose of this analysis.
Results:Provide a bulleted list with three major points from the two analysis deliverables. Use images as support where needed.
Summary:Provide a high-level summary of the results and two additional queries that you would perform to gather more weather data for June and December.
Deliverable 3 Requirements
Structure, Organization, and Formatting (6 points)
The written analysis has the following structure, organization, and formatting:
- There is a title, and there are multiple sections.(2 pt)
- Each section has a heading and subheading.(2 pt)
- Links to images are working and displayed correctly.(2 pt)
Analysis (14 points)
The written analysis has the following:
Overview of the statistical analysis:
- The purpose of the analysis is well defined.(3 pt)
Results:
- There is a bulleted list that addresses the three key differences in weather between June and December.(6 pt)
Summary:
- There is a high-level summary of the results and there are two additional queries to perform to gather more weather data for June and December.(5 pt)
Submission
Once you’re ready to submit, make sure to check your work against the rubric to ensure you are meeting the requirements for this Challenge one final time. It’s easy to overlook items when you’re in the zone!
As a reminder, the deliverables for this Challenge are as follows:
- Deliverable 1: Determine the Summary Statistics for June
- Deliverable 2: Determine the Summary Statistics for December
- Deliverable 3: A written report for the statistical analysis (README.md)
Upload the following to your surfs_up GitHub repository:
- The
SurfsUp_Challenge.ipynb
file.
- The
hawaii.sqlite
file.
- An updated README.md that has your written analysis
To submit your challenge assignment in Canvas, click Submit, then provide the URL of your surfs_up GitHub repository for grading. Comments are disabled for graded submissions in BootCampSpot. If you have questions about your feedback, please notify your instructional staff or the Student Success Manager. If you would like to resubmit your work for an improved grade, you can use theRe-Submit Assignmentbutton to upload new links. You may resubmit up to 3 times for a total of 4 submissions.
IMPORTANTOnce you receive feedback on your Challenge, make any suggested updates or adjustments to your work. Then, add this week’s Challenge to your professional portfolio.
NOTEYou are allowed to miss up to two Challenge assignments and still earn your certificate. If you complete all Challenge assignments, your lowest two grades will be dropped. If you wish to skip this assignment, click Next, and move on to the next Module.
Rubric
Module-9 RubricModule-9 Rubric
Criteria |
Ratings |
Pts |
---|
This criterion is linked to a learning outcomeDeliverable 1: Determine the Summary Statistics for June |
40to >36.0PtsMastery✓There is a working query that retrieves the temperatures from the Measurements table. ✓The temperatures are added to a list. ✓The list of temperatures is converted to a DataFrame. ✓A summary statistics table is generated. |
36to >32.0PtsApproaching Mastery✓The query retrieves the temperatures from the correct month. ✓The temperatures are added to a list, but there is a minor error. ✓The list of temperatures is converted to a DataFrame with a minor error. ✓A summary statistics table is generated. |
32to >26.0PtsProgressing✓The query retrieves the temperatures from the correct month, but there is additional data. ✓The temperatures and additional data are added to a list. ✓The temperatures and additional data are converted to a DataFrame. ✓A summary statistics table is generated. |
26to >0.0PtsEmerging✓The query retrieves the incorrect temperatures. ✓The incorrect temperatures are added to a list. ✓The list of temperatures and the extra data are added to a DataFrame. ✓A summary statistics table is generated, but the data is incorrect. |
0PtsIncomplete |
|
40pts
|
This criterion is linked to a learning outcomeDeliverable 2: Determine the Summary Statistics for December |
40to >36.0PtsMastery✓There is a working query that retrieves the temperatures from the Measurements table. ✓The temperatures are added to a list. ✓The list of temperatures is converted to a DataFrame. ✓A summary statistics table is generated. |
36to >32.0PtsApproaching Mastery✓The query retrieves the temperatures from the correct month. ✓The temperatures are added to a list, but there is a minor error. ✓The list of temperatures is converted to a DataFrame with a minor error. ✓A summary statistics table is generated. |
32to >26.0PtsProgressing✓The query retrieves the temperatures from the correct month, but there is additional data. ✓The temperatures and additional data are added to a list. ✓The temperatures and additional data are converted to a DataFrame. ✓A summary statistics table is generated. |
26to >0.0PtsEmerging✓The query retrieves the incorrect temperatures. ✓The incorrect temperatures are added to a list. ✓The list of temperatures and the extra data are added to a DataFrame. ✓A summary statistics table is generated, but the data is incorrect. |
0PtsIncomplete |
|
40pts
|
This criterion is linked to a learning outcomeDeliverable 3: Structure, Organization, and Formatting |
6to >5.0PtsMasteryThe written analysis has ALL of the following: ✓There is a title, and there are multiple sections. ✓Each section has a heading. ✓There are images, and they are formatted and displayed correctly. |
5to >3.0PtsApproaching MasteryThe written analysis has ALL of the following: ✓There is a title, and there are multiple sections. ✓Each section has a heading. ✓There are images, and they are formatted and displayed correctly with one or two minor errors. |
3to >2.0PtsProgressingThe written analysis has ALL of the following: ✓There is a title, and there are multiple sections. AND ONE of the following: ✓Each section may have a heading. ✓There are images to code, and they are formatted and displayed correctly with one or two minor errors. |
2to >0.0PtsEmergingThe written analysis has ALL of the following: ✓There is a title. ✓There are no headings for each section, but there are three sections. |
0PtsIncomplete |
|
6pts
|
This criterion is linked to a learning outcomeDeliverable 3: Analysis |
14to >12.0PtsMastery✓The purpose is well defined. ✓All THREE major points are described. ✓There is a high-level summary of the results and TWO additional queries. |
12to >10.0PtsApproaching Mastery✓The purpose is well defined. ✓All THREE major points are described. ✓There is a high-level summary of the results and ONE of TWO additional queries. |
10to >7.0PtsProgressing✓The purpose is well defined. ✓TWO of the THREE major points are described. ✓There is a high-level summary of the results and ONE of TWO additional queries. |
7to >0.0PtsEmerging✓The purpose is well defined. ✓ONE of the THREE major points is described. ✓There is a high-level summary of the results only. |
0PtsIncomplete |
|
14pts
|
Total points:100 |
9.5.6: Statistics Route" style="float: left;">
PreviousModule 9 Career Connection" style="float: right;">
Next© 2020 - 2022 Trilogy Education Services, a 2U, Inc. brand. All Rights Reserved.