I need Section 1.2.1 completed in this doc before 6 pm EST today.
The State of Affairs Leading Up To and During the Anti-Police-Brutality Protests of Summer 2020

Summary

On May 25th, 2020, George Floyd was murdered by police officer Derek Chauvin after being accused of trying to pass a counterfeit twenty-dollar bill. The video of the incident went viral, and the killing sparked massive protests across the country. In the two weeks following George Floyd’s death, polls reported that between 15 and 26 million people participated in protests related to police misconduct (link).

Why was the turnout so high for these protests? By comparison, when the video of Philando Castile’s killing by a police officer went viral, there were fewer policing-related protests than we saw after George Floyd’s murder. Which factors may have led to such large protests across the country?

We intend to explore the circumstances surrounding the anti-police-brutality protests during the summer of 2020. We will examine these events through the following dimensions:
· Size
· Location
· Type of Protest
· Unemployment

Primary Dataset

The Crowd Counting Consortium (CCC) collects publicly available data on political crowds reported in the United States. The consortium was originally established to generate accurate counts of the protest crowds during the Women’s March on Washington, and it is maintained by a team of researchers and volunteers. The data can be downloaded here, and a good example to look at is here.

This dataset contains 67 fields and 84,662 records. We are interested in the 42,221 records where “George Floyd”, “racism”, or “policing” is claimed to be the primary reason for the protest.

Primary Dataset Exploratory Questions

· How were the different types of protests distributed across the United States?
Comment by Ayman Fadlelbari: Different types? Is this related to the valence variable?
· What cities and states had the highest number of protesters over this period?
· Do larger protests tend to have more organizations involved?
· Which protests had the highest ratio of arrests per protester?

Secondary Dataset - Dataset Related to State Unemployment Statistics

· This data is provided by the US Bureau of Labor Statistics. Data is available by city and town. Link.
· Fields: “state”, “rate”, “date”
Comment by Ayman Fadlelbari: We'll need this as well - we can add that ourselves if required - something like this: df["date"] = datetime.datetime(yyyy, mm, dd). We'll need it to join the data.
· Question to be answered: Is there a link between protests and state unemployment rates?

Data

First we limit our protests dataset to the protests that happened within our time interval of interest, and then we select only the variables of interest to us (seen below). Before starting the analysis, we conduct a sanity check on this limited set of variables.

[Explain the Joins next]

The variables that will be explored in this project are:
· date
· locality
· claims
· valence
· issues
· size_mean
· size_cat
· lat
· lon
· fips_code
· Month_year
· county_y
· population
· state_unemp_rate

First we check... [variable summaries - screenshots]

Further narrowing the data:

Concerns:

Exploratory Questions

Main question:

1. How were the different types of protests distributed across the United States?

1.1 How many protests occurred in the year 2020?

There is a total of N records in our dataset. [DEFINE PROTESTS HERE - check data dictionary. Need to clarify that each day is a separate protest + reason for definition]

1.2 How is the frequency of protests distributed over time?

There is a clear pattern in the number of protests where policing and racism were stated as core issues. The number of protests is roughly flat between January 1st 2020 and May 26th 2020, with few incidents reported [Exact number would be good]. After May 26th there is a sharp increase in the number of reported protests.
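The before-and-after comparison in Section 1.2 could be computed along the following lines. This is a minimal sketch, not the project's actual code: the toy rows stand in for the CCC records, the 2020 window is an assumed interval of interest, and the names in_window, daily_counts, n_before and n_after are hypothetical. Only the date and locality column names come from the variable list above.

```python
import pandas as pd

# Toy stand-in for the CCC records (made-up rows, for illustration only).
df = pd.DataFrame({
    "date": ["2019-12-31", "2020-05-20", "2020-05-27", "2020-05-27",
             "2020-06-01", "2020-06-01", "2020-06-02"],
    "locality": ["Z", "A", "B", "C", "D", "E", "F"],
})
df["date"] = pd.to_datetime(df["date"])

# Keep only records inside the interval of interest (assumed here: calendar year 2020).
start, end = pd.Timestamp("2020-01-01"), pd.Timestamp("2020-12-31")
in_window = df[(df["date"] >= start) & (df["date"] <= end)]

# Number of records per calendar day, the series behind a daily frequency chart.
daily_counts = in_window.groupby("date").size()

# Split at May 26th 2020, the cut-off date used in the text.
cutoff = pd.Timestamp("2020-05-26")
n_before = int((in_window["date"] <= cutoff).sum())
n_after = int((in_window["date"] > cutoff).sum())

print(daily_counts)
print("records up to cut-off:", n_before, "| records after cut-off:", n_after)
```

Run against the real records, n_before and n_after would give the exact counts that the paragraph above still leaves open.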
Comment by Ayman Fadlelbari: Loop back if time permits

The grey line on the chart indicates the date of George Floyd’s death. We believe the increase in the number of protests following this date is related to this event. The relatively low number of protests before this date may be due to COVID restrictions and social distancing measures that discouraged gatherings of large groups.
Comment by Ayman Fadlelbari: Link to relevant article for context.
Comment by Ayman Fadlelbari: Would need pre-COVID data to confirm this - can remove if we don't confirm later.
Comment by Ayman Fadlelbari: Also, events plotted here are "cause-specific" - would need to check for all types to back this thought
Comment by Ayman Fadlelbari: Would be worth adding vertical lines for other significant incidents as well. Comparison between them might be worth commenting on.

The impact of this event seems to have continued throughout the year, as the number of protests has not settled back to the previous baseline. This is more apparent in the following chart.
Comment by Ayman Fadlelbari: These axes might need titles before submission
Comment by Ayman Fadlelbari: For all charts

[A percentage increase would be nice here - number per month before and after].
Comment by Ayman Fadlelbari: Loop back if time permits. Simple enough with df.groupby(var).size()
Comment by Ayman Fadlelbari: Could plot it as well

1.2.1 Are protests more likely to occur on weekends or workdays?
Comment by Ayman Fadlelbari: The shape of the daily data raises the question. The spikes on the tail end look a bit too regular...

· [Needs some data work still - append a column to df with “work_day” or “weekend”]
Comment by Ayman Fadlelbari: @
[email protected] , can you help us out with this part?
_Assigned to Charles Lucas_
Comment by Charles Lucas: Sure. I am working on it right now.
· The datetime library has a function for this, date.weekday(), which returns 0 for Monday through 6 for Sunday (pandas exposes the same numbering as .dt.dayofweek) - shouldn’t take much code
· df_day_of_week = df.groupby(["col_name_here"]).size().reset_index()
· Should get a dataframe with 2 rows
· Plot on a bar chart

1.3 How is the frequency of protests geographically distributed across the United States?

Some comment here

Graphic - geographic, showing distribution of frequency per state. (Frequency per county?)
Comment by Ayman Fadlelbari: This is awesome!!
Comment by Ayman Fadlelbari: @
[email protected] , can we get one that's not adjusted for the county ratio as well? - Then we can show both - the steps in getting to this one and a comparison between the two.
Comment by Aris Chalini: yup I can work on that.
Comment by Ayman Fadlelbari: For frequency here please? We'll use the size ones in a section below.

1.3.1 Which states had the highest number of protests in 2020?

The figure below shows the total number of days on which protests were recorded in each state. The 10 states with the highest number of protests in 2020 were California, New York, Pennsylvania, Oregon, Massachusetts, Illinois, Florida, Michigan, Texas and Washington.

1.3.2 Which counties had the highest number of protests in 2020?

Looking at the graphic above, we can see that the protests are concentrated in particular counties within each state. In our primary dataset, there are n counties where protests were reported in 2020. Only the top 50 counties are included in the figure below. A larger image including all counties can be found in the appendix.
Comment by Ayman Fadlelbari: We should probably label figures to refer to them in other sections of the doc.
Comment by Ayman Fadlelbari: Referencing the US map with frequency per county

1.4 Next Q

[Question]
Comment by Norma Godinez: DUE TOMORROW: 1 question, 1 explanation - ALSO CONTINUE WORKING ON: JOIN, AGGREGATE (heart disease/surgery - good example)

[Explanation]

[Graphic] + [Tables]

Conclusions

[List of at least five]

Graphics Dump

Comment by Ayman Fadlelbari: This is awesome!!
Comment by Ayman Fadlelbari: @
[email protected] , can we get one that's not adjusted for the county ratio as well? - Then we can show both - the steps in getting to this one and a comparison between the two.
Comment by Aris Chalini: yup I can work on that.
Comment by Ayman Fadlelbari: For frequency here please? We'll use the size ones in a section below.

Appendix

Total days where protests were reported in 2020, by county