STATISTICS. Final Project Sylvester Benson STAT UN1101 Prof. Li Haoran 22 April 2021 Covid-19 Infection and Death Rates Among African American and Hispanic Populations in New York City INTRODUCTION...

Perform hypothesis testing for the following assumptions:
African Americans and Hispanics have higher rates of infection, hospitalization, and death.


STATISTICS. Final Project
Sylvester Benson
STAT UN1101
Prof. Li Haoran
22 April 2021

Covid-19 Infection and Death Rates Among African American and Hispanic Populations in New York City

INTRODUCTION

Inequality gaps and racial marginalization are biases embedded in the social structure of America. Among the many consequences of this systemic marginalization are health inequalities. Historical archives document rapid rates of infection and death among marginalized communities during pandemics, including HIV and influenza. These marginalized communities include African Americans, Asians, and Hispanics, among others. African Americans (Blacks) and Latinos tend to be the most affected by racial health disparities. In this data analysis, I ask: are African Americans and Latinos disproportionately affected by Covid-19 in New York City? And if they are, which social determinants are implicated? I hypothesize that the rates of Covid-19 infection, hospitalization, and death among these two groups will be higher than among Whites and Asians. Whites will have more resources to afford quality healthcare and recover more quickly than the rest of the population, and thus will spend less time in the hospital. Publicly available data from the NYC Department of Health shows a correlation between ethnicity and the rates of infection, hospitalization, death, and vaccination. The outcome of this project will document the current effects of Covid-19 on marginalized communities while drawing the attention of people in authority to these biases.

A brief introduction to the domains of the variables

Cases: This data contains borough-specific counts and rates of confirmed cases, by age group, race/ethnicity group, and sex, as reported to the Health Department by laboratories. Rates are cumulative since the start of the outbreak and are given per 100,000 people by borough of residence and demographic group.
Hospitalization:

Death: This data includes the number of confirmed deaths due to Covid-19 by age and ethnicity. It also accounts for the number of confirmed deaths among people with underlying conditions, by age group, as well as among those without any underlying conditions. I have also included the number of confirmed deaths among people under investigation for underlying illness, by age group.

Vaccination: Data for vaccination is included to help analyze who has an advantage in getting vaccinated and why. It is also compared with the national average.

Boroughs: The data collected is specific to each of the five New York City boroughs: Bronx, Manhattan, Brooklyn, Staten Island, and Queens.

Ethnicity: The 100,000 subjects per borough are classified as Asian/Pacific-Islander, Black/African-American, Hispanic/Latino, and White.

Poverty: This data contains counts and rates of confirmed cases, hospitalizations, and deaths by ZCTA-level neighborhood poverty group in NYC (low, medium, high, very high). Low poverty: under 10%; Medium poverty: 10% to 19.9%; High poverty: 20% to 29.9%; Very high poverty: 30% and over.

DATA SOURCE

Data is collected from the New York City Department of Health. The data reports cases per 100,000 people from the five boroughs of New York, namely Manhattan, Brooklyn, Queens, Staten Island, and Bronx. The age of the sample ranged from 0 to 75+ years old.

EXPLORATORY ANALYSIS

Missing Values: Due to the small number of cases among transgender and gender-nonconforming people, data on those cases are not included in the table. The hospitalization rate by borough is affected by missing data from a number of facilities. This may lower the rate of hospitalization for some boroughs. There are no obvious outliers in the datasets.

Demographics: The 500,000 random subjects are classified into 6 groups: 0-17, 18-24, 25-34, 35-44, and 45-54. The total numbers of men and women are not specified. However, the sample is an accurate representation of the populations it is drawn from.

Infection Rates per 100,000 People From Each of the Five Boroughs

1. Cases Recorded in Boroughs

a) Table of Analysis
The figure represents the total number of cases reported by each borough, characterized by age and ethnicity.

b) Cases Recorded in Boroughs by Race
The observed trend shows that African Americans and Latinos are the most affected ethnic groups across the five boroughs.

c) Cases by Poverty Rate
This figure evaluates the relationship between poverty rate and the number of cases detected. It shows that poorer people have higher rates of infection, confirming the influence of socio-economic status.

d) Cases Recorded in Each Borough
The figure represents the overall infection rates across the five boroughs. The Bronx and Staten Island, which have larger Black and Hispanic populations, show higher rates of infection.

2. Hospitalizations

a) Data for Analysis
The figure represents the number of people hospitalized as a result of contracting Covid-19 across all boroughs, ages, and ethnicities.

b) Hospitalization Recorded in Boroughs by Race
This data describes the rate of hospitalization of different races across all five boroughs. It shows that African Americans and Latinos are hospitalized the most, while Whites and Asians are hospitalized less.

c) Hospitalization vs Poverty Rate
This data confirms that hospitalization rates increase with poverty rates.

d) Hospitalization by Boroughs
The figure shows that the Bronx and Queens have higher rates of hospitalization. It shows where relatively poorer people may be living due to gentrification.

3. Death

a) Table of Analysis
The figure shows the rate of deaths among different boroughs, ages, and races/ethnicities.

b) Death vs Poverty Rate
This figure shows increasing death rates with increasing poverty rates.

c) Death vs Boroughs
The figure shows higher rates of death in the Bronx and Staten Island.

d) Death vs Race
The data collected shows higher death rates among African Americans and Latinos.

VACCINATION

These paired figures compare vaccination rates across New York City with national data for all races. Vaccination is largely determined by eligibility, and this figure shows that Blacks and Hispanics are less likely to get vaccines.

ANALYSIS AND RESULTS

The hypothesis and complex regression analysis

● Null hypothesis: Infection rates of all races in New York are the same.
● Alternative hypothesis: African Americans and Latinos are disproportionately affected by Covid-19 in New York City.

Since there are disproportionately higher rates of infection, hospitalization, and death among African Americans and Latinos, the null hypothesis is rejected.

Perform Regression/correlation analysis

Ethnicity and Cases
Accumulated data shows that the number of reported Covid-19 cases varies largely with ethnicity. Blacks and Latinos have higher rates of infection than other races. Boroughs largely inhabited by racial minorities also have higher rates of infection as a result of gentrification.

Ethnicity and Hospitalization
Accumulated data also shows that the number of Covid-19 hospitalizations varies largely with ethnicity. Blacks and Latinos have higher rates of hospitalization than other races. Boroughs largely inhabited by racial minorities have higher rates of hospitalization too. Whites and Asians have better socio-economic standing and are able to afford high-quality health care; thus, they spend less time in the hospital.

Ethnicity and Death
Accumulated data shows that the number of deaths due to Covid-19 varies largely with ethnicity. Blacks and Latinos have higher rates of death than other races. Boroughs largely inhabited by racial minorities have higher rates of death as well. Blacks and Latinos are not able to access quality health care.
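The report rejects the null hypothesis by inspecting the figures rather than by computing a test statistic. One way the comparison of two groups' rates could be made formal is a two-proportion z-test. The sketch below is only an illustration under stated assumptions: the case counts are hypothetical placeholders, not values from the NYC Department of Health data, and the function name `two_proportion_z_test` is invented for this example.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_z_test(cases_a, pop_a, cases_b, pop_b):
    """One-sided z-test of H0: rate_a == rate_b against H1: rate_a > rate_b."""
    rate_a = cases_a / pop_a
    rate_b = cases_b / pop_b
    # Pooled rate under the null hypothesis that both groups share one rate.
    pooled = (cases_a + cases_b) / (pop_a + pop_b)
    std_err = sqrt(pooled * (1 - pooled) * (1 / pop_a + 1 / pop_b))
    z = (rate_a - rate_b) / std_err
    p_value = 1 - NormalDist().cdf(z)  # upper-tail probability
    return z, p_value


# Hypothetical counts per 100,000 residents (assumed for illustration only).
z, p_value = two_proportion_z_test(9000, 100_000, 7000, 100_000)
print(f"z = {z:.2f}, one-sided p = {p_value:.3g}")
```

With real counts in place of the placeholders, the null hypothesis would be rejected at the 5% level only if the printed p-value falls below 0.05. The same test could be repeated for hospitalization and death rates.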
DISCUSSION AND CONCLUSIONS

It is established that there are higher rates of infection, death, and hospitalization among Blacks and Hispanics in New York City. When the W.H.O. declared Covid-19 a pandemic last year, the whole world was gripped with fear. The statistics and reports were deteriorating each day, and all we wanted was a way to put an end to the turmoil. Despite having Covid vaccines now, there are still barriers to reaching herd immunity. There is systemic racial preference in vaccine distribution. Black Americans especially are left behind in vaccine distribution. To deal with Covid-19 more effectively, leaders must deal with systemic racism in healthcare.

Project_guidance

STAT UN1101 Spring 2021: Project

For the final project, you address some questions that interest you with the statistical methodology we learn. You choose the question; you decide how to collect data; you do the analyses. The questions can address almost any topic, including topics in psychology, sociology, natural science, medicine, public policy, sports, law, etc.

Research Questions and Data Collection: The most important aspects of any statistical analysis are stating questions and collecting data. You are permitted to collect data off of the web. Good projects begin with very clear and well-defined hypotheses. You should think of questions that interest you first, then worry about how to collect and analyze data to address those questions. Generally, vague topics lead to uninteresting projects. Example data sets can be found in The Data and Story Library (DASL): https://dasl.datadescription.com/ You are permitted to use data sets in the library and study a related question. However, the grading of projects that use these topics will be a little more strict.

A research question is not "Shall I run a regression between father's height and son's height?" but rather "Is the younger generation taller than their parents?" Below is a list of example questions:
1. Is the percent of a man's body that is fat related to age and some body measurements like height? How are they related?
2. Does the brain size of female college students affect their verbal IQs?
3. Do steel roller coasters run faster than wood ones?
4. How is car fuel economy related to engine displacement?

This does not imply that your project should involve only one question. Instead, as long as there are valid reasons, it is okay to investigate several closely related questions. For example, you may find interesting follow-up questions during the analysis. Or, perhaps there are different aspects from which to analyze the questions you are interested in. You may use multiple data sources if necessary. You also need to be realistic in planning your research design: can you carry out what you have planned within a reasonable time period and investment of your own energy? The quality of the final product is what counts, not just the amount of perspiration that went into it! Finally, you should make use of the concepts and methods learned in this course, and not just general knowledge, in planning and completing this type of project. If you are not confident about whether the question you select is appropriate, refer to the TA for advice.

Final Report

Your project report should tell the whole story (the motivation, formulation of research hypothesis, data source, analysis, etc.). A final report should include the following sections:
o Introduction: Describe the motivation of the research, overall finding, and significance (or insignificance).
o Data source: Describe the data, the source, and possibly the study design by which the data were collected.
Answered Same Day, Apr 26, 2021

Answer To: STATISTICS. Final Project Sylvester Benson STAT UN1101 Prof. Li Haoran 22 April 2021 Covid-19...

Mohd answered on Apr 27 2021
Problem Statement
The problem of chronic disease management is significant in today's society. Since healthcare organizations are committed to treating these complications, data analytics can help manage and predict this risk and reduce the cost of treating chronic disease by identifying patients at high risk of the disease, so that early intervention measures can be organized before it develops. Thompson, Steve, et al. report that 85% of United States healthcare costs are consumed by chronic diseases (Thompson, Steve, et al. 2020).
Introduction / Background
Investigating the problem of chronic disease management in today's society is needed because of several factors, such as expanding prevalence, the share of the population affected, and rising cost. In the United States alone, asthma is estimated at $81.9 million in societal cost (Junbo Son et al. 2020). Thus, the problem should be investigated and addressed to significantly improve patient outcomes and care, slow disease progression, and avert long-term cost. Thompson, Steve, et al. assert that health IT combined with data analytics will control disease progression to improve patient quality of life and reduce healthcare costs (Thompson, Steve, et al. 2020).
Objectives / Hypothesis / Research Questions
The goal of these research questions and hypotheses is to address the factors needed to fulfil the project topic and requirements. The research questions and hypotheses used for this project are detailed below.
Research Question 1:
Null Hypothesis: There is no difference in mean deaths reported between males and females.
Alternative Hypothesis: There is a difference in mean deaths reported between males and females.
The data elements: Total death reported, Gender.
We ran an independent-samples t-test to evaluate these hypotheses. The p-value is less than 0.05, so we reject the null hypothesis in favor of the alternative: mean deaths reported differ significantly between males and females.
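As a rough illustration of the test described above, the sketch below computes the t statistic for two independent samples. It is a minimal sketch under assumptions: it uses Welch's unequal-variance variant (the answer does not say which variant was run), the death counts are hypothetical placeholders rather than values from the CDC dataset, and the p-value is a large-sample normal approximation because the Python standard library has no t distribution.

```python
from math import sqrt
from statistics import NormalDist, mean, variance


def welch_t_test(sample_a, sample_b):
    """Welch's t statistic, its degrees of freedom, and an approximate p-value."""
    n_a, n_b = len(sample_a), len(sample_b)
    var_a, var_b = variance(sample_a), variance(sample_b)  # sample variances
    se_sq = var_a / n_a + var_b / n_b
    t = (mean(sample_a) - mean(sample_b)) / sqrt(se_sq)
    # Welch-Satterthwaite approximation to the degrees of freedom.
    df = se_sq ** 2 / (
        (var_a / n_a) ** 2 / (n_a - 1) + (var_b / n_b) ** 2 / (n_b - 1)
    )
    # Two-sided p-value from the normal approximation (reasonable only for
    # large samples; an exact value needs the t distribution with df above).
    p_approx = 2 * (1 - NormalDist().cdf(abs(t)))
    return t, df, p_approx


# Hypothetical deaths reported per record (assumed for illustration only).
male = [120, 135, 150, 142, 128, 160]
female = [100, 110, 95, 105, 115, 98]
t, df, p_approx = welch_t_test(male, female)
print(f"t = {t:.2f}, df = {df:.1f}, approximate p = {p_approx:.3g}")
```

For small samples like the placeholder lists, the statistic and degrees of freedom should be compared against a t table or a statistics package rather than relying on the normal approximation.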
Research Question 2:
Null Hypothesis: There is no difference in mean deaths reported across combinations of data source, chronic disease, and gender.
Alternative Hypothesis: There is a difference in mean deaths reported across combinations of data source, chronic disease, and gender.
The data elements: Datasource, Topic, gender, Death_repo.
We used an ANOVA test to evaluate these hypotheses. As the ANOVA table shows, deaths reported differ significantly across all combinations of data source, chronic disease, and gender. All of these groups have a p-value less than 0.05.
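The F statistic behind an ANOVA table can be sketched as follows. This is a simplified illustration, not the analysis the answer actually ran: it is a one-way ANOVA that treats each combination of factors as a single group, the group values are hypothetical, and the function name `one_way_anova` is invented here. The standard library has no F distribution, so the sketch stops at the statistic and its degrees of freedom.

```python
from statistics import mean


def one_way_anova(groups):
    """Return the F statistic and (between, within) degrees of freedom."""
    all_values = [x for group in groups for x in group]
    grand_mean = mean(all_values)
    k, n = len(groups), len(all_values)
    # Variation of group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of observations around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between, df_within = k - 1, n - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical deaths reported for three groups (assumed for illustration).
groups = [[1, 2, 3], [2, 3, 4], [3, 4, 5]]
f_stat, df_between, df_within = one_way_anova(groups)
print(f"F({df_between}, {df_within}) = {f_stat:.2f}")
```

The resulting F value would then be compared with the critical value of the F distribution for the same degrees of freedom, or passed to a statistics package to obtain the p-value.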
Research Question 3:
Null Hypothesis: There is no significant association between gender and chronic disease.
Alternative Hypothesis: There is a significant association between gender and chronic disease.
The data elements: Gender, Topic: chronic_desease
We ran a chi-square test to find whether there is any association between gender and chronic disease in terms of the number of deaths reported in each category. The p-value is less than 0.05, which means there is a significant association between gender and disease in terms of deaths reported.
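The chi-square test of independence can be sketched on a small contingency table. The counts below are hypothetical (two genders by three disease topics), not taken from the CDC dataset, and the function name is invented for this example. The p-value line relies on the fact that, for exactly 2 degrees of freedom, the chi-square upper-tail probability is exp(-x/2); other table sizes would need a statistics package.

```python
from math import exp


def chi_square_independence(table):
    """Return the chi-square statistic and degrees of freedom for a table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if gender and disease were independent.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return stat, df


# Hypothetical deaths reported: rows are genders, columns are disease topics.
table = [
    [40, 10, 50],
    [10, 40, 50],
]
stat, df = chi_square_independence(table)
# Closed-form upper-tail probability, valid only when df == 2.
p_value = exp(-stat / 2) if df == 2 else None
print(f"chi-square = {stat:.2f}, df = {df}, p = {p_value}")
```

A p-value below 0.05 on the real table would support the conclusion stated above that gender and chronic disease are associated in terms of deaths reported.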
Dataset Description
The dataset covers several types of chronic disease in the United States. With the CDC as the publisher and unique identifier of the data, the dataset contains important chronic disease data developed by consensus for public health practice. The reason for choosing the dataset is the data integrity, validity...