Brooklyn Housing Analysis Dataset:CSV here Provide a short narrative describing on the Brooklyn...

Question

Brooklyn Housing Analysis

Dataset: CSV here

Provide a short narrative describing the Brooklyn Housing Analysis problem. Find or create appropriate data that can be analyzed; a suitable dataset is available at the link above.

Write step-by-step instructions for completing the Graph Analysis and provide detailed findings at each stage of the analysis.

1. Load the data from the “train.csv” file into a DataFrame.

2. Display the dimensions of the file so you have a good idea of the amount of data you are working with.

3. Display the first 5 rows of data so you can see the column headings and the type of data in each column.

a. Notice that Survived is represented as a 1 or 0.

b. Notice that missing data is represented as “NaN”.

c. The Survived variable will be the “target” and the other variables will be the “features”.

4. Think about some questions that might help you predict who will survive:

a. What do the variables look like? For example, are they numerical or categorical? If they are numerical, what are their distributions? If they are categorical, how many observations fall into each category?

b. Are the numerical variables correlated?

c. Are the distributions of the numerical variables the same or different across grouped neighborhoods? Is there a specific year, or a pattern across years, in which the percentage of sales increased, and for what price range?

5. Look at summary information about your data (total, mean, min, max, freq., unique, etc.). Does this raise any more questions for you? Does it lead you to a conclusion yet?

6. Make some histograms of your data (“A picture is worth a thousand words!”).

7. Make some bar charts for variables with only a few options.

a. Create a stacked bar visualization of sale prices, grouped into pricing ranges, by year.

8. To see whether the variables are correlated, make some Pearson Ranking charts.

a. Notice that the sample code saves the chart as a PNG file.

b. The correlation between the variables is low (a coefficient near 1 or -1 indicates a strong positive or negative correlation; a value near 0 indicates little or no correlation). These results show there is “some” positive correlation, but it is not high.

9. Use a Parallel Coordinates visualization to compare the distributions of numerical variables between sales and increasing cost over the years.
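
Steps 5, 7a, and 8 can be sketched with pandas alone. The example below is a minimal sketch on made-up rows: the column names (`year_of_sale`, `sale_price`, `gross_sqft`), the price bins, and every value are assumptions for illustration and are not taken from the dataset.

```python
import pandas as pd

# Made-up sample rows; column names and values are assumptions for illustration.
sales = pd.DataFrame({
    "year_of_sale": [2015, 2015, 2016, 2016, 2017, 2017],
    "sale_price": [400000, 900000, 450000, 1500000, 700000, 2500000],
    "gross_sqft": [1200, 2000, 1300, 2600, 1600, 3500],
})

# Step 5: summary statistics (count, mean, min, max, ...) for every column.
summary = sales.describe(include="all")
print(summary)

# Step 7a: bucket sale prices into ranges, then count sales per year and range.
# The bin edges are arbitrary choices for this sketch.
bins = [0, 500_000, 1_000_000, float("inf")]
labels = ["up to 500k", "500k to 1M", "over 1M"]
sales["price_range"] = pd.cut(sales["sale_price"], bins=bins, labels=labels)
range_by_year = pd.crosstab(sales["year_of_sale"], sales["price_range"])
print(range_by_year)

# Step 8: Pearson correlation matrix of the numerical columns.
pearson = sales[["sale_price", "gross_sqft", "year_of_sale"]].corr(method="pearson")
print(pearson)
```

If matplotlib is installed, `range_by_year.plot(kind="bar", stacked=True)` would draw the stacked bar chart described in step 7a from the same table.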

Format: The completed task must be in a Jupyter Notebook with displayed results.

brooklyn-housing-analysis-yoso3gh4.ipynb brooklyn-housing-analysis-qilnatjb.docx

Answered Same Day · Oct 05, 2021

Ximi · Accepted Answer

{
 "cells": [
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "import pandas as pd\n",
    "import numpy as np\n",
    "import string\n",
    "import re\n",
    "import matplotlib.pyplot as plt\n",
    "from collections import Counter"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# Graph Analysis:\n",
    "### Write the step-by-step instructions for completing the Graph Analysis and provide findings"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "#### Step 1: Load data into a dataframe"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "housing_data = pd.read_csv('brooklynhomes2003to2017/brooklyn_sales_map.csv')"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "#### Step 2: Check the dimension of the table and view the data"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "print(\"The dimension of the table is: \", housing_data.shape)\n",
    "housing_data.head(5)"
   ]
  },
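
The notebook cells above cover steps 1 and 2. As a plain-Python sketch of the same steps, plus the column-type and missing-value checks that step 3 asks for, the following reads a small in-memory CSV instead of the real file. The column names and rows are illustrative assumptions and are not taken from the dataset.

```python
import io

import pandas as pd

# Tiny in-memory stand-in for the real CSV; columns and values are assumptions.
sample_csv = io.StringIO(
    "neighborhood,sale_price,year_of_sale,gross_sqft\n"
    "BAY RIDGE,650000,2015,1800\n"
    "BUSHWICK,,2016,2100\n"
    "PARK SLOPE,1200000,2017,\n"
)

# Step 1: load the data into a DataFrame.
housing_data = pd.read_csv(sample_csv)

# Step 2: number of rows and columns, then the first rows.
print("The dimension of the table is:", housing_data.shape)
print(housing_data.head(5))

# Step 3: column types and missing values (blank CSV fields load as NaN).
print(housing_data.dtypes)
missing_per_column = housing_data.isna().sum()
print(missing_per_column)
```

Counting NaN values per column early shows which features would need cleaning before any histogram or correlation chart is drawn.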
