Instruction: use the data from the zip file, read the question carefully, and answer every step and...

Question

Instruction: use the data from the zip file, read the question carefully, answer every step, and explain it. The homework is a Python programming assignment: answer the questions and write improved code according to the instructions. Submit the code in both .py and .ipynb format. The data for this homework is a zipped CSV file.

Ximi · Accepted Answer

{
 "cells": [
  {
   "cell_type": "code",
   "execution_count": 12,
   "metadata": {},
   "outputs": [],
   "source": [
    "# Imports\n",
    "import pandas as pd\n",
    "from sklearn.ensemble import RandomForestClassifier\n",
    "\n",
    "df = pd.read_csv('data_homwork.csv')"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 3,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "text/plain": [
       "(4999, 1805)"
      ]
     },
     "execution_count": 3,
     "metadata": {},
     "output_type": "execute_result"
    }
   ],
   "source": [
    "# Data rows and columns\n",
    "df.shape"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 31,
   "metadata": {},
   "outputs": [],
   "source": [
    "# Column names\n",
    "columns = df.columns"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "### Random Forest Classifier\n",
    "We first fit a model on all features, then use its feature importances to reduce the feature set.\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 18,
   "metadata": {},
   "outputs": [],
   "source": [
    "# Split into features and target variable\n",
    "X = df.drop(columns='target')\n",
    "y = df['target']"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 41,
   "metadata": {
    "scrolled": false
   },
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Feature ranking Top 10:\n",
      "1. feature ent_q_diff_diffs_2_median (0.013538)\n",
      "2. feature TB_77 (0.012635)\n",
      "3. feature Img0.1 (0.011410)\n",
      "4. feature TB_a9 (0.010800)\n",
      "5. feature TB_b1 (0.010358)\n",
      "6. feature _exit (0.010349)\n",
      "7. feature TB_93 (0.008868)\n",
      "8. feature TB_a3 (0.008600)\n",
      "9. feature TB_82 (0.008381)\n",
      "10. feature TB_aa (0.008079)\n"
     ]
    }
   ],
   "source": [
    "import numpy as np\n",
    "import matplotlib.pyplot as plt\n",
    "%matplotlib inline\n",
    "\n",
    "forest = RandomForestClassifier(n_estimators=50,\n",
    "                              random_state=0)\n",
    "\n",
    "forest.fit(X, y)\n",
    "importances = forest.feature_importances_\n",
    "# Spread of each feature's importance across the individual trees\n",
    "std = np.std([tree.feature_importances_ for tree in forest.estimators_],\n",
    "             axis=0)\n",
    "# Feature indices sorted by decreasing importance\n",
    "indices = np.argsort(importances)[::-1]\n",
    "\n",
    "# Print the feature ranking; index X.columns, since df.columns also contains 'target'\n",
    "print(\"Feature ranking Top 10:\")\n",
    "\n",
    "for f in range(10):\n",
    "    print(\"%d. feature %s (%f)\" % (f + 1, X.columns[indices[f]], importances[indices[f]]))\n",
    "\n"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 43,
   "metadata": {},
   "outputs": [
    {
     "data": {
      "image/png":

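The notebook excerpt above states the plan of fitting on all features and then reducing the feature set by importance, but the reduction step itself is not visible in the excerpt. The following is a minimal sketch of how that step could look, not the answer's actual code. It assumes a cutoff of the top 10 features, and it uses a synthetic data set from `make_classification` as a stand-in for the homework CSV; the helper `top_k_features` and the column names `f0`, `f1`, ... are hypothetical.

```python
import numpy as np
import pandas as pd
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score


def top_k_features(forest, feature_names, k=10):
    """Return the names of the k features with the highest importance."""
    order = np.argsort(forest.feature_importances_)[::-1][:k]
    return [feature_names[i] for i in order]


# Synthetic stand-in for the homework data (assumption: the real CSV is not used here).
X_arr, y_arr = make_classification(
    n_samples=300, n_features=40, n_informative=5, random_state=0
)
X = pd.DataFrame(X_arr, columns=[f"f{i}" for i in range(X_arr.shape[1])])
y = pd.Series(y_arr, name="target")

# Step 1: fit on all features, as in the notebook.
forest = RandomForestClassifier(n_estimators=50, random_state=0).fit(X, y)

# Step 2: keep only the k most important features (k=10 is an arbitrary choice).
selected = top_k_features(forest, list(X.columns), k=10)
X_reduced = X[selected]

# Step 3: compare cross-validated accuracy on the full and reduced feature sets.
full_score = cross_val_score(
    RandomForestClassifier(n_estimators=50, random_state=0), X, y, cv=3
).mean()
reduced_score = cross_val_score(
    RandomForestClassifier(n_estimators=50, random_state=0), X_reduced, y, cv=3
).mean()

print("Selected features:", selected)
print("CV accuracy, all features: %.3f" % full_score)
print("CV accuracy, top 10 features: %.3f" % reduced_score)
```

Because the features are selected on the full data before cross-validation, the reduced-set score is likely optimistic. Doing the selection inside each fold, for example with a scikit-learn `Pipeline` and `SelectFromModel`, would give a fairer comparison.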