SIT 112 | Data Science Concepts Lecturer: Dr Sergiy Shelyag | XXXXXXXXXX Data Science Project Due: Friday 5pm, 31st May 2019 Note: This project contributes 25% to your final SIT112 mark. It must be...

SIT 112 | Data Science Concepts
Lecturer: Dr Sergiy Shelyag | [email protected]

Data Science Project
Due: Friday 5pm, 31st May 2019

Note: This project contributes 25% to your final SIT112 mark. It must be completed individually and submitted before the due date: 5pm, 31/05/2019.

This Data Science Project aims to apply machine learning techniques to understand and visualize relationships in data. The task specifications for this project focus on two supervised learning tasks: linear regression and classification. You must demonstrate the skills acquired in describing the data, exploratory data analysis and prediction.

1. Data and Resources

In the Data Science Project folder, you will find the following files:

Filename: Project_instructions.pdf
Description: The file containing the instructions to complete your project.

Filename: project_notebook.ipynb
Description: The Jupyter notebook which has been prepared and pre-filled for you to complete the programming task.

http://data.gov.au/

2. Task Description

A Python notebook file, project_notebook.ipynb, has been prepared for you to complete this task. Download this notebook, load it up and follow the instructions inside the notebook to complete the task. You are required to submit your solution in Jupyter Notebook format as well as its exported version in HTML.

3. Summary for submission

This project is to be completed individually and submitted online. By the due date, you are required to submit the following files to the corresponding Assessments/DSProject in the SIT112 site:

1. [YourID]_project_solution.ipynb: your IPython notebook solution source file.
2. [YourID]_project_output.html: the output of your IPython notebook solution in HTML.

END OF PROJECT DESCRIPTION
Answered Same Day | May 04, 2021 | SIT113 | Deakin University

Shivinder answered on May 05 2021
{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# SIT 112 - Data Science Concepts\n",
"# Data Science Project\n",
"\n",
"---\n",
"Lecturer: Sergiy Shelyag | [email protected]\n",
"\n",
"School of Information Technology,\n",
"Deakin University, VIC 3215, Australia.\n",
"\n",
"### Due: 5pm, 1st June 2018\n",
"---\n"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"### Instructions\n",
"\n",
"This notebook has been prepared for you to complete the Data Science Project. Some sections have been pre-filled to help you get started. **The total mark for this project is 100**.\n",
"\n",
"There are two parts in this notebook for you to complete:\n",
"\n",
"* **Part 1**: *Linear Regression* (**50 marks**)\n",
"* **Part 2**: *Classification* (**50 marks**)\n",
"\n",
"Each part includes three main components:\n",
" * **A:** Load a dataset from sklearn and examine it.\n",
" * **B:** Build a training model and make predictions.\n",
" * **C:** Report the results and visualize the data.\n",
"\n",
"Before you start, read the entire notebook carefully to understand what you need to do. You should also refer to the main instructions in *Project_instructions.pdf* to know what else you need to complete for this project.\n",
"\n",
"For each cell marked with **# YOU ARE REQUIRED TO INSERT YOUR CODES IN THIS CELL**, there will be places where you **must** supply your own code where instructed. \n",
"\n",
"In the end, you must execute the entire notebook and submit two files:\n",
"\n",
" 1. The source of your solution notebook: **[YourID]_project_solution.ipynb**\n",
" 2. And an exported version of your output: **[YourID]_project_output.html**\n",
" \n",
"As you go through this notebook:\n",
"\n",
"* markdown cells marked with **Note** are descriptive sections.\n",
"* markdown cells marked with **Instructions** give the instructions you need to follow to complete the sections.\n",
"\n",
"Please proceed with the instructions for each part below to complete your programming tasks."
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Note**: The following packages will be required for this assignment. If you need to import more packages, you may append them to the end of the following cell. "
]
},
{
"cell_type": "code",
"execution_count": 86,
"metadata": {},
"outputs": [],
"source": [
"'''\n",
"Import packages needed for processing\n",
"'''\n",
"import numpy as np\n",
"from sklearn import datasets\n",
"import sklearn.metrics as metrics\n",
"\n",
"from sklearn import linear_model\n",
"from sklearn import naive_bayes\n",
"from sklearn.manifold import TSNE\n",
"import matplotlib.pyplot as plt \n",
"\n",
"%matplotlib inline\n",
"\n",
"'''\n",
"If you need to add any additional packages, add them below this line\n",
"'''\n",
"import collections\n",
"import seaborn as sns\n",
"from sklearn.linear_model import LinearRegression\n",
"from sklearn.naive_bayes import MultinomialNB\n",
"from sklearn.metrics import precision_score, recall_score, f1_score"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# Part 1: Linear Regression\n",
"\n",
"In this part, you will be required to work on Linear Regression for the **diabetes** dataset from sklearn. More about the dataset can be found [here](http://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_diabetes.html#sklearn.datasets.load_diabetes).\n"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Part 1A: Load and examine the diabetes dataset"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Instruction 1.1.** Write your code to load the **diabetes** dataset from sklearn and assign it to a variable called `diabetes`.\n",
"\n",
"[**Total mark: 3**]"
]
},
{
"cell_type": "code",
"execution_count": 87,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"dict_keys(['data', 'target', 'DESCR', 'feature_names', 'data_filename', 'target_filename'])\n"
]
}
],
"source": [
"# YOU ARE REQUIRED TO INSERT YOUR CODES IN THIS CELL\n",
"'''\n",
"1. Write your code to load the **diabetes** dataset from sklearn \n",
" and assign it to a variable called `diabetes`.\n",
"'''\n",
"diabetes = datasets.load_diabetes()\n",
"\n",
"print(diabetes.keys())"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Note:** `diabetes` is a dictionary-like object. Two of its keys are used here: *'data'*, a numpy 2D array containing the features, and *'target'*, containing the labels. The code cell below assigns the data to variable `X` and the labels to variable `Y`. Run the cell and use `X` and `Y` for later tasks."
]
},
{
"cell_type": "code",
"execution_count": 88,
"metadata": {},
"outputs": [],
"source": [
"X = diabetes['data']\n",
"Y = diabetes['target']"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Instruction 1.2.** Now you need to examine the size of data. Write your code to find and print out the number of **samples** and the number of **features** in the dataset.\n",
"\n",
"[**Total mark: 2**]"
]
},
{
"cell_type": "code",
"execution_count": 5,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"The number of samples: 442\n",
"The number of features: 10\n"
]
}
],
"source": [
"# YOU ARE REQUIRED TO INSERT YOUR CODES IN THIS CELL\n",
"'''\n",
"2. Write your code to find and print out the number of **samples** \n",
" and the number of **features** in the dataset.\n",
" Using variable X.\n",
"'''\n",
"print('The number of samples:', X.shape[0])\n",
"\n",
"print('The number of features:',X.shape[1])"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Instruction 1.3.** We also need to get a brief understanding about the task by doing some statistics on the labels.\n",
"\n",
"**Your tasks are:**\n",
"\n",
"1. Write your code to print the **min**, **max** and **median** of the labels. (3 marks)\n",
"2. Construct a **box-plot** for the labels. (2 marks)\n",
"\n",
"[**Total marks: 5**]"
]
},
{
"cell_type": "code",
"execution_count": 54,
"metadata": {},
"outputs": [
{
"data": {
"image/png": "\n",
"text/plain": [
"[Figure: box-plot output, image data not preserved]"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"# YOU ARE REQUIRED TO INSERT YOUR CODES IN THIS CELL\n",
"'''\n",
"2. Construct a box-plot for the labels.\n",
"'''\n",
"# construct a box-plot for the labels (the target variable Y)\n",
"fig = plt.figure(figsize=(16,3))\n",
"sns.boxplot(x=Y, color='steelblue')\n",
"plt.title('target', fontsize=15)\n",
"plt.show()\n",
"\n",
"# additionally, construct one box-plot per feature in X\n",
"color_vals = ['Red','Green','Grey','Darkorange','Olivedrab','Indigo','Cadetblue','Lightgreen','Gold','Salmon']\n",
"\n",
"fig = plt.figure(figsize=(16,8))\n",
"\n",
"fontsize = 15\n",
"for pos in range(X.shape[1]):\n",
"    ax = plt.subplot(5,2,pos+1)\n",
"    sns.boxplot(x=X[:,pos], color = color_vals[pos], ax = ax)\n",
"    ax.set_title(diabetes['feature_names'][pos], fontsize = fontsize)\n",
"    \n",
"plt.tight_layout()\n",
"plt.show()"
]
},
{
"cell_type": "code",
"execution_count": 11,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Feature\t\tMinimum Value\t\t\tMaximum Value\t\t\tMedian\n",
"age\t\t-0.107225631607358\t\t0.110726675453815\t\t0.00538306037424807\n",
"sex\t\t-0.044641636506989\t\t0.0506801187398187\t\t-0.044641636506989\n",
"bmi\t\t-0.0902752958985185\t\t0.17055522598066\t\t-0.00728376620968916\n",
"bp\t\t-0.112399602060758\t\t0.132044217194516\t\t-0.00567061055493425\n",
"s1\t\t-0.126780669916514\t\t0.153913713156516\t\t-0.00432086553661359\n",
"s2\t\t-0.115613065979398\t\t0.198787989657293\t\t-0.00381906512053488\n",
"s3\t\t-0.10230705051742\t\t0.181179060397284\t\t-0.00658446761115617\n",
"s4\t\t-0.076394503750001\t\t0.185234443260194\t\t-0.00259226199818282\n",
"s5\t\t-0.126097385560409\t\t0.133598980013008\t\t-0.00194763415685317\n",
"s6\t\t-0.137767225690012\t\t0.135611830689079\t\t-0.00107769750046639\n",
"Target Variable\n",
"Minimum Value: 25.0, Maximum Value: 346.0, Median: 140.5\n"
]
}
],
"source": [
"# YOU ARE REQUIRED TO INSERT YOUR CODES IN THIS CELL\n",
"'''\n",
"1. Write your code to print the min, max and median of the labels.\n",
" Using variable Y.\n",
"'''\n",
"print('Feature\\t\\tMinimum Value\\t\\t\\tMaximum Value\\t\\t\\tMedian')\n",
"for pos in range(X.shape[1]):\n",
"    print('{}\\t\\t{}\\t\\t{}\\t\\t{}'.format(diabetes['feature_names'][pos], np.min(X[:,pos]), np.max(X[:,pos]), np.median(X[:,pos])))\n",
"\n",
"print('Target Variable')\n",
"print('Minimum Value: {}, Maximum Value: {}, Median: {}'.format(np.min(Y), np.max(Y), np.median(Y)))"
]
},
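**Note:** A box-plot is a picture of a five-number summary. As a minimal sketch (the helper names `five_number_summary` and `whisker_limits` are hypothetical and not part of the notebook), the quantities it draws can be computed directly with NumPy. The whisker limits assume the common 1.5 × IQR convention, and the toy values stand in for `Y`; they are not the diabetes targets.

```python
import numpy as np

def five_number_summary(values):
    """Return min, Q1, median, Q3 and max of a 1-D array (hypothetical helper)."""
    values = np.asarray(values, dtype=float)
    q1, median, q3 = np.percentile(values, [25, 50, 75])
    return {
        "min": values.min(),
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": values.max(),
    }

def whisker_limits(values, k=1.5):
    """Bounds beyond which points are usually drawn as outliers (k * IQR rule)."""
    s = five_number_summary(values)
    iqr = s["q3"] - s["q1"]
    return s["q1"] - k * iqr, s["q3"] + k * iqr

# Toy labels for illustration only (made-up values)
toy_labels = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0])
print(five_number_summary(toy_labels))
print(whisker_limits(toy_labels))
```

Calling `five_number_summary(Y)` in the notebook would give the same min, max and median that the cell above prints for the target variable, plus the two quartiles that form the box.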
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Part 1B. Linear Regression\n",
"\n",
"You are required to apply Linear Regression to train and make predictions on the **diabetes** dataset."
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Note:** To perform a supervised learning task, we need to train the model on a training set of the input data and the correct labels, and then use the trained model to make predictions on **unseen** data. Then, we use the correct labels of the **unseen** data to evaluate the performance of the model. The **unseen** dataset is called the **test set**."
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Instruction 1.4.** First you need to split the **diabetes** dataset into a training set and a test set. We will use 70% of the samples for training and 30% for testing. Print the number of samples in each set.\n",
"\n",
"[**Total marks: 5**]"
]
},
{
"cell_type": "code",
"execution_count": 12,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"The number of samples in the training set:\n",
"309\n",
"The number of samples in the test set:\n",
"133\n"
]
}
],
"source": [
"# YOU ARE REQUIRED TO INSERT YOUR CODES IN THIS CELL\n",
"\n",
"# first, compute the number of samples in the training set:\n",
"n_train = int(len(Y) * 0.7)\n",
"\n",
"# The training set is the first n_train samples in the dataset\n",
"X_train = X[:n_train]\n",
"Y_train = Y[:n_train]\n",
"\n",
"# The test set is the remaining samples in the dataset\n",
"X_test = X[n_train:]\n",
"Y_test = Y[n_train:]\n",
"\n",
"# Print the number of samples in the training set\n",
"print('The number of samples in the training set:')\n",
"print(len(X_train))\n",
"\n",
"# Print the number of samples in the test set\n",
"print('The number of samples in the test set:')\n",
"print(len(X_test))\n"
]
},
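**Note:** The split above takes the first 70% of the rows for training, which is only representative if the rows are in no particular order. A hedged alternative sketch is to shuffle the row indices before splitting. The helper `shuffled_split` and its `seed` parameter are hypothetical names introduced here, and the demo arrays are toy data with the same shape as the diabetes features; scikit-learn's `train_test_split` offers the same idea ready-made.

```python
import numpy as np

def shuffled_split(X, Y, train_fraction=0.7, seed=0):
    """Shuffle row indices, then split into training and test parts (hypothetical helper)."""
    X = np.asarray(X)
    Y = np.asarray(Y)
    rng = np.random.default_rng(seed)       # fixed seed so the split is repeatable
    order = rng.permutation(len(Y))         # random ordering of the row indices
    n_train = int(len(Y) * train_fraction)  # same rounding as the cell above
    train_idx, test_idx = order[:n_train], order[n_train:]
    return X[train_idx], X[test_idx], Y[train_idx], Y[test_idx]

# Toy data with the same shape as the diabetes features (442 samples, 10 features)
X_demo = np.arange(442 * 10, dtype=float).reshape(442, 10)
Y_demo = np.arange(442, dtype=float)

X_tr, X_te, Y_tr, Y_te = shuffled_split(X_demo, Y_demo)
print('The number of samples in the training set:', len(X_tr))
print('The number of samples in the test set:', len(X_te))
```

Because features and labels are indexed with the same shuffled indices, each row of `X` stays paired with its own label.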
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Instruction 1.5.** Create a Linear Regression model called `lr`.\n",
"\n",
"[**Total marks: 5**]"
]
},
{
"cell_type": "code",
"execution_count": 14,
"metadata": {},
"outputs": [],
"source": [
"lr = LinearRegression()"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Instruction 1.6.** Fit the training data to the `lr` model.\n",
"\n",
"[**Total marks: 5**]"
]
},
{
"cell_type": "code",
"execution_count": 16,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"LinearRegression(copy_X=True, fit_intercept=True, n_jobs=None,\n",
" normalize=False)"
]
},
"execution_count": 16,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"# YOU ARE REQUIRED TO INSERT YOUR CODES IN THIS CELL\n",
"lr.fit(X_train, Y_train)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Instruction 1.7** Predict the output of the test set.\n",
"\n",
"[**Total marks: 5**]"
]
},
{
"cell_type": "code",
"execution_count": 18,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"[139.99381608 205.02640697 176.97778704 122.05974488 213.18077169\n",
" 174.98852064 112.65389212 202.82658131 172.35243033 164.47418705\n",
" 196.15962387 192.54406753 293.90068958 299.7465604 232.76334834\n",
" 213.53873831 228.48533512 157.1751098 226.27609511 194.21368886\n",
" 101.82219913 174.76393876 111.17348152 294.07940325 179.7869339\n",
" 76.42319163 85.90565583 259.9593708 168.07240778 119.57919313\n",
" 150.68038442 164.06743377 179.24446569 159.59344772 155.87358338\n",
" 143.36467882 123.92736711 176.618279 103.82379184 133.74527488\n",
" 90.94561588 253.81583777 84.12062018 61.3713544 177.97627662\n",
" 196.51767018 130.92146865 88.54111378 199.91912195 53.81979958\n",
" 173.22993854 198.32897556 121.58954455 233.94327416 161.48314734\n",
" 161.86371717 166.464451 261.38157828 260.15223634 204.29534606\n",
" 187.46774384 60.21859679 205.12037857 107.69117313 143.08887348\n",
" 127.96638789 174.54142953 213.69268751 162.95717781 160.21421003\n",
" 137.66760774 173.22737347 70.19308694 262.04969756 111.92846102\n",
" 106.78308221 135.27897046 111.39812286 96.75830871 156.2932552\n",
" 74.87226648 264.28605791 57.01316879 98.1732752 101.31653912\n",
" 276.71186426 170.88557856 62.93955692 186.13594953 171.95979912\n",
" 187.00830031 186.13852705 92.7534905 147.48019274 258.94145986\n",
" 198.28792015 280.72025011 49.41893384 178.41110056 202.3385569\n",
" 167.8026343 155.69294572 155.31812231 236.4358523 124.55734266\n",
" 164.29993856 174.51019295 225.77959304 155.78051853 100.5574813\n",
" 84.53953058 141.2656356 190.79685208 196.75932921 145.53709822\n",
" 171.10705054 113.76652532 160.56105266 130.19483903 262.70560773\n",
" 100.20764258 115.05499404 119.61524219 226.25793937 63.5114853\n",
" 133.50498439 119.54796909 54.62907085 189.07170565 101.65801976\n",
" 119.11267429 212.71676793 57.80383242]\n"
]
}
],
"source": [
"Y_pred = lr.predict(X_test)\n",
"print(Y_pred)"
]
},
{
"cell_type": "markdown",
"metadata": {
"collapsed": true
},
"source": [
"## Part 1C. Results and Visualization"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Note:** To evaluate the performance of a Linear Regression model, two commonly used measures are **mean absolute error** and **root mean squared error**.\n",
"\n",
"**mean absolute error** is defined by:\n",
"\n",
"$$mean\\_absolute\\_error(Y_{test}, Y_{pred}) = \\frac{1}{n_{samples}}\\sum_{i=1}^{n_{samples}}|y_{test}^i - y_{pred}^i|$$\n",
"\n",
"**root mean squared error** is defined by:\n",
"\n",
"$$root\\_mean\\_squared\\_error(Y_{test}, Y_{pred}) = \\sqrt{\\frac{1}{n_{samples}}\\sum_{i=1}^{n_{samples}}(y_{test}^i - y_{pred}^i)^2}$$\n"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Instruction 1.8.** Compute **mean absolute error** and **root mean squared error** between the correct labels and the predictions of the test set and print these two values.\n",
"\n",
"[**Total marks: 8**]\n",
"\n",
"**Hint:** You might need to use [Regression metrics](http://scikit-learn.org/stable/modules/model_evaluation.html#regression-metrics) from sklearn."
]
},
{
"cell_type": "code",
"execution_count": 24,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Mean Absolute Error: 40.871175845654435\n",
"Root Mean Squared Error: 52.174426892911136\n"
]
}
],
"source": [
"# YOU ARE REQUIRED TO INSERT YOUR CODES IN THIS CELL\n",
"\n",
"# Compute the mean absolute error between Y_test and Y_pred\n",
"# Then, print the value\n",
"\n",
"diff = abs(Y_test - Y_pred)\n",
"mean_absolute_error = np.sum(diff)/len(diff)\n",
"print('Mean Absolute Error:',mean_absolute_error)\n",
"\n",
"# Compute the root mean squared error between Y_test and Y_pred\n",
"# Then, print the value\n",
"\n",
"sqrd_diff = np.power((Y_test - Y_pred),2)\n",
"mean_squared_error = np.sum(sqrd_diff)/len(sqrd_diff)\n",
"root_mean_squared_error = np.sqrt(mean_squared_error)\n",
"print('Root Mean Squared Error:',root_mean_squared_error)"
]
},
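**Note:** The hint for Instruction 1.8 points to sklearn's regression metrics. As a sketch of that route, `mean_absolute_error` and `mean_squared_error` from `sklearn.metrics` reproduce the two formulas defined earlier. The arrays below are made-up toy values, not the notebook's `Y_test` and `Y_pred`.

```python
import numpy as np
from sklearn.metrics import mean_absolute_error, mean_squared_error

# Toy values for illustration only
y_true = np.array([3.0, 5.0, 2.0, 7.0])
y_pred = np.array([2.0, 5.0, 4.0, 8.0])

# Library route
mae = mean_absolute_error(y_true, y_pred)
rmse = np.sqrt(mean_squared_error(y_true, y_pred))  # RMSE is the square root of MSE

# The same quantities written out by hand, following the definitions
mae_manual = np.mean(np.abs(y_true - y_pred))
rmse_manual = np.sqrt(np.mean((y_true - y_pred) ** 2))

print('Mean Absolute Error:', mae, mae_manual)
print('Root Mean Squared Error:', rmse, rmse_manual)
```

Inside this notebook the solution cell binds the names `mean_absolute_error` and `mean_squared_error` to numbers, so it is safer to call the library versions through the already imported module, for example `metrics.mean_absolute_error(Y_test, Y_pred)`.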
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Note:** Visualisation is an important task. We want to see if two similar samples are predicted with two close labels. To evaluate how similar two samples are, we can compute their Euclidean distance and to evaluate how close the two labels are, we just need to compute their absolute difference.\n",
"\n",
"The function below returns a Euclidean distance matrix whose element (i, j) stores the Euclidean distance between X[i] and X[j]. **You will need this function for a subsequent task.**"
]
},
{
"cell_type": "code",
"execution_count": 25,
"metadata": {},
"outputs": [],
"source": [
"def compute_euclidean_distance_matrix(X):\n",
"    n_samples = X.shape[0]\n",
"    \n",
"    # initialise the distance matrix and set all values to 0\n",
"    euclidean_distance_matrix = np.zeros([n_samples, n_samples], dtype=float)\n",
"    \n",
"    # compute the Euclidean distance matrix\n",
"    for i in range(n_samples):\n",
"        for j in range(n_samples):\n",
"            euclidean_distance_matrix[i, j] = np.sqrt(np.sum((X[i] - X[j]) ** 2))\n",
"    \n",
"    return euclidean_distance_matrix"
]
},
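**Note:** The double loop above is easy to read but visits every pair of samples in Python. A vectorised sketch using NumPy broadcasting should give the same matrix; `euclidean_distance_matrix_vectorised` is a hypothetical name, not part of the notebook, and the points are a toy example.

```python
import numpy as np

def euclidean_distance_matrix_vectorised(X):
    """Pairwise Euclidean distances via broadcasting (hypothetical helper)."""
    X = np.asarray(X, dtype=float)
    # diff[i, j, :] = X[i] - X[j], shape (n_samples, n_samples, n_features)
    diff = X[:, np.newaxis, :] - X[np.newaxis, :, :]
    # sum the squared differences over the feature axis, then take the root
    return np.sqrt(np.sum(diff ** 2, axis=-1))

# Toy points forming a 3-4-5 right triangle
points = np.array([[0.0, 0.0], [3.0, 0.0], [3.0, 4.0]])
D = euclidean_distance_matrix_vectorised(points)
print(D)
```

Broadcasting builds an intermediate array of shape (n_samples, n_samples, n_features), so this trades memory for speed; for a test set of 133 samples with 10 features that array is small.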
{
"cell_type": "markdown",
"metadata": {},
"source": [
"**Instruction 1.9**. The following code segment is designed to visualise the Euclidean distances of the samples and the absolute differences of their predicted labels.\n",
"\n",
"**Your tasks are:**\n",
"\n",
"1. Construct a function that returns a matrix of absolute differences of the prediction **Y_pred** whose element (i, j) stores the absolute difference between Y_pred[i] and Y_pred[j]. (4 marks)\n",
"\n",
"2. Compute the absolute difference matrix for **Y_pred** and visualise the matrix. (4 marks)\n",
"\n",
"3. Compute the Euclidean distance matrix for **X_test** using compute_euclidean_distance_matrix() and visualise the matrix. (4 marks)\n",
"\n",
"[**Total mark: 12**]"
]
},
{
"cell_type": "code",
"execution_count": 26,
"metadata": {},
"outputs": [],
"source": [
"# YOU ARE REQUIRED TO INSERT YOUR CODES IN THIS CELL\n",
"'''\n",
"1. Construct a function that returns a matrix of absolute difference of \n",
" the prediction **Y_pred** whose element (i, j) stores \n",
" the absolute difference between Y_pred[i] and Y_pred[j].\n",
"'''\n",
"def compute_abs_difference_matrix(Y):\n",
"    # compute the absolute difference matrix\n",
"    # and remember to return the matrix\n",
"    \n",
"    # initialise the absolute difference matrix and set all values to 0\n",
"    absolute_diff_matrix = np.zeros([Y.shape[0], Y.shape[0]], dtype = float)\n",
"    \n",
"    # compute the absolute difference matrix\n",
"    for i in range(Y.shape[0]):\n",
"        for j in range(Y.shape[0]):\n",
"            absolute_diff_matrix[i, j] = abs(Y[i] - Y[j])\n",
"    \n",
"    return absolute_diff_matrix"
]
},
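**Note:** The absolute difference matrix can also be written without explicit loops. The sketch below subtracts a column view of the predictions from a row view, so element (i, j) is |y[i] - y[j]|. The name `abs_difference_matrix_vectorised` is hypothetical and the predictions are made-up toy values.

```python
import numpy as np

def abs_difference_matrix_vectorised(y):
    """Matrix whose element (i, j) is |y[i] - y[j]| (hypothetical helper)."""
    y = np.asarray(y, dtype=float)
    # column vector minus row vector broadcasts to an (n, n) matrix
    return np.abs(y[:, np.newaxis] - y[np.newaxis, :])

# Toy predictions for illustration only
toy_pred = np.array([10.0, 12.5, 7.0])
A = abs_difference_matrix_vectorised(toy_pred)
print(A)
```

The result is symmetric with zeros on the diagonal, which is a quick sanity check before passing the matrix to `plt.imshow`.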
{
"cell_type": "code",
"execution_count": 84,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"[[0. 0. 1. ... 1. 1. 0.]\n",
" [0. 0. 1. ... 1. 1. 0.]\n",
" [1. 1. 0. ... 0. 0. 1.]\n",
" ...\n",
" [1. 1. 0. ... 0. 0. 1.]\n",
" [1. 1. 0. ... 0. 0. 1.]\n",
" [0. 0. 1. ... 1. 1. 0.]]\n"
]
},
{
"data": {
"image/png": "\n",
"text/plain": [
"[Figure: absolute difference matrix shown with imshow, image data not preserved]"
]
},
"metadata": {
"needs_background": "light"
},
"output_type": "display_data"
}
],
"source": [
"# YOU ARE REQUIRED TO INSERT YOUR CODES IN THIS CELL\n",
"'''\n",
"2. Compute the absolute difference matrix for Y_pred and visualise the matrix\n",
"Hint: You might want to use imshow function.\n",
"'''\n",
"\n",
"# compute the absolute difference matrix\n",
"abs_difference_matrix = compute_abs_difference_matrix(Y_pred)\n",
"print(abs_difference_matrix)\n",
"\n",
"# visualise the matrix\n",
"fig = plt.figure(figsize=(10,10))\n",
"plt.imshow(abs_difference_matrix)\n",
"plt.show()"
]
},
{
"cell_type": "code",
"execution_count": 85,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"[[0. 3.74165739 9. ... 7.93725393 9.89949494 5.29150262]\n",
" [3.74165739 0. 7.54983444 ... 6.8556546 8.1240384 4.24264069]\n",
" [9. 7.54983444 0. ... 5.47722558 3.31662479 9.64365076]\n",
" ...\n",
" [7.93725393 6.8556546 5.47722558 ... 0. 5.56776436 7.81024968]\n",
" [9.89949494 8.1240384 3.31662479 ... 5.56776436 0. 9.38083152]\n",
" [5.29150262 4.24264069 9.64365076 ... 7.81024968 9.38083152 0. ]]\n"
]
},
{
"data": {
"image/png":...