The dataset Education - Post 12th Standard.csv contains information on various colleges. You are expected to do a Principal Component Analysis for this case study according to the instructions given....

1 answer below »

View more »
Answered 59 days AfterMay 25, 2022

Answer To: The dataset Education - Post 12th Standard.csv contains information on various colleges. You are...

Vishali answered on Jul 23 2022
89 Votes
First of all load the dataset and check shape and data type of variables.
Here, we have 777 rows a
nd 18 columns
Next type is to see summary of our data using describe function.
In order to check whether the data is normally distributed or not, we use distplot
1. Now, we will check normality of data using skewness.
2. Skewness =0 means data is normally distributed, if it is >0 it is left skewed and if it < 0 it is skewed towards right.
df.skew(axis=0,skipna=True)
3. For multivariate Analysis, we plot heatmap and check correlation using df.corr()
sns.heatmap(df.corr(),annot=True)
In order to do scaling we need to remove outliers. It is done to keep data on one common scale. It is kind of data pre processing which can be applied to independent variables or features of data. Another Calculations can also be speed up using scaling.
We have one column with object data type, so we need to drop that...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here
April
January
February
March
April
May
June
July
August
September
October
November
December
2025
2025
2026
2027
SunMonTueWedThuFriSat
30
31
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
1
2
3
00:00
00:30
01:00
01:30
02:00
02:30
03:00
03:30
04:00
04:30
05:00
05:30
06:00
06:30
07:00
07:30
08:00
08:30
09:00
09:30
10:00
10:30
11:00
11:30
12:00
12:30
13:00
13:30
14:00
14:30
15:00
15:30
16:00
16:30
17:00
17:30
18:00
18:30
19:00
19:30
20:00
20:30
21:00
21:30
22:00
22:30
23:00
23:30