IS6052 ‐ Descriptive and Predictive Analytics 2022‐2023 Individual CA Project Due Date: Thursday December 8th Submit your project report as a single pdf file on Canvas Loan Appr...

1 answer below »

View more »
Answered 13 days AfterDec 01, 2022

Answer To: IS6052 ‐ Descriptive and Predictive Analytics 2022‐2023 Individual CA Project Due Date:...

Subhanbasha answered on Dec 15 2022
50 Votes
FNB Bank – prediction model
                 Data and summary
The data collected is about the customers of banks whether they write off or not. The data is from the FNB bank. Here I used descriptive and predictive analytics to find out th
e pattern from the past behaviour of the customers and used it to predict new customers whether we proceed to give loans or not.
    So, by observing the collected data there are some numerical and categorical variables. The total number of customer data is 40000 and we have a total of 18 columns where these are the main inputs to the model. The following are the type of variables that they have in the data.
Character variables:
· Gender
· marital_status
· education
· employ_status
· spouse_work
· residential_status
· loan_purpose
· collateral
· writeoff
Numerical/integer variables:
· age
· nb_depend_child
· yrs_current_job
· yrs_employed
· net_income
· spouse_income
· yrs_current_address
· loan_amount
· loan_length
In the data, the age variable has a minimum age of 20 and maximum age is 65. The maximum number of dependent children is 3 and in years of current job is 25 years. The average net income is 42956 and the maximum is 178500. The average spouse’s income is 10266 and the maximum income is 167298. The average loan amount is 30702 and the maximum is 272343. The average loan tenure is 37 months the maximum is 96 months.
                Prediction Methods
Here we have the previous data about the customers of various characteristics geographical features and personal details. We can use here various machine learning algorithm models to make the model and predict. Here mainly the problem is a classification problem, so we use classification algorithms.
The methods used for prediction is as follows
· Decision Trees
· Random Forest
· Naïve Bayes
· Support Vector Machine
Decision Trees:
Pros:
· It will take less time and effort to create the algorithm and called straight forward algorithm
· It can be easily understood by the users.
· The algorithm does not require the scaling of the raw data it will handle itself only
· The missing values also not reflect in the model.
Cons:
· It will change drastically while we are changing data for the small part
· Decision tree algorithm not recommended for the continuous variables.
· This will take some time to make...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here
April
January
February
March
April
May
June
July
August
September
October
November
December
2025
2025
2026
2027
SunMonTueWedThuFriSat
30
31
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
1
2
3
00:00
00:30
01:00
01:30
02:00
02:30
03:00
03:30
04:00
04:30
05:00
05:30
06:00
06:30
07:00
07:30
08:00
08:30
09:00
09:30
10:00
10:30
11:00
11:30
12:00
12:30
13:00
13:30
14:00
14:30
15:00
15:30
16:00
16:30
17:00
17:30
18:00
18:30
19:00
19:30
20:00
20:30
21:00
21:30
22:00
22:30
23:00
23:30