(a) Applyk-NN, Naïve Bayes, and Classification Trees (use GridsearchCV on training data coupled with cross-validation) to classify a loan application as a “lower risk” (approve) or “higher risk” (deny), using appropriate predictors. Partition the data into training (60%) and validation (40%) sets. Normalize data where it’s appropriate. Find the bestkfork-NN. Report classification accuracy rate for both training and validation data. Produce the lift and gains charts for all classifiers.
(b) Repartition the data into training, validation, and test sets (50%:30%:20%). Apply thek-NN classifier with thekchosen using the validation set. Compare the confusion matrix of the test set with that of the training and validation sets.
5 rows × 35 columns
5 rows × 21 columns
Already registered? Login
Not Account? Sign up
Enter your email address to reset your password
Back to Login? Click here