Business Understanding: What is the business context? What is the analytics question? Data Understanding: How many variables? Is there a target variable (supervised learning or unsupervised)? If so,...




Business Understanding: What is the business context? What is the analytics question? Data Understanding: How many variables? Is there a target variable (supervised learning or unsupervised)? If so, which one? What are the data types? How many data records? Data Preparation/Preprocessing: Renaming? Missing values? Transformation? Model Building: You can use RapidMiner to build the Decision Tree Model as a starting model. First, split your dataset to have about ~400 records in model building (Split Data operator). Second, use the "Cross-Validation" subprocess to set up a Decision Tree Model based on these records. Report the model training/learning accuracy. Model Testing: Use the hold-out 25 records for model testing. You must use the "apply model" operator to take the model in step 4 and the hold-out 25 data as testing data. Report the testing accuracy. Is it very different from the training accuracy? Model Evaluation (1): Compare your Decision Tree Model with other options: kNN, Naive Bayes, and Random Forrest. Configure the operator parameters as needed. Report their corresponding training and test accuracy results. Model Evaluation (2): Compare your Decision Tree Model with more options: SVM, Neural Nets, GLM, and Logistic Regression. Note that you must do some data type conversions to use these models because they take numerical inputs. Report their corresponding training and test accuracy results.

















Mar 29, 2023
SOLUTION.PDF

Get Answer To This Question

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here