Because x and y are completely independent, you should not be able to predict y using x with accuracy larger than 0.5. Confirm this by running cross-validation using logistic regression to fit the model. Because we have so many predictors, we selected a random sample x_subset. Use the subset when training the model. Hint: use the caret train function. The results component of the output of train shows you the accuracy. Ignore the warnings.
Already registered? Login
Not Account? Sign up
Enter your email address to reset your password
Back to Login? Click here