I have the following 3 questions regarding that process:
- What percent of my dataset is separated into a the training data set vs the test data set?
- Is the test data set assembled randomly or shuffled?
- Is the test data set added back to my training data set after the accuracy score is calculated?