Top "Train-test-split" questions

Questions with this tag are about how to split a machine learning data set into random train and test subsets.
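
As a rough illustration of what a random train/test split does, here is a minimal sketch that uses only the Python standard library. The helper name `split_train_test` and its parameters `test_fraction` and `seed` are made up for this example and are not the API of scikit-learn or any other library; the questions below mostly concern library functions such as scikit-learn's `train_test_split`.

```python
import random


def split_train_test(rows, test_fraction=0.25, seed=0):
    """Shuffle a copy of rows and cut it into (train, test) lists.

    Hypothetical helper for illustration only, not a library function.
    """
    if not 0.0 < test_fraction < 1.0:
        raise ValueError("test_fraction must be between 0 and 1")
    shuffled = list(rows)                   # copy so the input is not mutated
    random.Random(seed).shuffle(shuffled)   # seeded, so the split is repeatable
    n_test = max(1, round(len(shuffled) * test_fraction))
    # The first n_test shuffled rows become the test set, the rest the train set.
    return shuffled[n_test:], shuffled[:n_test]


samples = list(range(20))
train, test = split_train_test(samples, test_fraction=0.25, seed=42)
print(len(train), len(test))
```

Fixing the seed keeps the split reproducible between runs, and because the two subsets are cut from one shuffled copy, no row ends up in both.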

Keras split train test set when using ImageDataGenerator

I have a single directory which contains sub-folders (according to labels) of images. I want to split this data into …

keras train-test-split
Singleton array array(<function train at 0x7f3a311320d0>, dtype=object) cannot be considered a valid collection

Not sure how to fix this. Any help is much appreciated. I saw this: Vectorization: Not a valid collection, but not sure …

python pandas scikit-learn pipeline train-test-split
scikit-learn error: The least populated class in y has only 1 member

I'm trying to split my dataset into a training and a test set by using the train_test_split function …

python scikit-learn train-test-split
How to perform k-fold cross validation with tensorflow?

I am following the IRIS example of tensorflow. My case now is I have all data in a single CSV …

python tensorflow cross-validation train-test-split
Splitting data using time-based splitting in test and train datasets

I know that train_test_split splits it randomly, but I need to know how to split it based on …

python scikit-learn timestamp train-test-split
Normalize data before or after split of training and testing data?

I want to separate my data into train and test sets. Should I apply normalization to the data before or after …

machine-learning data-science normalization training-data train-test-split
Spark train test split

I am curious if there is something similar to sklearn's http://scikit-learn.org/stable/modules/generated/sklearn.model_selection.StratifiedShuffleSplit.…

apache-spark apache-spark-mllib train-test-split
Split image dataset into train-test datasets

So I have a main folder which contains sub-folders, which in turn contain images for the dataset, as follows. -main_…

python-3.x training-data train-test-split
train_test_split( ) method of scikit learn

I am trying to create a machine learning model using DecisionTreeClassifier. To train & test my data I imported train_…

python python-3.x machine-learning scikit-learn train-test-split
Do I have to do one-hot-encoding separately for train and test dataset?

I'm working on a classification problem and I've split my data into train and test sets. I have a few categorical …

python machine-learning one-hot-encoding train-test-split