Top "Cross-validation" questions

Cross-Validation is a method of evaluating and comparing predictive systems in statistics and machine learning.

How to cross validate RandomForest model?

I want to evaluate a random forest being trained on some data. Is there any utility in Apache Spark to …

apache-spark random-forest cross-validation apache-spark-ml apache-spark-mllib
Using explicit (predefined) validation set for grid search with sklearn

I have a dataset, which has previously been split into 3 sets: train, validation and test. These sets have to be …

python validation scikit-learn cross-validation
Cross-validating a CART model

In an assignment, we are asked to perform a cross-validation on a CART model. I have tried using the cvFit …

r cross-validation rpart
How to split data (raw text) into test/train sets with scikit crossvalidation module?

I have a large corpus of opinions (2500) in raw text. I would like to use scikit-learn library to split them …

machine-learning scikit-learn classification cross-validation text-classification
Put customized functions in Sklearn pipeline

In my classification scheme, there are several steps including: SMOTE (Synthetic Minority Over-sampling Technique) Fisher criteria for feature selection Standardization (…

machine-learning scikit-learn pipeline cross-validation feature-selection
Deprecation warnings from sklearn

I am using cross_validation from sklearn, from sklearn.cross_validation import train_test_split I get the below warning: …

scikit-learn cross-validation
ValueError: n_splits=10 cannot be greater than the number of members in each class

I am trying to run the following code: from sklearn.model_selection import StratifiedKFold X = ["hey", "join now", "hello", "join …

python scikit-learn cross-validation
return coefficients from Pipeline object in sklearn

I've fit a Pipeline object with RandomizedSearchCV pipe_sgd = Pipeline([('scl', StandardScaler()), ('clf', SGDClassifier(n_jobs=-1))]) param_dist_sgd = {…

python scikit-learn pipeline cross-validation
Is there a simple command to do leave-one-out cross validation with the lm() function?

Is there a simple command to do leave-one-out cross validation with the lm() function in R? Specifically is there a …

r cross-validation lm
How to extract best parameters from a CrossValidatorModel

I want to find the parameters of ParamGridBuilder that make the best model in CrossValidator in Spark 1.4.x, In Pipeline …

scala apache-spark pipeline cross-validation apache-spark-mllib