Top "Feature-selection" questions

In machine learning, this is the process of selecting a subset of most relevant features to construction your data model.

Should Feature Selection be done before Train-Test Split or after?

Actually, there is a contradiction of 2 facts that are the possible answers to the question: The conventional answer is to …

machine-learning feature-selection train-test-split
How is feature importance calculated for GradientBoostingClassifier

I'm using scikit-learn's gradient-boosted trees classifier, GradientBoostingClassifier. It makes feature importance score available in feature_importances_. How are these feature …

python machine-learning scikit-learn feature-selection
Does R randomForest's rfcv method actually say which features it selected, or not?

I would like to use rfcv to cull the unimportant variables from a data set before creating a final random …

r machine-learning classification random-forest feature-selection
Multi-label feature selection using sklearn

I'm looking to perform feature selection with a multi-label dataset using sklearn. I want to get the final set of …

python-2.7 machine-learning scikit-learn feature-selection multilabel-classification
Logistic Regression: How to find top three feature that have highest weights?

I am working on UCI breast cancer dataset and trying to find the top 3 features that have highest weights. I …

python machine-learning scikit-learn logistic-regression feature-selection
XGBoost plot importance has no property max_num_features

xgboost's plotting API states: xgboost.plot_importance(booster, ax=None, height=0.2, xlim=None, ylim=None, title='Feature importance', xlabel='F …

python feature-selection xgboost