Top "Scikit-learn" questions

scikit-learn is a machine-learning library for Python that provides simple and efficient tools for data analysis and data mining, with a focus on machine learning.

Different result with roc_auc_score() and auc()

I have trouble understanding the difference (if there is one) between roc_auc_score() and auc() in scikit-learn. Im tying …

python machine-learning scikit-learn
Understanding min_df and max_df in scikit CountVectorizer

I have five text files that I input to a CountVectorizer. When specifying min_df and max_df to the …

python machine-learning scikit-learn nlp
tf-idf feature weights using sklearn.feature_extraction.text.TfidfVectorizer

this page: http://scikit-learn.org/stable/modules/feature_extraction.html mentions: As tf–idf is a very often used for …

python scikit-learn tf-idf
Using the predict_proba() function of RandomForestClassifier in the safe and right way

I'm using Scikit-learn to apply machine learning algorithm on my data sets. Sometimes I need to have the probabilities of …

python machine-learning scikit-learn random-forest
How to graph grid scores from GridSearchCV?

I am looking for a way to graph grid_scores_ from GridSearchCV in sklearn. In this example I am trying …

python machine-learning scikit-learn grid-search
How to use the a k-fold cross validation in scikit with naive bayes classifier and NLTK

I have a small corpus and I want to calculate the accuracy of naive Bayes classifier using 10-fold cross validation, …

python scikit-learn nltk cross-validation naivebayes
Scikit-learn cross validation scoring for regression

How can one use cross_val_score for regression? The default scoring seems to be accuracy, which is not very …

python scikit-learn regression
Anaconda Python - how to reinstall NumPy

I am using Anaconda 5.3.0's Python interpreter in Visual Studio Code. When I try to import sklearn I get an …

python numpy scikit-learn anaconda conda
Plot trees for a Random Forest in Python with Scikit-Learn

I want to plot a decision tree of a random forest. So, i create the following code: clf = RandomForestClassifier(n_…

python tree scikit-learn random-forest pydot
How to get most informative features for scikit-learn classifiers?

The classifiers in machine learning packages like liblinear and nltk offer a method show_most_informative_features(), which is really …

python machine-learning classification scikit-learn