Top "Categorical-data" questions

Statistical data type whose value is one of a fixed number of nominal categories.

How do I run the Spark decision tree with a categorical feature set using Scala?

I have a feature set with a corresponding categoricalFeaturesInfo: Map[Int,Int]. However, for the life of me I cannot …

scala apache-spark tree apache-spark-mllib categorical-data
How to apply custom column order (on Categorical) to pandas boxplot?

EDIT: this question arose back with pandas ~0.13 and was obsoleted by direct support somewhere between version 0.15-0.18 (as per @Cireo's …

python pandas boxplot categorical-data
Interpretation of ordered and non-ordered factors, vs. numerical predictors in model summary

I have fitted a model where: Y ~ A + A^2 + B + mixed.effect(C) Y is continuous A is continuous B …

r statistics modeling categorical-data
Mosaic plot with labels in each box showing a name and percentage of all observations

I would like to create a mosaic plot (R package vcd, see e.g. http://cran.r-project.org/web/packages/…

r plot categorical-data
Pandas DataFrame sort by categorical column but by specific class ordering

I would like to select the top entries in a Pandas dataframe base on the entries of a specific column …

python-2.7 sorting pandas dataframe categorical-data
Python equivalent of daisy() in the cluster package of R

I have a dataset that contains both categorical (nominal and ordinal) and numerical attributes. I want to calculate the (dis)…

python r similarity categorical-data r-daisy
In gbm multinomial dist, how to use predict to get categorical output?

My response is a categorical variable (some alphabets), so I used distribution='multinomial' when making the model, and now I …

r machine-learning categorical-data multinomial gbm
Pandas: get_dummies vs categorical

I have a dataset which has a few columns with categorical data. I've been using the Categorical function to replace …

python pandas categorical-data dummy-data
pandas Categorical error: "Cannot setitem on a Categorical with a new category, set the categories first"

I have the following df data frame in pandas: weekday venta_total_cy 0 Viernes 5.430211e+09 1 Lunes 3.425554e+09 2 Sabado 6.833202e+09 3 Domingo 6.566466…

python pandas categorical-data
Linear Regression with sklearn using categorical variables

I am trying to run a usual linear regression in Python using sk-learn, but I have some categorical data that …

python scikit-learn linear-regression categorical-data