Top "Categorical-data" questions

Statistical data type whose value is one of a fixed number of nominal categories.

Reveal k-modes cluster features

I'm performing a cluster analysis on categorical data, hence using k-modes approach. My data is shaped as a preference survey: …

scikit-learn k-means categorical-data
Feature preprocessing of both continuous and categorical variables (of integer type) with scikit-learn

The main goals are as follows: Apply StandardScaler to continuous variables Apply LabelEncoder and OnehotEncoder to categorical variables The continuous …

python pandas machine-learning scikit-learn categorical-data
How to generate pandas DataFrame column of Categorical from string column?

I can convert a pandas string column to Categorical, but when I try to insert it as a new DataFrame …

pandas categorical-data
Trouble training xgboost on categorical column

I am trying to run a Python notebook (link). At line below In [446]: where author train XGBoost, I am getting …

python xgboost categorical-data
line graph with 2 categorical variables and 1 continuous in R

I'm quite new to R and statistics in general. I am trying to plot in a line graph 2 categorical variables (…

r ggplot2 categorical-data linegraph
Legend of a raster map with categorical data

I would like to plot a raster containing 4 different values (1) with a categorical text legend describing the categories such as 2 …

r map legend raster categorical-data
Combining factor levels in data frame column

I have a data frame data with a column, named "Project License", which represents a categorical variable, and, thus, in …

r taxonomy categorical-data merging-data
How to use formula in R to exclude main effect but retain interaction

I do not want main effect because it is collinear with a finer factor fixed effect, so it is annoying …

r regression linear-regression lm categorical-data
Pandas dataframe encode Categorical variable with thousands of unique values

I have a dataframe about data on schools for a few thousands cities. The school is the row identifier and …

python pandas categorical-data
How to transform a categorical variable in Spark into a set of columns coded as {0,1}?

I'm trying to perform a logistic regression (LogisticRegressionWithLBFGS) with Spark MLlib (with Scala) on a dataset which contains categorical variables. …

scala apache-spark bigdata apache-spark-mllib categorical-data