Top "Data-science" questions

Implementation questions about data science.

Find the "elbow point" on an optimization curve with Python

I have a list of points, which are the inertia values of a k-means algorithm. To determine the optimum amount …

python numpy scikit-learn data-science
Pandas Fillna of Multiple Columns with Mode of Each Column

Working with census data, I want to replace NaNs in two columns ("workclass" and "native-country") with the respective modes of …

python pandas numpy data-science
Best way to subset a pandas dataframe

Hey, I'm new to Pandas and I just came across df.query(). Why would people use df.query() when you …

python pandas dataframe data-science
How to optimize MAPE code in Python?

I need a MAPE function; however, I was not able to find one in the standard packages ... Below, my …

python numpy machine-learning statistics data-science
pickle data was truncated

I created a corpus file, then stored it in a pickle file. My messages file is a collection of different news …

python data-science pickle data-science-experience
Scoring in GridSearchCV

I just started with GridSearchCV in Python, but I am confused about what scoring means in it. Somewhere I have seen …

python machine-learning data-science grid-search
Quantile normalization on a pandas dataframe

Simply put, how can I apply quantile normalization to a large Pandas dataframe (probably 2,000,000 rows) in Python? P.S. I know that …

python deep-learning data-science
Invalid Length Argument

I want to convert all the columns of my dataframe to numeric format. So I use lapply data.frame(lapply(…

r dataframe data-science sapply
Plotly missing orca

I have a small problem when exporting a static chart using plotly. Plotly does not correctly recognize that I have orca installed …

python plotly data-science orca
Pandas drop rows vs filter

I have a pandas dataframe and want to get rid of rows in which the column 'A' is negative. I …

python pandas dataframe data-science