Top "Data-analysis" questions

Data Analysis involves extracting meaning and insights from raw data.

Plotting results of Pandas GroupBy

I'm starting to learn Pandas and am trying to find the most Pythonic (or panda-thonic?) ways to do certain tasks. …

python matplotlib group-by pandas data-analysis
Why does one hot encoding improve machine learning performance?

I have noticed that when one-hot encoding is used on a particular data set (a matrix) and used as …

machine-learning data-mining scikit-learn data-analysis
Processing a very very big data set in python - memory error

I'm trying to process data obtained from a CSV file using the csv module in Python. There are about 50 columns & 401125 …

python numpy python-2.7 data-analysis
How can I convert JSON to a data frame in R?

I'd like to convert my JSON data to a data frame in R. Here is what I've done so far: library("…

json r data-analysis
Getting Probability Density of Data

I need to analyze some data about internet sessions for a DSL Line. I wanted to have a look at …

r plot probability distribution data-analysis
How to stop pandas from converting large numbers in an Excel sheet to exponential notation?

In the Excel sheet, I have two columns with large numbers. But when I read the Excel file with read_…

python pandas machine-learning data-analysis
python pandas: how to calculate derivative/gradient

Given that I have the following two vectors:
In [99]: time_index
Out[99]: [1484942413, 1484942712, 1484943012, 1484943312, 1484943612, 1484943912, 1484944212, 1484944511, 1484944811, 1484945110]
In [100]: bytes_in
Out[100]: [1293981210388, 1293981379944, 1293981549960, 1293981720866, 1293981890968, 1293982062261, 1293982227492, 1293982391244, 1293982556526, 1293982722320]
Where bytes_in is …

python pandas data-analysis
How to find the closest word to a vector using word2vec

I have just started using Word2vec and I was wondering how can we find the closest word to a …

python text-mining data-analysis word2vec
Set x-axis intervals (ticks) for graph of Pandas DataFrame

I'm trying to set the ticks (time-steps) of the x-axis on my matplotlib graph of a Pandas DataFrame. My goal …

python matplotlib pandas data-analysis
What to do with missing values when plotting with seaborn?

I replaced the missing values with NaN using the following lambda function: data = data.applymap(lambda x: np.nan if isinstance(…

python python-2.7 pandas data-analysis seaborn