Top "Bigdata" questions

Big data is a concept that deals with data sets of extreme volumes.

Strategies for reading in CSV files in pieces?

I have a moderate-sized file (4GB CSV) on a computer that doesn't have sufficient RAM to read it in (8GB …

r bigdata
sklearn and large datasets

I have a dataset of 22 GB. I would like to process it on my laptop. Of course I can't load …

python bigdata scikit-learn
clustering very large dataset in R

I have a dataset consisting of 70,000 numeric values representing distances ranging from 0 till 50, and I want to cluster these numbers; …

r machine-learning bigdata cluster-analysis data-mining
When do you start additional Elasticsearch nodes?

I'm in the middle of attempting to replace a Solr setup with Elasticsearch. This is a new setup, which has …

elasticsearch sharding bigdata
Find out actual disk usage in HDFS

Is there a way to find out how much space is consumed in HDFS? I used hdfs dfs -df but …

hadoop hdfs bigdata diskspace
Operation Time Out Error in cqlsh console of cassandra

I have a three nodes Cassandra Cluster and I have created one table which has more than 2,000,000 rows. When I …

java cassandra bigdata datastax cqlsh
Edge nodes in hadoop cluster

Can Some one explain me the architecture of Edge node in hadoop. I am able to find only the definition …

hadoop bigdata
How to get started with Big Data Analysis

I've been a long time user of R and have recently started working with Python. Using conventional RDBMS systems for …

python r hadoop bigdata
Is there something like Redis DB, but not limited with RAM size?

I'm looking for a database matching these criteria: May be non-persistent; Almost all keys of DB need to be updated …

database redis nosql bigdata
What is the best way to load huge result set in memory?

I am trying to load 2 huge resultsets(source and target) coming from different RDBMS but the problem with which i …

c# ado.net bigdata datareader