Top "Bigdata" questions

Big data is a concept that deals with data sets of extreme volumes.

MySQL Cluster vs. Hadoop for handling big data

I want to know the advantages/disadvantages of using a MySQL Cluster and using the Hadoop framework. What is the …

hadoop mapreduce hive bigdata mysql-cluster
How to return large amount of rows from mongodb using node.js http server?

I have a user database in mongodb which I would like to export via a REST interface in JSON. The …

node.js mongodb rest http bigdata
what is the basic difference between jobconf and job?

hi i wanted to know the basic difference between jobconf and job objects,currently i am submitting my job like …

hadoop mapreduce bigdata
What are the limitations of implementing MySQL NDB Cluster?

I want to implement NDB Cluster for MySQL Cluster 6. I want to do it for very huge data structure with …

mysql cluster-computing bigdata mysql6
How can I add a column with a value to a new Dataset in Spark Java?

So, I'm creating some Datasets from the java Spark API. These datasets are populated from hive table, using the spark.…

java apache-spark dataset apache-spark-dataset bigdata
sorting large text data

I have a large file (100 million lines of tab separated values - about 1.5GB in size). What is the fastest …

python sorting bigdata
How to process a range of hbase rows using spark?

I am trying to use HBase as a data source for spark. So the first step turns out to be …

java hadoop bigdata apache-spark
Send KafkaProducer from local machine to hortonworks sandbox on virtualbox

I have a really simple producer that I am running through eclipse on my windows local machine... What I really …

hadoop bigdata apache-kafka hortonworks-data-platform
Reading big data with fixed width

How can I read big data formated with fixed width? I read this question and tried some tips, but all …

r bigdata
AWS S3 Sync very slow when copying to large directories

When syncing data to an empty directory in S3 using AWS-CLI, it's almost instant. However, when syncing to a large …

amazon-web-services amazon-s3 aws-cli bigdata