Top "Bigdata" questions

Big data is a concept that deals with data sets of extreme volumes.

scala.reflect.internal.MissingRequirementError: object java.lang.Object in compiler mirror not found

I am trying to build a Spark Streaming application using sbt package, but I can't work out the cause of this error. This …

scala apache-spark bigdata
Determining optimal number of Spark partitions based on workers, cores and DataFrame size

There are several similar-yet-different concepts in Spark-land surrounding how work gets farmed out to different nodes and executed concurrently. Specifically, …

apache-spark spark-dataframe distributed-computing partitioning bigdata
SolrException Plugin init failure for [schema.xml] fieldType "pint": Error loading class 'solr.IntField'

I am getting this error: collection1: org.apache.solr.common.SolrException:org.apache.solr.common.SolrException: Could not load conf …

apache solr tomcat7 bigdata solr-schema
Any good tools to make 3D data visualizations for Big Data?

I don't know if you have seen this video: http://www.youtube.com/watch?v=Q9wcvFkWpsM. I wonder what …

3d data-visualization bigdata graph-visualization
Number of reducers in Hadoop

While learning Hadoop, I found the number of reducers very confusing: 1) The number of reducers is the same as the number of partitions. 2) …

hadoop mapreduce hadoop2 reducers bigdata
Hadoop NodeManager and ResourceManager not starting

I am trying to set up the latest Hadoop 2.2 single-node cluster on Ubuntu 13.10 64-bit. The OS is a fresh installation, …

hadoop bigdata yarn ubuntu-13.10
Load a small random sample from a large CSV file into an R data frame

The CSV file to be processed does not fit into memory. How can one read ~20K random lines of …

r csv random dataframe bigdata
What are the differences between Sort Comparator and Group Comparator in Hadoop?

hadoop bigdata
Export large amount of data from Cassandra to CSV

I'm using Cassandra 2.0.9 to store fairly large amounts of data, say 100 GB, in one column family. I would like …

csv cassandra bigdata cassandra-2.0
Casting date in Talend Data Integration

In a data flow from one table to another, I would like to cast a date. The date leaves the …

date casting bigdata talend data-integration