Top "Bigdata" questions

Big data is a concept that deals with data sets of extreme volumes.

How do you ingest Spring boot logs directly into elastic

I’m investigating feasability of sending spring boot application logs directly into elastic search. Without using filebeats or logstash. I …

spring elasticsearch logging data-ingestion bigdata
100 TB of data on Mongo DB? Possible?

What kind of an architecture is needed to store 100 TB data and query it with aggregation? How many nodes? Disk …

mongodb hadoop vertica bigdata database
How to balance my data across the partitions?

Edit: The answer helps, but I described my solution in: memoryOverhead issue in Spark. I have an RDD with 202092 partitions, …

python hadoop apache-spark distributed-computing bigdata
How to output a file using tab delimiter in Netezza NZSQL

I am trying to output some files using NZSQL CLI but not able to output as tab delimited files. Can …

sql bigdata netezza nzsql
Best solution for finding 1 x 1 million set intersection? Redis, Mongo, other

Hi all and thanks in advance. I am new to the NoSQL game but my current place of employment has …

mongodb redis bigdata nosql
Pig - ERROR 1045: AVG as multiple or none of them fit. Please use an explicit cast

I have a comma seperated .txt file, I want to DUMP the AVG age of all Males. records = LOAD 'file:/…

hadoop mapreduce apache-pig bigdata
Machine Learning & Big Data

In the beginning, I would like to describe my current position and the goal that I would like to achieve. …

machine-learning bigdata