Top "Apache-pig" questions

Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs.

Flatten tuple like a bag

My dataset looks like the following: ( A, (1,2) ) ( B, (2,9) ) I would like to "flatten" the tuples in Pig, basically repeating each …

hadoop apache-pig flatten
ERROR 1066: Unable to open iterator for alias in Pig, Generic solution

A very common, error message in Apache Pig is: ERROR 1066: Unable to open iterator for alias There are several questions …

debugging apache-pig hortonworks-data-platform hdp
Pig Batch mode: how to set logging level to hide INFO log messages?

Using Apache Pig version 0.10.1.21 (rexported). When I execute a pig script, there are a lots of INFO logging lines which …

apache-pig
Hadoop, Hive, Pig, HBase, Cassandra - when to use what?

First of all I am relatively new to Big Data and the Hadoop world and I have just started to …

hadoop cassandra hive apache-pig
Usage of Apache Pig rank function

Am using Pig 0.11.0 rank function and generating ranks for every id in my data. I need ranking of my data …

apache-pig
How can I incorporate the current input filename into my Pig Latin script?

I am processing data from a set of files which contain a date stamp as part of the filename. The …

apache-pig
Still getting "Unable to load realm info from SCDynamicStore" after bug fix

I installed Hadoop and Pig using brew install hadoop and brew install pig. I read here that you will to …

hadoop apache-pig homebrew
Connection Error in Apache Pig

I am running Apache Pig .11.1 with Hadoop 2.0.5. Most simple jobs that I run in Pig work perfectly fine. However, whenever …

hadoop apache-pig
Transform bag of key-value tuples to map in Apache Pig

I am new to Pig and I want to convert a bag of tuples to a map with specific value …

map apache-pig
How to use Cassandra's Map Reduce with or w/o Pig?

Can someone explain how MapReduce works with Cassandra .6? I've read through the word count example, but I don't quite follow …

mapreduce cassandra apache-pig