Top "Apache-pig" questions

Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs.

Filtering null values with pig

It looks like a silly problem, but I can´t find a way to filter null values from my rows. …

hadoop apache-pig
Is there any Conditional IF like operator in Apache PIG?

Actually I am writing PIG Script and want to execute some set of statements if one of the condition is …

hadoop apache-pig
How to force STORE (overwrite) to HDFS in Pig?

When developing Pig scripts that use the STORE command I have to delete the output directory for every run or …

apache-pig hdfs
How do I parse JSON in Pig?

I have a lot of gzip'd log files in s3 that has 3 types of log lines: b,c,i. i …

json apache-pig
select count distinct using pig latin

I need help with this pig script. I am just getting a single record. I am selecting 2 columns and doing …

hadoop apache-pig
pig - how to reference columns in a FOREACH after a JOIN?

A = load 'a.txt' as (id, a1); B = load 'b.txt as (id, b1); C = join A by id, B …

apache-pig
ERROR 1066: Unable to open iterator for alias - Pig

Just started Pig; trying to load the data from a file and dump it henceforth. Loading seems to be proper, …

apache-pig
How can I use the map datatype in Apache Pig?

I'd like to use Apache Pig to build a large key -> value mapping, look things up in the …

syntax map hadoop apache-pig
How to get array/bag of elements from Hive group by operator?

I want to group by a given field and get the output with grouped fields. Below is an example of …

sql hadoop hive apache-pig bigdata
using PIG to load a file

I am very new to PIG and I am having what feels like a very basic problem. I have a …

hadoop apache-pig