Top "apache-pig" questions

Apache Pig is a platform for analyzing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs.

How to do an outer join on two columns in Pig Latin

I do outer joins on single columns in Pig like this: result = JOIN A by id LEFT OUTER, B by …

hadoop apache-pig
Pig vs Hive vs Native Map Reduce

I have a basic understanding of what the Pig and Hive abstractions are, but I don't have a clear idea of the scenarios that …

hadoop mapreduce hive apache-pig
Pig: Get top n values per group

I have data that's already grouped and aggregated; it looks like so:

    user   value     count
    ----   --------  ------
    Alice  third     5
    …

hadoop hdfs apache-pig
GUI for using Hadoop

Is there an easy way to use Hadoop other than with the command line? Which tools are you using and …

user-interface hadoop hive apache-pig hue
How to deploy and run an Oozie job?

I'm trying to run a simple job using Oozie. It will be one simple Pig action. I have a …

hadoop apache-pig oozie
Skipping the header while loading a text file using Pig Latin

I have a text file and its first row contains the header. Now I want to do some operation on …

hadoop apache-pig
Max/Min for whole sets of records in Pig

I have a set of records that I am loading from a file and the first thing I need …

hadoop apache-pig
Export from Pig to CSV

I'm having a lot of trouble getting data out of Pig and into a CSV that I can use in …

excel csv apache-pig
What is the difference between Apache Pig and Apache Hive?

What is the exact difference between Pig and Hive? I found that both have the same functional meaning because they are …

hadoop hive apache-pig
Calculate average using Pig

I am new to Pig and want to calculate the average of my single-column data, which looks like 0 10.1 20.1 30 40 50 60 70 80.1. I wrote …

hadoop apache-pig