Am new in the Apache Flume. I need to install the flume on top of the HDFS cluster environment. I did Google it, all are saying using the cloudera distribution but I need to install and configure from the source.
So can anyone please suggest me, where to start and how to customize the flume agent and sink services?
I have just installed Apache Flume 1.3 on Ubuntu.
You need to download the binary zip for your OS, extract it and create a config file which is similar to properties file in Java.
The installation and running of agents is a dumb/easy process, just read this