Open Source ETL framework

jchips12 picture jchips12 · Jan 18, 2012 · Viewed 21.1k times · Source

I was asked to prototype two ETL frameworks. The requirements are as follows:

  • Open Source
  • Available to Linux
  • Maintained
  • Logs can be viewed on web browser (nice to have)
  • Written in Perl, Python, Ruby or Java

The raw file can be anything (excel, csv, html page etc..) The target database is MySQL.

Dont just drop names, please indicate the advantages/disadvantages based from your experience.

Thanks!

Answer

Husain Basrawala picture Husain Basrawala · Jan 18, 2012

One of the most popular Java based ETL would be Talend.

Jaspersoft ETL is another one extended from Talend and has a nice eclipse based UI.