Minimum hardware requirements for Apache Airflow cluster

Duleendra picture Duleendra · Nov 14, 2017 · Viewed 7.8k times · Source

What are the minimum hardware requirements for setting up an Apache Airflow cluster.

Eg. RAM, CPU, Disk etc for different types of nodes in the cluster.

Answer

jastang picture jastang · Jan 23, 2018

I have had no issues using very small instances in pseudo-distributed mode (32 parallel workers; Postgres backend):

  • RAM 4096 MB
  • CPU 1000 MHz
  • VCPUs 2 VCPUs
  • Disk 40 GB

If you want distributed mode, you should be more than fine with that if you keep it homogenous. Airflow shouldn't really do heavy lifting anyways; push the workload out to other things (Spark, EMR, BigQuery, etc).

You will also have to run some kind of messaging queue, like RabbitMQ. I think they take Redis too. However, this doesn't really dramatically impact how you size.