Top "Apache-beam" questions

Apache Beam is a unified SDK for batch and stream processing.

What are the benefits of Apache Beam over Spark/Flink for batch processing?

Apache Beam supports multiple runner backends, including Apache Spark and Flink. I'm familiar with Spark/Flink and I'm trying to …

apache-spark apache-flink apache-beam
What is Apache Beam?

I was going through the Apache posts and found a new term called Beam. Can anybody explain what exactly Apache …

apache-beam
Apache Airflow or Apache Beam for data processing and job scheduling

I'm trying to give useful information but I am far from being a data engineer. I am currently using the …

pandas airflow apache-beam
Collecting output from Apache Beam pipeline and displaying it to console

I have been working on Apache Beam for a couple of days. I wanted to quickly iterate on the application …

apache-beam
How to convert csv into a dictionary in apache beam dataflow

I would like to read a csv file and write it to BigQuery using apache beam dataflow. In order to …

python csv google-bigquery google-cloud-dataflow apache-beam
Apache Beam : FlatMap vs Map?

I want to understand in which scenario that I should use FlatMap or Map. The documentation did not seem clear …

google-cloud-dataflow apache-beam
Apache Beam over Apache Kafka Stream processing

What are the differences between Apache Beam and Apache Kafka with respect to Stream processing? I am trying to grasp …

apache-kafka apache-beam apache-kafka-streams stream-processing
Connecting to Cloud SQL from Dataflow Job

I'm struggling to use JdbcIO with Apache Beam 2.0 (Java) to connect to a Cloud SQL instance from Dataflow within the …

google-cloud-sql google-cloud-dataflow apache-beam
Using Dataflow vs. Cloud Composer

I'd like to get some clarification on whether Cloud Dataflow or Cloud Composer is the right tool for the job, …

google-cloud-dataflow airflow apache-beam google-cloud-composer