Top "Etl" questions

ETL is an acronym for Extract, Transform, and Load.

How to add a new Struct column to a DataFrame

I'm currently trying to extract a database from MongoDB and use Spark to ingest into ElasticSearch with geo_points. The …

scala elasticsearch apache-spark etl apache-spark-sql
What is a staging table?

Are staging tables used only in Data warehouse project or in any SSIS Project? I would like to know what …

ssis etl data-warehouse staging-table
How to reconfigure the column information on a flat file connection manager?

I have a Flat File Source that is reading data from a flat file. We have recently added a new …

ssis etl dataflow
How to set dependencies between DAGs in Airflow?

I am using Airflow to schedule batch jobs. I have one DAG (A) that runs every night and another DAG (…

python etl airflow
How to extract data from Google Analytics and build a data warehouse (webhouse) from it?

I have click stream data such as referring URL, top landing pages, top exit pages and metrics such as page …

google-analytics data-warehouse etl
Missing SSIS toolbox in Visual Studio 2017

I am running SQL Server 2014 on my machine. I have installed SSDT for Visual Studio 2017, which I also have. Now …

sql-server visual-studio ssis etl sql-server-data-tools
SSIS: Code page goes back to 65001

In an SSIS package that I'm writing, I have a CSV file as a source. On the Connection Manager General …

sql-server csv encoding ssis etl
What is the required language knowledge to use Informatica effectively?

In the next few weeks, my company will be engaging multiple vendors to establish a choice for a common global …

etl talend informatica-powercenter
SSIS error: delimiter for column "columnX" is not found

This question was asked here before but the solutions proposed don't seem to be working for me. I'm trying to …

sql ssis etl sql-server-data-tools ssis-2012
Load CSV into Redshift, with header?

Is there an option to load a CSV into Redshift with a header? I see the documentation for CSV but …

csv amazon-web-services amazon-redshift etl