Top "Aws-glue" questions

AWS Glue is a fully managed ETL (extract, transform, and load) service that can categorize your data, clean it, enrich it, and move it between various data stores.

AWS Glue Crawler Not Creating Table

I have a crawler I created in AWS Glue that does not create a table in the Data Catalog after …

amazon-web-services aws-glue
Overwrite parquet files from dynamic frame in AWS Glue

I use dynamic frames to write a parquet file in S3 but if a file already exists my program append …

amazon-web-services parquet aws-glue
Add a partition on glue table via API on AWS?

I have an S3 bucket which is constantly being filled with new data, I am using Athena and Glue to …

amazon-web-services amazon-s3 amazon-athena aws-glue
use SQL inside AWS Glue pySpark script

I want to use AWS Glue to convert some csv data to orc. The ETL job I created generated the …

amazon-web-services pyspark pyspark-sql amazon-athena aws-glue
Is there any way to trigger a AWS Lambda function at the end of an AWS Glue job?

Currently I'm using an AWS Glue job to load data into RedShift, but after that load I need to run …

aws-lambda etl aws-glue
AWS Glue issue with double quote and commas

I have this CSV file: reference,address V7T452F4H9,"12410 W 62TH ST, AA D" The following options are …

hadoop hive presto amazon-athena aws-glue
AWS Glue takes a long time to finish

I just run a very simple job as follows glueContext = GlueContext(SparkContext.getOrCreate()) l_table = glueContext.create_dynamic_frame.from_…

amazon-web-services aws-glue
Event based trigger of AWS Glue Crawler after a file is uploaded into a S3 Bucket?

Is it possible to trigger an AWS Glue crawler on new files, that get uploaded into a S3 bucket, given …

amazon-web-services amazon-s3 aws-glue