Popular "apache-spark-dataset" questions | Page 4

I am trying to add a UUID column to my dataset. getDataset(Transaction.class)).withColumn("uniqueId", functions.lit(UUID.randomUUID().…

apache-spark apache-spark-dataset spark-csv

So, I'm creating some Datasets from the java Spark API. These datasets are populated from hive table, using the spark.…

java apache-spark dataset apache-spark-dataset bigdata

How can we overwrite a partitioned dataset, but only the partitions we are going to change? For example, recomputing last …

apache-spark hive apache-spark-dataset

Given the following DataSet values as inputData: column0 column1 column2 column3 A 88 text 99 Z 12 test 200 T 120 foo 12 In Spark, what …

scala apache-spark spark-dataframe apache-spark-dataset

Looking at the select() function on the spark DataSet there are various generated function signatures: (c1: TypedColumn[MyClass, U1],c2: …

scala apache-spark apache-spark-dataset

I like Spark Datasets as they give me analysis errors and syntax errors at compile time and also allow me …

scala apache-spark join apache-spark-sql apache-spark-dataset

While I am using Spark DataSet to load a csv file. I prefer designating schema clearly. But I find there …

apache-spark apache-spark-dataset

Getting this null error in spark Dataset.filter Input CSV: name,age,stat abc,22,m xyz,,s Working code: case …

scala apache-spark apache-spark-sql apache-spark-dataset

Recently I wanted to do Spark Machine Learning Lab from Spark Summit 2016. Training video is here and exported notebook is …

excel scala apache-spark apache-spark-dataset spark-excel

I have a table with a array type column named writer which has the values like array[value1, value2], array[…

apache-spark apache-spark-sql spark-dataframe hiveql apache-spark-dataset

Top "Apache-spark-dataset" questions