Top "Udf" questions

A function provided by the user of a program or an environment most often for spreadsheet type applications or database applications.

Spark UDF for StructType / Row

I have a "StructType" column in spark Dataframe that has an array and a string as sub-fields. I'd like to …

scala apache-spark udf
Select all columns of a Hive Struct

I have a requirement to select * from all columns from a hive struct. Hive create table script is here below …

struct hive udf apache-hive hive-udf
Spark UDF with varargs

Is it an only option to list all the arguments up to 22 as shown in documentation? https://spark.apache.org/…

scala apache-spark udf
Unable to use an existing Hive permanent UDF from Spark SQL

I have previously registered a UDF with hive. It is permanent not TEMPORARY. It works in beeline. CREATE FUNCTION normaliseURL …

apache-spark hive apache-spark-sql udf
Spark SQL nested withColumn

I have a DataFrame that has multiple columns of which some of them are structs. Something like this root |-- …

scala apache-spark dataframe udf
How to allow sklearn K Nearest Neighbors to take custom distance metric?

I have a custom distance metric that I need to use for KNN, K Nearest Neighbors. I tried following this, …

scikit-learn distance nearest-neighbor udf