pyspark Spark UDF抛出NullPointerException

gmxoilav 于 2023-10-15 发布在 Spark

关注(0)|答案(1)|浏览(103)

我有spark UDF，我从SQL语句调用它。这就是我如何定义和注册UDF函数：

自定义项：

import org.apache.spark.sql.functions.udf

def udf_temp_emp (_type: String, _name: String): Int  = {

   val statement = s"select key from employee where empName = $_name"

   val key = spark.sql(statement).collect()(0).getInt(0)
   return key
}

spark.udf.register("udf_temp_emp", udf_temp_emp(_,_))

这就是我在SQL命令中调用它的方式：

select 
  udf_temp_emp(emp.type, emp.name), 
  emp.id 
from 
   empMaster

当我在上面运行命令时，它会抛出下面的异常：
SparkException：[FANUC_EXECUTE_UDF]执行自定义函数失败（$read$iw$$iw$$iw$$iw$$iw$$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$iw$Lambda$13841/67716274：（string，string）=> string）。原因：NullPointerException：

pyspark

来源：https://stackoverflow.com/questions/77126540/spark-udf-throws-nullpointerexception