使用Pyspark将查询结果发送到DataFrame：值错误：对象长度与字段长度不匹配

t8e9dugd 于 2022-11-21 发布在 Spark

关注(0)|答案(1)|浏览(168)

我从 RDS 运行一个查询，并使用 Pyspark 将该查询转换为 DataFrame 。
这是我的代码

query= "Select * from profit"
profit=pd.read_sql(query, con=db_connection)

StructureSechma=StructType([
   StructField("id",IntegerType(), True),
   StructField("type",StringType(), False),
   StructField("userId",IntegerType(), True),
   StructField("amount",FloatType(), False),
   StructField("sell",StringType(), False),
   StructField("buy",StringType(), False),
   StructField("createdAt",DateType(), False),
   StructField("updatedAt",DateType(), False)
    ])
   profit_df = spark.createDataFrame(profit,,schema=StructureSechma)

中的每一个
我收到这期

File "<stdin>", line 1, in <module>
  File "/home/ec2-user/anaconda3/lib/python3.6/site-packages/pyspark/sql/session.py", line 748, in createDataFrame
    rdd, schema = self._createFromLocal(map(prepare, data), schema)
  File "/home/ec2-user/anaconda3/lib/python3.6/site-packages/pyspark/sql/session.py", line 413, in _createFromLocal
    data = list(data)
  File "/home/ec2-user/anaconda3/lib/python3.6/site-packages/pyspark/sql/session.py", line 730, in prepare
    verify_func(obj)
  File "/home/ec2-user/anaconda3/lib/python3.6/site-packages/pyspark/sql/types.py", line 1391, in verify
    verify_value(obj)
  File "/home/ec2-user/anaconda3/lib/python3.6/site-packages/pyspark/sql/types.py", line 1370, in verify_struct
    "length of fields (%d)" % (len(obj), len(verifiers))))
ValueError: Length of object (25) does not match with length of fields (8)

格式
对于如何解决此问题有何建议？
谢谢

pyspark

来源：https://stackoverflow.com/questions/74485418/query-result-to-dataframe-using-pyspark-valueerror-length-of-object-does-not-m

1条答案

按热度按时间

nx7onnlm1#

你不需要 Pandas 。
使用 Spark 直接查询使用 spark.read.jdbc 的 RDS ，然后会自动从数据库本身推断出您的模式。
https://spark.apache.org/docs/latest/sql-data-sources-jdbc.html 的最大值
否则，看看考拉库，它有 from_pandas 函数
https://koalas.readthedocs.io/en/latest/user_guide/pandas_pyspark.html 格式

赞(0）回复(0）举报 2022-11-21

我来回答

使用Pyspark将查询结果发送到DataFrame：值错误：对象长度与字段长度不匹配

1条答案

相关问题

热门标签

最新问答