Parsing an Amazon reviews file with Hive

Posted by wwwo4jvm on 2021-06-02 in Hadoop

I want to create a table from an Amazon reviews file that has the following format:

product/productId: B00006HAXW
review/userId: A1RSDE90N6RSZF
review/profileName: Joseph M. Kotow
review/helpfulness: 9/9
review/score: 5.0
review/time: 1042502400
review/summary: Pittsburgh - Home of the OLDIES
review/text: I have all of the doo wop DVD's and this one is as good or better than the
1st ones. Remember once these performers are gone, we'll never get to see them again.
Rhino did an excellent job and if you like or love doo wop and Rock n Roll you'll LOVE
this DVD !!

My SQL:

CREATE EXTERNAL TABLE reviews (id int, user_id int, profile_name int, helpfulness string, review_score float, review_time int, review_summary string, review_text string)

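I know Hive can load data using row and field delimiters, but in this file the lines do not all share the same format: each review spans several "key: value" lines. Can anyone help me parse this file format with Hive so that I can load it into my file system?
Thank you!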
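
One possible approach, offered as a minimal sketch rather than a definitive answer, is to flatten the file before Hive sees it: turn each multi-line review into a single tab-separated row, which Hive's ordinary delimited row format can then read. The Python below assumes that records are separated by blank lines, that every field line starts with one of the eight keys shown in the sample, and that any other non-blank line is a wrapped continuation of the previous value. The names parse_reviews and to_tsv_row are hypothetical helpers introduced for illustration.

```python
import io

# Field keys in the order they appear in the sample record.
FIELDS = [
    "product/productId",
    "review/userId",
    "review/profileName",
    "review/helpfulness",
    "review/score",
    "review/time",
    "review/summary",
    "review/text",
]


def parse_reviews(lines):
    """Yield one dict per review (assumption: blank lines separate records)."""
    record, last_key = {}, None
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            # A blank line closes the current record.
            if record:
                yield record
            record, last_key = {}, None
            continue
        key, sep, value = line.partition(":")
        if sep and key in FIELDS:
            record[key] = value.strip()
            last_key = key
        elif last_key is not None:
            # Not a known key: treat as a wrapped continuation of the last value.
            record[last_key] += " " + line.strip()
    if record:
        yield record


def to_tsv_row(record):
    """Join fields with tabs; embedded tabs become spaces so columns stay aligned."""
    return "\t".join(record.get(k, "").replace("\t", " ") for k in FIELDS)


if __name__ == "__main__":
    sample = io.StringIO(
        "product/productId: B00006HAXW\n"
        "review/userId: A1RSDE90N6RSZF\n"
        "review/score: 5.0\n"
        "review/text: I have all of the doo wop DVD's\n"
        "and this one is as good or better.\n"
    )
    for rec in parse_reviews(sample):
        print(to_tsv_row(rec))
```

The resulting rows could then be loaded into an external table declared with ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'. Note that in the sample the product ID, user ID and profile name are not integers, so those columns would need to be declared as string for the values to load.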
