Pig如何排序元组的袋值之一

093gszye  于 2021-06-21  发布在  Pig
关注(0)|答案(1)|浏览(311)

我的数据集结构如下: {movie:chararry, year:int, weight:float, actor:chararray} 我正在努力找出每年体重最高的运动。所以我按年份和电影对场景进行分组,然后得到以下场景:

{group: (year:int, movie:chararray), movies:{(movie:chararry, year:int, weight:float, actor:chararray)}}

我的问题是,我如何能排序袋值,重量之一设置?。谢谢您

hjzp0vay

hjzp0vay1#

可以在foreach中使用嵌套语句。

inpt = load '...../data.csv' using PigStorage(',') as (movie:chararry, year:int, weight:float, actor:chararray);

grp = group inpt by (year, movie);

srt = foreach grp {
    by_wright = ORDER inpt BY weight;
    generate group, by_wright;
};

相关问题