记录:
Pune,2007,31.5
Pune,2007,30.5
Pune,2008,34.5
Blre,2009,13.0
Blre,2009,10.5
我正在使用的脚本:
grunt> A = LOAD '/home/cloudera/temp' using PigStorage(',') AS (city:chararray,year:int,temp:double);
grunt> B = group A by city;
grunt> C = FOREACH B GENERATE group, MAX(A.temp);
输出:
Pune, 34.5
Blre, 13.0
预期产量:
Pune, 2007, 31.5
Pune, 2008, 34.5
Blre, 2009, 13.0
怎样才能达到这个效果,提前谢谢。
1条答案
按热度按时间pepwfjgg1#
按城市和年份分组。