A = foreach win_grouped generate $0 as id,count($1) as c; -- (1,228),(2,230)... so on
A1 = filter A by ($0 > 1); -- (2,230),(3,296)... so on
B = foreach A1 generate ($0 - 1) as id,$1 as c; -- (1,230),(2,296)... so on
AB = join A by id,B by id; -- (1,228,1,230),(2,230,2,296)...so on
C = foreach AB generate (A::id + 1),(B::c - A::c) -- (2,2),(3,66)...so on
D = limit A 1; -- (1,288)
E = UNION D,C; -- (1,288),(2,2),(3,66)...so on
DUMP E;
1条答案
按热度按时间nkhmeac61#
这有点棘手,但可以使用联接来实现。生成另一个从第二行开始但id为1的关系,即($0-1)。联接这两个关系并生成差异。对于id,添加1以获取原始id。将第一行与包含差异的行合并。