hadoop使用intwritable减少输出总是在2处停止

g9icjywg 于 2021-06-04 发布在 Hadoop

关注(0)|答案(1)|浏览(366)

reduce程序总是将值输出为2，即使给定键的值列表大于2。
例如：字数测试文件的字数与字数测试文件的字数相似，字数测试文件的字数与字数测试文件的字数相似
输出为：this 2 the 2 word 2

reduce代码是：

public class WordCountReducer
  extends Reducer<Text, IntWritable, Text, IntWritable> {
    //public static final log LOG = LogFactory.getLog(MyMapper.class);
  @Override
  public void reduce(Text key, Iterable<IntWritable> values,
      Context context)
      throws IOException, InterruptedException {
      IntWritable count = null;

      for (IntWritable value: values) {
           if (count == null) {
            count = value;
           } else {

            count.set(count.get() + value.get());

           }
          }

    context.write(key, count);
  }

}

你能解释一下这个问题吗？当我使用int counter时，它工作得很好。

Java hadoop reduce

来源：https://stackoverflow.com/questions/22082150/hadoop-reduce-output-using-intwritable-always-stops-at-2