Why does the number of splits differ between the Java version and the Hadoop Streaming (Python) version of the same Hadoop example from The Definitive Guide?

Posted by jdgnovmf on 2021-06-02 in Hadoop

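I am running the maximum temperature example from Chapter 2 of Hadoop: The Definitive Guide, and I noticed that the Java version launches a different number of map tasks (input splits) than the Hadoop Streaming version written in Python. Can someone help me understand the reason for this difference?
Output from the Java example: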

Job Counters 
                Launched map tasks=1
                Launched reduce tasks=1
                Rack-local map tasks=1
                Total time spent by all maps in occupied slots (ms)=7007
                Total time spent by all reduces in occupied slots (ms)=5760
                Total time spent by all map tasks (ms)=7007
                Total time spent by all reduce tasks (ms)=5760
                Total vcore-seconds taken by all map tasks=7007
                Total vcore-seconds taken by all reduce tasks=5760
                Total megabyte-seconds taken by all map tasks=7175168
                Total megabyte-seconds taken by all reduce tasks=5898240

Output from the Hadoop Streaming example using Python:

Job Counters 
        Launched map tasks=2
        Launched reduce tasks=1
        Rack-local map tasks=2
        Total time spent by all maps in occupied slots (ms)=16730
        Total time spent by all reduces in occupied slots (ms)=4673
        Total time spent by all map tasks (ms)=16730
        Total time spent by all reduce tasks (ms)=4673
        Total vcore-seconds taken by all map tasks=16730
        Total vcore-seconds taken by all reduce tasks=4673
        Total megabyte-seconds taken by all map tasks=17131520
        Total megabyte-seconds taken by all reduce tasks=4785152
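
One possible explanation, offered as a sketch and not a confirmed diagnosis of the runs above: Hadoop Streaming runs on the older `org.apache.hadoop.mapred` API, whose `FileInputFormat` divides the total input size by a requested number of map tasks (the `mapreduce.job.maps` hint, which defaults to 2) when choosing a split size. The newer `org.apache.hadoop.mapreduce` API, used by the book's Java example, ignores that hint and works from the block size alone. The Python below only mimics that arithmetic. The function names, the 1,000,000-byte file size, and the 128 MB block size are illustrative assumptions, and the input is assumed to be a single uncompressed, splittable file.

```python
# Simplified model of how the two FileInputFormat variants pick a split size.
# All names and numbers here are illustrative, not taken from the runs above.

SPLIT_SLOP = 1.1  # slack factor: the last split may be up to 10% larger


def old_api_split_size(total_size, num_splits_hint, block_size, min_size=1):
    """Old mapred API: the goal size depends on the requested map count."""
    goal_size = total_size // max(num_splits_hint, 1)
    return max(min_size, min(goal_size, block_size))


def new_api_split_size(block_size, min_size=1, max_size=2**63 - 1):
    """New mapreduce API: the requested map count plays no role."""
    return max(min_size, min(max_size, block_size))


def count_splits(file_size, split_size):
    """Count the splits for one non-empty, splittable file."""
    remaining = file_size
    splits = 0
    while remaining / split_size > SPLIT_SLOP:
        splits += 1
        remaining -= split_size
    if remaining > 0:
        splits += 1  # whatever is left becomes the final split
    return splits


if __name__ == "__main__":
    file_size = 1_000_000            # hypothetical small input file, in bytes
    block_size = 128 * 1024 * 1024   # assumed HDFS block size
    hint = 2                         # assumed default of mapreduce.job.maps

    streaming = count_splits(file_size, old_api_split_size(file_size, hint, block_size))
    java = count_splits(file_size, new_api_split_size(block_size))
    print("old API (streaming) splits:", streaming)
    print("new API (Java) splits:", java)
```

If this is the cause, passing `-D mapreduce.job.maps=1` to the streaming job should bring it down to a single map task for a small input, which would be a quick way to check the hypothesis.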
