在eclipse中运行nutch1.9得到错误crawldb update:java.io.ioexception:job failed

nlejzf6q  于 2021-06-02  发布在  Hadoop
关注(0)|答案(1)|浏览(303)

我试图在eclipse中运行Nutch1.9,我的所有配置都是根据本文进行的(http://yewintko.wordpress.com/2014/02/02/setting-up-nutch-in-eclipse-indigo/). 但我有个错误:

CrawlDb update: starting at 2014-11-10 15:50:10
CrawlDb update: db: urls
CrawlDb update: segments: [3, crawl]
CrawlDb update: additions allowed: true
CrawlDb update: URL normalizing: false
CrawlDb update: URL filtering: false
CrawlDb update: 404 purging: false
CrawlDb update: Merging segment data into db.
CrawlDb update: java.io.IOException: Job failed!
    at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357)
    at org.apache.nutch.crawl.CrawlDb.update(CrawlDb.java:119)
    at org.apache.nutch.crawl.CrawlDb.run(CrawlDb.java:219)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.nutch.crawl.CrawlDb.main(CrawlDb.java:179)
p5fdfcr1

p5fdfcr11#

你试过按照nutchwiki的步骤来做吗?

相关问题