我试图在eclipse中运行Nutch1.9,我的所有配置都是根据本文进行的(http://yewintko.wordpress.com/2014/02/02/setting-up-nutch-in-eclipse-indigo/). 但我有个错误:
CrawlDb update: starting at 2014-11-10 15:50:10
CrawlDb update: db: urls
CrawlDb update: segments: [3, crawl]
CrawlDb update: additions allowed: true
CrawlDb update: URL normalizing: false
CrawlDb update: URL filtering: false
CrawlDb update: 404 purging: false
CrawlDb update: Merging segment data into db.
CrawlDb update: java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357)
at org.apache.nutch.crawl.CrawlDb.update(CrawlDb.java:119)
at org.apache.nutch.crawl.CrawlDb.run(CrawlDb.java:219)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.crawl.CrawlDb.main(CrawlDb.java:179)
1条答案
按热度按时间p5fdfcr11#
你试过按照nutchwiki的步骤来做吗?