我试着用Kafka来运行spark流媒体。我使用的是Scala2.11.8版和Spark2.1.0版,构建于Scala2.11.8之上。我知道问题是scala版本不匹配,但是所有的依赖项都添加了正确的版本(见附件pic),我仍然得到了这个错误。
Exception in thread "main" java.lang.NoClassDefFoundError: scala/collection/GenTraversableOnce$class
at kafka.utils.Pool.<init>(Unknown Source)
at kafka.consumer.FetchRequestAndResponseStatsRegistry$.<init>(Unknown Source)
at kafka.consumer.FetchRequestAndResponseStatsRegistry$.<clinit>(Unknown Source)
at kafka.consumer.SimpleConsumer.<init>(Unknown Source)
at org.apache.spark.streaming.kafka.KafkaCluster.connect(KafkaCluster.scala:59)
at org.apache.spark.streaming.kafka.KafkaCluster$$anonfun$org$apache$spark$streaming$kafka$KafkaCluster$$withBrokers$1.apply(KafkaCluster.scala:364)
at org.apache.spark.streaming.kafka.KafkaCluster$$anonfun$org$apache$spark$streaming$kafka$KafkaCluster$$withBrokers$1.apply(KafkaCluster.scala:361)
at scala.collection.IndexedSeqOptimized$class.foreach(IndexedSeqOptimized.scala:33)
at scala.collection.mutable.WrappedArray.foreach(WrappedArray.scala:35)
at org.apache.spark.streaming.kafka.KafkaCluster.org$apache$spark$streaming$kafka$KafkaCluster$$withBrokers(KafkaCluster.scala:361)
at org.apache.spark.streaming.kafka.KafkaCluster.getPartitionMetadata(KafkaCluster.scala:132)
at org.apache.spark.streaming.kafka.KafkaCluster.getPartitions(KafkaCluster.scala:119)
at org.apache.spark.streaming.kafka.KafkaUtils$.getFromOffsets(KafkaUtils.scala:211)
at org.apache.spark.streaming.kafka.KafkaUtils$.createDirectStream(KafkaUtils.scala:484)
at org.apache.spark.streaming.kafka.KafkaUtils$.createDirectStream(KafkaUtils.scala:607)
at com.forrester.streaming.kafka.App$.main(App.scala:19)
at com.forrester.streaming.kafka.App.main(App.scala)
从属关系
依赖项
<dependency>
<groupId>org.scala-lang</groupId>
<artifactId>scala-library</artifactId>
<version>2.11.8</version>
<scope>provided</scope>
</dependency>
<dependency>
<groupId>com.koverse</groupId>
<artifactId>koverse-shaded-deps</artifactId>
<version>${koverse.version}</version>
<scope>provided</scope>
</dependency>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-mllib_2.11</artifactId>
<version>2.1.0</version>
<exclusions>
<exclusion>
<groupId>*</groupId>
<artifactId>*</artifactId>
</exclusion>
</exclusions>
</dependency>
<dependency>
<groupId>org.scalanlp</groupId>
<artifactId>breeze_2.11</artifactId>
<version>0.11.2</version>
</dependency>
<dependency>
<groupId>org.xerial.snappy</groupId>
<artifactId>snappy-java</artifactId>
<version>1.0.5</version>
</dependency>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-hive_2.11</artifactId>
<version>2.1.0</version>
<scope>test</scope>
</dependency>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-streaming_2.11</artifactId>
<version>2.1.0</version>
</dependency>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-streaming-kafka-0-8-assembly_2.11</artifactId>
<version>2.1.0</version>
</dependency>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-sql_2.11</artifactId>
<version>2.1.0</version>
</dependency>
</dependencies>
我对不同的版本做了更多的分析:
|Spark build on Scala | Kafka jar | Result |
| ------------------- | --------------------------------------------------- | -------------- |
| 2.1.1 on 2.11.8 | spark-streaming-kafka-0-8-assembly_2.11-2.1.1.jar | **Working**|
| 2.1.1 on 2.11.8 | spark-streaming-kafka-0-8-assembly_2.10-2.1.1.jar | Error as Expected |
| 2.1.1 on 2.11.8 | spark-streaming-kafka-0-8-assembly_2.10-2.1.0.jar | Error as Expected |
| 2.1.0 on 2.11.8 | spark-streaming-kafka-0-8-assembly_2.10-2.1.0.jar | Error as Expected |
| 2.1.0 on 2.11.8 | spark-streaming-kafka-0-8-assembly_2.11-2.1.0.jar | **Error : ideally should pass**|
| 2.1.0 on 2.11.8 | spark-streaming-kafka-0-8-assembly_2.11-2.1.1.jar | Error as Expected |
| 2.1.0 on 2.11.8 | spark-streaming-kafka-0-8-assembly_2.10-2.1.0.jar | Error as Expected |
错误消息classnotfoundexception:scala.collection.gentraversableonce$class
case-1正在工作,但是case-5失败了,这不应该引发任何错误
暂无答案!
目前还没有任何答案,快来回答吧!