将 Flink DataStream 写入 Iceberg 表:NoSuchMethodError: org.apache.parquet.schema.Types$PrimitiveBuilder.as

问题描述 投票:0回答:1

我尝试将flink数据流写入冰山表,如下:

val kafkaStream = new KafkaDataSource(parameter, new PacketSchema).getStream(env)
val dataStream = kafkaStream.flatMap(new NullPacketFilter).map(FilteredPacket.from(_).toRow).javaStream

FlinkSink.forRow(dataStream, FilteredPacket.schema)
  .tableLoader(tableLoader)
  .build

然后我收到错误:

2021-02-18 18:12:12,086 WARN  org.apache.hadoop.util.NativeCodeLoader [] - Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
2021-02-18 18:12:12,424 INFO  org.apache.iceberg.BaseMetastoreCatalog [] - Table loaded by catalog: iceberg.flink_test.filtered_packets
2021-02-18 18:12:12,477 WARN  org.apache.flink.runtime.taskmanager.Task [] - Source: tianyi -> Flat Map -> Map -> Map -> IcebergStreamWriter (1/1) (9612408d42df7e69b829367434bbc43d) switched from RUNNING to FAILED.
java.lang.NoSuchMethodError: org.apache.parquet.schema.Types$PrimitiveBuilder.as(Lorg/apache/parquet/schema/LogicalTypeAnnotation;)Lorg/apache/parquet/schema/Types$Builder;
   at org.apache.iceberg.parquet.TypeToMessageType.primitive(TypeToMessageType.java:145) ~[tianyi112-1.0-SNAPSHOT.jar:?]
   at org.apache.iceberg.parquet.TypeToMessageType.field(TypeToMessageType.java:88) ~[tianyi112-1.0-SNAPSHOT.jar:?]
   at org.apache.iceberg.parquet.TypeToMessageType.convert(TypeToMessageType.java:65) ~[tianyi112-1.0-SNAPSHOT.jar:?]
   at org.apache.iceberg.parquet.ParquetSchemaUtil.convert(ParquetSchemaUtil.java:43) ~[tianyi112-1.0-SNAPSHOT.jar:?]
   ...

我检查了jar的内容,它包括所需的Types$PrimitiveBuilder类:

2651 Fri Feb 19 08:32:10 CST 2021 org/apache/parquet/schema/Types$PrimitiveBuilder.class
3101 Fri Feb 19 08:32:12 CST 2021 org/apache/flink/hive/shaded/parquet/schema/Types$PrimitiveBuilder.clas

查看源码时,发现idea有错误:

库源与 TypeToMessageType 类的字节码不匹配

但其他所有课程都可以。

我尝试删除我的maven存储库中的iceberg-parquet.jar和parquet-column.jar并重新导入项目,并尝试禁用Idea的Lombok插件 - 但没有效果。

版本:CDH 6.3.2 Flink 1.11.2 Iceberg 0.11.0

scala apache-flink parquet apache-iceberg
1个回答
1
投票

我已经解决了这个问题。原因是 jar 冲突,具体来说是 parquet-hadoop.jar(hive-exec 2.3.4) 和iceberg-parquet.jar(0.11.0)

© www.soinside.com 2019 - 2024. All rights reserved.