Class ParquetDataColumnReaderFactory

java.lang.Object
org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory

public final class ParquetDataColumnReaderFactory extends Object
Parquet file has self-describing schema which may differ from the user required schema (e.g. schema evolution). This factory is used to retrieve user required typed data via corresponding reader which reads the underlying data. Part of the code is referred from Apache Hive.
  • Method Details

    • getDataColumnReaderByTypeOnDictionary

      public static ParquetDataColumnReader getDataColumnReaderByTypeOnDictionary(org.apache.parquet.schema.PrimitiveType parquetType, org.apache.parquet.column.Dictionary realReader, boolean isUtcTimestamp)
    • getDataColumnReaderByType

      public static ParquetDataColumnReader getDataColumnReaderByType(org.apache.parquet.schema.PrimitiveType parquetType, org.apache.parquet.column.values.ValuesReader realReader, boolean isUtcTimestamp)