Class ParquetDataColumnReaderFactory
java.lang.Object
org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory
A Parquet file has a self-describing schema, which may differ from the schema the user
requests (e.g. because of schema evolution). This factory returns the reader that reads the
underlying data and exposes it as the user-requested type. Part of the code is adapted from Apache Hive.
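The idea described above can be illustrated with a minimal, self-contained sketch. It is not Flink's implementation: every type and method in it (`ReaderFactorySketch`, `PhysicalType`, `RawValues`, `LongColumnReader`, `readerFor`, `fixed`) is hypothetical and stands in for the real Parquet and Flink classes. It only shows the general shape of a factory that inspects the type declared in the file and returns a reader that yields the type the caller asked for.

```java
// Hypothetical sketch: none of these types are Flink or Parquet classes.
final class ReaderFactorySketch {

    /** Physical types a file might declare (illustrative subset). */
    enum PhysicalType { INT32, INT64 }

    /** Stand-in for the low-level source of stored values. */
    interface RawValues {
        int nextInt();
        long nextLong();
    }

    /** Typed view the caller asked for; in this sketch the caller always wants a long. */
    interface LongColumnReader {
        long readLong();
    }

    /** Picks a reader that matches how the file stores the column. */
    static LongColumnReader readerFor(PhysicalType fileType, RawValues raw) {
        switch (fileType) {
            case INT32:
                // The file stores 32-bit ints; widen each value to the requested long.
                return () -> raw.nextInt();
            case INT64:
                // The stored type already matches the requested type.
                return raw::nextLong;
            default:
                throw new IllegalArgumentException("Unsupported type: " + fileType);
        }
    }

    /** Helper for the example: a source that always yields the given values. */
    static RawValues fixed(int intValue, long longValue) {
        return new RawValues() {
            @Override public int nextInt() { return intValue; }
            @Override public long nextLong() { return longValue; }
        };
    }

    public static void main(String[] args) {
        RawValues raw = fixed(7, 9_000_000_000L);
        System.out.println(readerFor(PhysicalType.INT32, raw).readLong());
        System.out.println(readerFor(PhysicalType.INT64, raw).readLong());
    }
}
```

In this sketch the caller never branches on the stored type; the factory hides that decision, which is the role the description above assigns to this class.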
Nested Class Summary
Nested Classes
Modifier and Type    Class    Description
static class    The default data column reader for the existing Parquet page reader. It works for both dictionary and non-dictionary types. Mirrored from the dictionary encoding path.
static class    The reader that reads from the underlying Timestamp value.
Method Summary
Modifier and Type    Method    Description
static ParquetDataColumnReader    getDataColumnReaderByType(org.apache.parquet.schema.PrimitiveType parquetType, org.apache.parquet.column.values.ValuesReader realReader, boolean isUtcTimestamp)
static ParquetDataColumnReader    getDataColumnReaderByTypeOnDictionary(org.apache.parquet.schema.PrimitiveType parquetType, org.apache.parquet.column.Dictionary realReader, boolean isUtcTimestamp)
Method Details
getDataColumnReaderByTypeOnDictionary
public static ParquetDataColumnReader getDataColumnReaderByTypeOnDictionary(org.apache.parquet.schema.PrimitiveType parquetType, org.apache.parquet.column.Dictionary realReader, boolean isUtcTimestamp)
getDataColumnReaderByType
public static ParquetDataColumnReader getDataColumnReaderByType(org.apache.parquet.schema.PrimitiveType parquetType, org.apache.parquet.column.values.ValuesReader realReader, boolean isUtcTimestamp)