Index
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form
A
- AbstractColumnReader<VECTOR extends org.apache.flink.table.data.columnar.vector.writable.WritableColumnVector> - Class in org.apache.flink.formats.parquet.vector.reader
-
Abstract
ColumnReader. - AbstractColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
- addElement(T) - Method in class org.apache.flink.formats.parquet.ParquetBulkWriter
- afterReadPage() - Method in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
-
After read a page, we may need some initialization.
- afterReadPage() - Method in class org.apache.flink.formats.parquet.vector.reader.BooleanColumnReader
- appendBytes(int, byte[], int, int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- AvroParquetReaders - Class in org.apache.flink.formats.parquet.avro
-
A convenience builder to create
AvroParquetRecordFormatinstances for the different kinds of Avro record types. - AvroParquetWriters - Class in org.apache.flink.formats.parquet.avro
-
Convenience builder to create
ParquetWriterFactoryinstances for the different Avro types.
B
- BATCH_SIZE - Static variable in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
- BooleanColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
-
Boolean
ColumnReader. - BooleanColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.BooleanColumnReader
- buildFieldsList(List<RowType.RowField>, List<String>, MessageColumnIO) - Static method in class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
- ByteColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
-
Byte
ColumnReader. - ByteColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.ByteColumnReader
- BytesColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
-
Bytes
ColumnReader. - BytesColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.BytesColumnReader
C
- calculateCollectionOffsets(ParquetField, int[], int[]) - Static method in class org.apache.flink.formats.parquet.utils.NestedPositionUtil
-
Calculate the collection's offsets according to column's max repetition level, definition level, value's repetition level and definition level.
- calculateLengthByOffsets(boolean[], long[]) - Static method in class org.apache.flink.formats.parquet.utils.NestedPositionUtil
- calculateRowOffsets(ParquetField, int[], int[]) - Static method in class org.apache.flink.formats.parquet.utils.NestedPositionUtil
-
Calculate row offsets according to column's max repetition level, definition level, value's repetition level and definition level.
- checkTypeName(PrimitiveType.PrimitiveTypeName) - Method in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
- close() - Method in class org.apache.flink.formats.parquet.vector.ParquetColumnarRowSplitReader
- CollectionPosition - Class in org.apache.flink.formats.parquet.vector.position
-
To represent collection's position in repeated type.
- CollectionPosition(boolean[], long[], long[], int) - Constructor for class org.apache.flink.formats.parquet.vector.position.CollectionPosition
- columnarBatch - Variable in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat.ParquetReaderBatch
- ColumnBatchFactory<SplitT extends org.apache.flink.connector.file.src.FileSourceSplit> - Interface in org.apache.flink.formats.parquet.vector
-
Interface to create
VectorizedColumnBatch. - ColumnReader<VECTOR extends org.apache.flink.table.data.columnar.vector.writable.WritableColumnVector> - Interface in org.apache.flink.formats.parquet.vector.reader
-
Read a batch of records for a column to
WritableColumnVectorfrom parquet data file. - computeMinBytesForDecimalPrecision(int) - Static method in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
- conf() - Method in class org.apache.flink.formats.parquet.utils.SerializableConfiguration
- convertAndGetIterator(long) - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat.ParquetReaderBatch
-
Provides reading iterator after the records are written to the
ParquetVectorizedInputFormat.ParquetReaderBatch.columnarBatch. - convertToParquetMessageType(String, RowType, Configuration) - Static method in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
- convertToParquetType(String, LogicalType, Configuration) - Static method in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
- create(FSDataOutputStream) - Method in class org.apache.flink.formats.parquet.ParquetWriterFactory
- create(SplitT, ColumnVector[]) - Method in interface org.apache.flink.formats.parquet.vector.ColumnBatchFactory
- createColumnReader(boolean, LogicalType, Type, List<ColumnDescriptor>, PageReadStore, ParquetField, int) - Static method in class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
- createDecodingFormat(DynamicTableFactory.Context, ReadableConfig) - Method in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
- createEncodingFormat(DynamicTableFactory.Context, ReadableConfig) - Method in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
- createPartitionedFormat(Configuration, RowType, TypeInformation<RowData>, List<String>, PartitionFieldExtractor<SplitT>, int, boolean, boolean) - Static method in class org.apache.flink.formats.parquet.ParquetColumnarRowInputFormat
-
Create a partitioned
ParquetColumnarRowInputFormat, the partition columns can be generated byPath. - createReader(Configuration, SplitT) - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
- createReaderBatch(WritableColumnVector[], VectorizedColumnBatch, Pool.Recycler<ParquetVectorizedInputFormat.ParquetReaderBatch<RowData>>) - Method in class org.apache.flink.formats.parquet.ParquetColumnarRowInputFormat
- createReaderBatch(WritableColumnVector[], VectorizedColumnBatch, Pool.Recycler<ParquetVectorizedInputFormat.ParquetReaderBatch<T>>) - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
- createRuntimeDecoder(DynamicTableSource.Context, DataType, int[][]) - Method in class org.apache.flink.formats.parquet.ParquetFileFormatFactory.ParquetBulkDecodingFormat
- createVectorFromConstant(LogicalType, Object, int) - Static method in class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
- createWritableColumnVector(int, LogicalType, Type, List<ColumnDescriptor>, int) - Static method in class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
- createWriter(OutputFile) - Method in interface org.apache.flink.formats.parquet.ParquetBuilder
-
Creates and configures a parquet writer to the given output file.
- createWriter(OutputFile) - Method in class org.apache.flink.formats.parquet.row.ParquetRowDataBuilder.FlinkParquetBuilder
- createWriterFactory(RowType, Configuration, boolean) - Static method in class org.apache.flink.formats.parquet.row.ParquetRowDataBuilder
-
Create a parquet
BulkWriter.Factory.
D
- decodeInt64ToTimestamp(boolean, Dictionary, int, LogicalTypeAnnotation.TimeUnit) - Static method in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
- decodeInt96ToTimestamp(boolean, Dictionary, int) - Static method in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
- decodeToBinary(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDictionary
- decodeToDouble(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDictionary
- decodeToFloat(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDictionary
- decodeToInt(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDictionary
- decodeToLong(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDictionary
- decodeToTimestamp(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDictionary
- DefaultParquetDataColumnReader(Dictionary) - Constructor for class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- DefaultParquetDataColumnReader(ValuesReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- descriptor - Variable in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
- dict - Variable in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- dictionary - Variable in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
-
The dictionary, if this column has dictionary encoding.
- DoubleColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
-
Double
ColumnReader. - DoubleColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.DoubleColumnReader
F
- factoryIdentifier() - Method in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
- fill(byte[]) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- fill(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- fill(long) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- fillWithNulls() - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- finish() - Method in class org.apache.flink.formats.parquet.ParquetBulkWriter
- FixedLenBytesColumnReader<VECTOR extends org.apache.flink.table.data.columnar.vector.writable.WritableColumnVector> - Class in org.apache.flink.formats.parquet.vector.reader
-
Fixed length bytes
ColumnReader, just for decimal. - FixedLenBytesColumnReader(ColumnDescriptor, PageReader, int) - Constructor for class org.apache.flink.formats.parquet.vector.reader.FixedLenBytesColumnReader
- FlinkParquetBuilder(RowType, Configuration, boolean) - Constructor for class org.apache.flink.formats.parquet.row.ParquetRowDataBuilder.FlinkParquetBuilder
- FloatColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
-
Float
ColumnReader. - FloatColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.FloatColumnReader
- flush() - Method in class org.apache.flink.formats.parquet.ParquetBulkWriter
- forGenericRecord(Schema) - Static method in class org.apache.flink.formats.parquet.avro.AvroParquetReaders
-
Creates a new
AvroParquetRecordFormatthat reads the parquet file into AvroGenericRecords. - forGenericRecord(Schema) - Static method in class org.apache.flink.formats.parquet.avro.AvroParquetWriters
-
Creates a ParquetWriterFactory that accepts and writes Avro generic types.
- forReflectRecord(Class<T>) - Static method in class org.apache.flink.formats.parquet.avro.AvroParquetReaders
-
Creates a new
AvroParquetRecordFormatthat reads the parquet file into Avro records via reflection. - forReflectRecord(Class<T>) - Static method in class org.apache.flink.formats.parquet.avro.AvroParquetWriters
-
Creates a ParquetWriterFactory for the given type.
- forSpecificRecord(Class<T>) - Static method in class org.apache.flink.formats.parquet.avro.AvroParquetReaders
-
Creates a new
AvroParquetRecordFormatthat reads the parquet file into AvroSpecificRecords. - forSpecificRecord(Class<T>) - Static method in class org.apache.flink.formats.parquet.avro.AvroParquetWriters
-
Creates a ParquetWriterFactory for an Avro specific type.
- forType(Class<T>) - Static method in class org.apache.flink.formats.parquet.protobuf.ParquetProtoWriters
-
Creates a
ParquetWriterFactoryfor the given type.
G
- generate(ColumnVector[]) - Method in interface org.apache.flink.formats.parquet.vector.ParquetColumnarRowSplitReader.ColumnBatchGenerator
- genPartColumnarRowReader(boolean, boolean, Configuration, String[], DataType[], Map<String, Object>, int[], int, Path, long, long) - Static method in class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
-
Util for generating partitioned
ParquetColumnarRowSplitReader. - getArrayElementColumn(ColumnIO) - Static method in class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
- getBytes(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- getChangelogMode() - Method in class org.apache.flink.formats.parquet.ParquetFileFormatFactory.ParquetBulkDecodingFormat
- getChildren() - Method in class org.apache.flink.formats.parquet.vector.type.ParquetGroupField
- getDataColumnReaderByType(PrimitiveType, ValuesReader, boolean) - Static method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory
- getDataColumnReaderByTypeOnDictionary(PrimitiveType, Dictionary, boolean) - Static method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory
- getDecimal(int, int, int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- getDefinitionLevel() - Method in class org.apache.flink.formats.parquet.vector.position.LevelDelegation
- getDefinitionLevel() - Method in class org.apache.flink.formats.parquet.vector.type.ParquetField
- getDescriptor() - Method in class org.apache.flink.formats.parquet.vector.type.ParquetPrimitiveField
- getDictionary() - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- getDictionary() - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- getDictionaryIds() - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- getId() - Method in class org.apache.flink.formats.parquet.vector.type.ParquetPrimitiveField
- getInt(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- getIsNull() - Method in class org.apache.flink.formats.parquet.vector.position.CollectionPosition
- getIsNull() - Method in class org.apache.flink.formats.parquet.vector.position.RowPosition
- getLength() - Method in class org.apache.flink.formats.parquet.ParquetInputFile
- getLength() - Method in class org.apache.flink.formats.parquet.vector.position.CollectionPosition
- getLevelDelegation() - Method in class org.apache.flink.formats.parquet.vector.reader.NestedPrimitiveColumnReader
- getLong(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- getMapKeyValueColumn(GroupColumnIO) - Static method in class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
- getOffsets() - Method in class org.apache.flink.formats.parquet.vector.position.CollectionPosition
- getPositionsCount() - Method in class org.apache.flink.formats.parquet.vector.position.RowPosition
- getProducedType() - Method in class org.apache.flink.formats.parquet.ParquetColumnarRowInputFormat
- getRepetitionLevel() - Method in class org.apache.flink.formats.parquet.vector.position.LevelDelegation
- getRepetitionLevel() - Method in class org.apache.flink.formats.parquet.vector.type.ParquetField
- getTableStatistics(List<Path>, DataType, Configuration, boolean) - Static method in class org.apache.flink.formats.parquet.utils.ParquetFormatStatisticsReportUtil
- getTableStatistics(List<Path>, DataType, Configuration, boolean, int) - Static method in class org.apache.flink.formats.parquet.utils.ParquetFormatStatisticsReportUtil
- getType() - Method in class org.apache.flink.formats.parquet.vector.type.ParquetField
- getValueCount() - Method in class org.apache.flink.formats.parquet.vector.position.CollectionPosition
- getVector() - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- getWriteSupport(Configuration) - Method in class org.apache.flink.formats.parquet.protobuf.ParquetProtoWriters.ParquetProtoWriterBuilder
- getWriteSupport(Configuration) - Method in class org.apache.flink.formats.parquet.row.ParquetRowDataBuilder
H
- hadoopConfig - Variable in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
- hasDictionary() - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
I
- IDENTIFIER - Static variable in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
- initFromPage(int, ByteBufferInputStream) - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
-
Initialize the reader by page data.
- initFromPage(int, ByteBufferInputStream) - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- int64ToTimestamp(boolean, long, LogicalTypeAnnotation.TimeUnit) - Static method in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
- int96ToTimestamp(boolean, long, int) - Static method in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
- IntColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
-
Int
ColumnReader. - IntColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.IntColumnReader
- is32BitDecimal(int) - Static method in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
- is64BitDecimal(int) - Static method in class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
- isNullAt(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- isOptionalFieldValueNull(int, int) - Static method in class org.apache.flink.formats.parquet.utils.NestedPositionUtil
- isRequired() - Method in class org.apache.flink.formats.parquet.vector.type.ParquetField
- isSplittable() - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
- isUtcTimestamp - Variable in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
J
- JULIAN_EPOCH_OFFSET_DAYS - Static variable in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
L
- LevelDelegation - Class in org.apache.flink.formats.parquet.vector.position
-
To delegate repetition level and definition level.
- LevelDelegation(int[], int[]) - Constructor for class org.apache.flink.formats.parquet.vector.position.LevelDelegation
- LongColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
-
Long
ColumnReader. - LongColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.LongColumnReader
- lookupColumnByName(GroupColumnIO, String) - Static method in class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
-
Parquet's column names are case in sensitive.
M
- maxDefLevel - Variable in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
-
Maximum definition level for this column.
- MICROS_PER_MILLISECOND - Static variable in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
- MICROS_PER_SECOND - Static variable in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
- MILLIS_IN_DAY - Static variable in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
- MILLIS_PER_SECOND - Static variable in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
N
- NANOS_PER_MICROSECONDS - Static variable in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
- NANOS_PER_MILLISECOND - Static variable in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
- NANOS_PER_SECOND - Static variable in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
- NestedColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
-
This ColumnReader mainly used to read `Group` type in parquet such as `Map`, `Array`, `Row`.
- NestedColumnReader(boolean, PageReadStore, ParquetField) - Constructor for class org.apache.flink.formats.parquet.vector.reader.NestedColumnReader
- NestedPositionUtil - Class in org.apache.flink.formats.parquet.utils
-
Utils to calculate nested type position.
- NestedPositionUtil() - Constructor for class org.apache.flink.formats.parquet.utils.NestedPositionUtil
- NestedPrimitiveColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
-
Reader to read nested primitive column.
- NestedPrimitiveColumnReader(ColumnDescriptor, PageReader, boolean, Type, LogicalType) - Constructor for class org.apache.flink.formats.parquet.vector.reader.NestedPrimitiveColumnReader
- NestedPrimitiveColumnReader.NullIntIterator - Class in org.apache.flink.formats.parquet.vector.reader
-
Reading zero always.
- NestedPrimitiveColumnReader.RLEIntIterator - Class in org.apache.flink.formats.parquet.vector.reader
-
Reading int from
RunLengthBitPackingHybridDecoder. - NestedPrimitiveColumnReader.ValuesReaderIntIterator - Class in org.apache.flink.formats.parquet.vector.reader
-
Reading int from
ValuesReader. - newStream() - Method in class org.apache.flink.formats.parquet.ParquetInputFile
- nextInt() - Method in class org.apache.flink.formats.parquet.vector.reader.NestedPrimitiveColumnReader.NullIntIterator
- nextInt() - Method in class org.apache.flink.formats.parquet.vector.reader.NestedPrimitiveColumnReader.RLEIntIterator
- nextInt() - Method in class org.apache.flink.formats.parquet.vector.reader.NestedPrimitiveColumnReader.ValuesReaderIntIterator
- nextRecord() - Method in class org.apache.flink.formats.parquet.vector.ParquetColumnarRowSplitReader
- NullIntIterator() - Constructor for class org.apache.flink.formats.parquet.vector.reader.NestedPrimitiveColumnReader.NullIntIterator
- numBatchesToCirculate(Configuration) - Method in class org.apache.flink.formats.parquet.ParquetColumnarRowInputFormat
- numBatchesToCirculate(Configuration) - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
O
- optionalOptions() - Method in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
- org.apache.flink.formats.parquet - package org.apache.flink.formats.parquet
- org.apache.flink.formats.parquet.avro - package org.apache.flink.formats.parquet.avro
- org.apache.flink.formats.parquet.protobuf - package org.apache.flink.formats.parquet.protobuf
- org.apache.flink.formats.parquet.row - package org.apache.flink.formats.parquet.row
- org.apache.flink.formats.parquet.utils - package org.apache.flink.formats.parquet.utils
- org.apache.flink.formats.parquet.vector - package org.apache.flink.formats.parquet.vector
- org.apache.flink.formats.parquet.vector.position - package org.apache.flink.formats.parquet.vector.position
- org.apache.flink.formats.parquet.vector.reader - package org.apache.flink.formats.parquet.vector.reader
- org.apache.flink.formats.parquet.vector.type - package org.apache.flink.formats.parquet.vector.type
P
- ParquetBuilder<T> - Interface in org.apache.flink.formats.parquet
-
A builder to create a
ParquetWriterfrom a ParquetOutputFile. - ParquetBulkDecodingFormat(ReadableConfig) - Constructor for class org.apache.flink.formats.parquet.ParquetFileFormatFactory.ParquetBulkDecodingFormat
- ParquetBulkWriter<T> - Class in org.apache.flink.formats.parquet
-
A simple
BulkWriterimplementation that wraps aParquetWriter. - ParquetBulkWriter(ParquetWriter<T>) - Constructor for class org.apache.flink.formats.parquet.ParquetBulkWriter
-
Creates a new ParquetBulkWriter wrapping the given ParquetWriter.
- ParquetColumnarRowInputFormat<SplitT extends org.apache.flink.connector.file.src.FileSourceSplit> - Class in org.apache.flink.formats.parquet
-
A
ParquetVectorizedInputFormatto provideRowDataiterator. - ParquetColumnarRowInputFormat(Configuration, RowType, TypeInformation<RowData>, int, boolean, boolean) - Constructor for class org.apache.flink.formats.parquet.ParquetColumnarRowInputFormat
-
Constructor to create parquet format without extra fields.
- ParquetColumnarRowSplitReader - Class in org.apache.flink.formats.parquet.vector
-
This reader is used to read a
VectorizedColumnBatchfrom input split. - ParquetColumnarRowSplitReader(boolean, boolean, Configuration, LogicalType[], String[], ParquetColumnarRowSplitReader.ColumnBatchGenerator, int, Path, long, long) - Constructor for class org.apache.flink.formats.parquet.vector.ParquetColumnarRowSplitReader
- ParquetColumnarRowSplitReader.ColumnBatchGenerator - Interface in org.apache.flink.formats.parquet.vector
-
Interface to gen
VectorizedColumnBatch. - ParquetDataColumnReader - Interface in org.apache.flink.formats.parquet.vector.reader
-
The interface to wrap the underlying Parquet dictionary and non dictionary encoded page reader.
- ParquetDataColumnReaderFactory - Class in org.apache.flink.formats.parquet.vector.reader
-
Parquet file has self-describing schema which may differ from the user required schema (e.g.
- ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
-
The default data column reader for existing Parquet page reader which works for both dictionary or non dictionary types, Mirror from dictionary encoding path.
- ParquetDataColumnReaderFactory.TypesFromInt96PageReader - Class in org.apache.flink.formats.parquet.vector.reader
-
The reader who reads from the underlying Timestamp value.
- ParquetDecimalVector - Class in org.apache.flink.formats.parquet.vector
-
Parquet write decimal as int32 and int64 and binary, this class wrap the real vector to provide
DecimalColumnVectorinterface. - ParquetDecimalVector(ColumnVector) - Constructor for class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- ParquetDictionary - Class in org.apache.flink.formats.parquet.vector
-
Parquet dictionary.
- ParquetDictionary(Dictionary, ColumnDescriptor) - Constructor for class org.apache.flink.formats.parquet.vector.ParquetDictionary
- ParquetField - Class in org.apache.flink.formats.parquet.vector.type
-
Field that represent parquet's field type.
- ParquetField(LogicalType, int, int, boolean) - Constructor for class org.apache.flink.formats.parquet.vector.type.ParquetField
- ParquetFileFormatFactory - Class in org.apache.flink.formats.parquet
-
Parquet format factory for file system.
- ParquetFileFormatFactory() - Constructor for class org.apache.flink.formats.parquet.ParquetFileFormatFactory
- ParquetFileFormatFactory.ParquetBulkDecodingFormat - Class in org.apache.flink.formats.parquet
-
ParquetBulkDecodingFormat which implements
FileBasedStatisticsReportableInputFormat. - ParquetFormatStatisticsReportUtil - Class in org.apache.flink.formats.parquet.utils
-
Utils for Parquet format statistics report.
- ParquetFormatStatisticsReportUtil() - Constructor for class org.apache.flink.formats.parquet.utils.ParquetFormatStatisticsReportUtil
- ParquetGroupField - Class in org.apache.flink.formats.parquet.vector.type
-
Field that represent parquet's Group Field.
- ParquetGroupField(LogicalType, int, int, boolean, List<ParquetField>) - Constructor for class org.apache.flink.formats.parquet.vector.type.ParquetGroupField
- ParquetInputFile - Class in org.apache.flink.formats.parquet
-
Parquet
InputFileimplementation,ParquetInputFile.newStream()call will delegate to FlinkFSDataInputStream. - ParquetInputFile(FSDataInputStream, long) - Constructor for class org.apache.flink.formats.parquet.ParquetInputFile
- ParquetPrimitiveField - Class in org.apache.flink.formats.parquet.vector.type
-
Field that represent parquet's primitive field.
- ParquetPrimitiveField(LogicalType, boolean, ColumnDescriptor, int) - Constructor for class org.apache.flink.formats.parquet.vector.type.ParquetPrimitiveField
- ParquetProtoWriterBuilder(OutputFile, Class<T>) - Constructor for class org.apache.flink.formats.parquet.protobuf.ParquetProtoWriters.ParquetProtoWriterBuilder
- ParquetProtoWriters - Class in org.apache.flink.formats.parquet.protobuf
-
Convenience builder for creating
ParquetWriterFactoryinstances for Protobuf classes. - ParquetProtoWriters.ParquetProtoWriterBuilder<T extends com.google.protobuf.Message> - Class in org.apache.flink.formats.parquet.protobuf
-
The builder for Protobuf
ParquetWriter. - ParquetReaderBatch(WritableColumnVector[], VectorizedColumnBatch, Pool.Recycler<ParquetVectorizedInputFormat.ParquetReaderBatch<T>>) - Constructor for class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat.ParquetReaderBatch
- ParquetRowDataBuilder - Class in org.apache.flink.formats.parquet.row
-
RowDataofParquetWriter.Builder. - ParquetRowDataBuilder(OutputFile, RowType, boolean) - Constructor for class org.apache.flink.formats.parquet.row.ParquetRowDataBuilder
- ParquetRowDataBuilder.FlinkParquetBuilder - Class in org.apache.flink.formats.parquet.row
-
Flink Row
ParquetBuilder. - ParquetRowDataWriter - Class in org.apache.flink.formats.parquet.row
-
Writes a record to the Parquet API with the expected schema in order to be written to a file.
- ParquetRowDataWriter(RecordConsumer, RowType, GroupType, boolean, Configuration) - Constructor for class org.apache.flink.formats.parquet.row.ParquetRowDataWriter
- ParquetSchemaConverter - Class in org.apache.flink.formats.parquet.utils
-
Schema converter converts Parquet schema to and from Flink internal types.
- ParquetSchemaConverter() - Constructor for class org.apache.flink.formats.parquet.utils.ParquetSchemaConverter
- ParquetSplitReaderUtil - Class in org.apache.flink.formats.parquet.vector
-
Util for generating
ParquetColumnarRowSplitReader. - ParquetSplitReaderUtil() - Constructor for class org.apache.flink.formats.parquet.vector.ParquetSplitReaderUtil
- ParquetVectorizedInputFormat<T,
SplitT extends org.apache.flink.connector.file.src.FileSourceSplit> - Class in org.apache.flink.formats.parquet -
Parquet
BulkFormatthat reads data from the file toVectorizedColumnBatchin vectorized mode. - ParquetVectorizedInputFormat(SerializableConfiguration, RowType, ColumnBatchFactory<SplitT>, int, boolean, boolean) - Constructor for class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
- ParquetVectorizedInputFormat.ParquetReaderBatch<T> - Class in org.apache.flink.formats.parquet
-
Reader batch that provides writing and reading capabilities.
- ParquetWriterFactory<T> - Class in org.apache.flink.formats.parquet
-
A factory that creates a Parquet
BulkWriter. - ParquetWriterFactory(ParquetBuilder<T>) - Constructor for class org.apache.flink.formats.parquet.ParquetWriterFactory
-
Creates a new ParquetWriterFactory using the given builder to assemble the ParquetWriter.
R
- reachedEnd() - Method in class org.apache.flink.formats.parquet.vector.ParquetColumnarRowSplitReader
-
Method used to check if the end of the input is reached.
- readAndNewVector(int, WritableColumnVector) - Method in class org.apache.flink.formats.parquet.vector.reader.NestedPrimitiveColumnReader
- readBatch(int, int, WritableBooleanVector) - Method in class org.apache.flink.formats.parquet.vector.reader.BooleanColumnReader
- readBatch(int, int, WritableBytesVector) - Method in class org.apache.flink.formats.parquet.vector.reader.BytesColumnReader
- readBatch(int, int, WritableByteVector) - Method in class org.apache.flink.formats.parquet.vector.reader.ByteColumnReader
- readBatch(int, int, WritableDoubleVector) - Method in class org.apache.flink.formats.parquet.vector.reader.DoubleColumnReader
- readBatch(int, int, WritableFloatVector) - Method in class org.apache.flink.formats.parquet.vector.reader.FloatColumnReader
- readBatch(int, int, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.IntColumnReader
- readBatch(int, int, WritableLongVector) - Method in class org.apache.flink.formats.parquet.vector.reader.LongColumnReader
- readBatch(int, int, WritableShortVector) - Method in class org.apache.flink.formats.parquet.vector.reader.ShortColumnReader
- readBatch(int, int, WritableTimestampVector) - Method in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
- readBatch(int, int, VECTOR) - Method in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
-
Read batch from
AbstractColumnReader.runLenDecoderandAbstractColumnReader.dataInputStream. - readBatch(int, int, VECTOR) - Method in class org.apache.flink.formats.parquet.vector.reader.FixedLenBytesColumnReader
- readBatchFromDictionaryIds(int, int, WritableBooleanVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.BooleanColumnReader
- readBatchFromDictionaryIds(int, int, WritableBytesVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.BytesColumnReader
- readBatchFromDictionaryIds(int, int, WritableByteVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.ByteColumnReader
- readBatchFromDictionaryIds(int, int, WritableDoubleVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.DoubleColumnReader
- readBatchFromDictionaryIds(int, int, WritableFloatVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.FloatColumnReader
- readBatchFromDictionaryIds(int, int, WritableIntVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.IntColumnReader
- readBatchFromDictionaryIds(int, int, WritableLongVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.LongColumnReader
- readBatchFromDictionaryIds(int, int, WritableShortVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.ShortColumnReader
- readBatchFromDictionaryIds(int, int, WritableTimestampVector, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
- readBatchFromDictionaryIds(int, int, VECTOR, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
-
Decode dictionary ids to data.
- readBatchFromDictionaryIds(int, int, VECTOR, WritableIntVector) - Method in class org.apache.flink.formats.parquet.vector.reader.FixedLenBytesColumnReader
- readBoolean() - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readBoolean() - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readBoolean(int) - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readBoolean(int) - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readBytes() - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readBytes() - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readBytes(int) - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readBytes(int) - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readDouble() - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readDouble() - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readDouble(int) - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readDouble(int) - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readFloat() - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readFloat() - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readFloat(int) - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readFloat(int) - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readInteger() - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readInteger() - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readInteger(int) - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readInteger(int) - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readLong() - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readLong() - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readLong(int) - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readLong(int) - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readPage() - Method in class org.apache.flink.formats.parquet.vector.reader.NestedPrimitiveColumnReader
- readSmallInt() - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readSmallInt() - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readSmallInt(int) - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readSmallInt(int) - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readTimestamp() - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readTimestamp() - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readTimestamp() - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.TypesFromInt96PageReader
- readTimestamp(int) - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readTimestamp(int) - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readTimestamp(int) - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.TypesFromInt96PageReader
- readTinyInt() - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readTinyInt() - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readTinyInt(int) - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readTinyInt(int) - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- readToVector(int, WritableColumnVector) - Method in class org.apache.flink.formats.parquet.vector.reader.NestedColumnReader
- readToVector(int, WritableColumnVector) - Method in class org.apache.flink.formats.parquet.vector.reader.NestedPrimitiveColumnReader
- readToVector(int, VECTOR) - Method in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
-
Reads `total` values from this columnReader into column.
- readToVector(int, VECTOR) - Method in interface org.apache.flink.formats.parquet.vector.reader.ColumnReader
- readValueDictionaryId() - Method in interface org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReader
- readValueDictionaryId() - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- recycle() - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat.ParquetReaderBatch
- reportStatistics(List<Path>, DataType) - Method in class org.apache.flink.formats.parquet.ParquetColumnarRowInputFormat
- reportStatistics(List<Path>, DataType) - Method in class org.apache.flink.formats.parquet.ParquetFileFormatFactory.ParquetBulkDecodingFormat
- requiredOptions() - Method in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
- reserveDictionaryIds(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- reset() - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- restoreReader(Configuration, SplitT) - Method in class org.apache.flink.formats.parquet.ParquetVectorizedInputFormat
- RLEIntIterator(RunLengthBitPackingHybridDecoder) - Constructor for class org.apache.flink.formats.parquet.vector.reader.NestedPrimitiveColumnReader.RLEIntIterator
- RowPosition - Class in org.apache.flink.formats.parquet.vector.position
-
To represent struct's position in repeated type.
- RowPosition(boolean[], int) - Constructor for class org.apache.flink.formats.parquet.vector.position.RowPosition
- runLenDecoder - Variable in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
-
Run length decoder for data and dictionary.
S
- seekToRow(long) - Method in class org.apache.flink.formats.parquet.vector.ParquetColumnarRowSplitReader
-
Seek to a particular row number.
- self() - Method in class org.apache.flink.formats.parquet.protobuf.ParquetProtoWriters.ParquetProtoWriterBuilder
- self() - Method in class org.apache.flink.formats.parquet.row.ParquetRowDataBuilder
- SerializableConfiguration - Class in org.apache.flink.formats.parquet.utils
-
Wrap
Configurationto a serializable class. - SerializableConfiguration(Configuration) - Constructor for class org.apache.flink.formats.parquet.utils.SerializableConfiguration
- setDictionary(Dictionary) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- setInt(int, int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- setInts(int, int, int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- setInts(int, int, int[], int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- setIntsFromBinary(int, int, byte[], int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- setLong(int, long) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- setLongsFromBinary(int, int, byte[], int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- setNullAt(int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- setNulls(int, int) - Method in class org.apache.flink.formats.parquet.vector.ParquetDecimalVector
- ShortColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
-
Short
ColumnReader. - ShortColumnReader(ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.ShortColumnReader
- skip() - Method in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- supportLazyDecode() - Method in class org.apache.flink.formats.parquet.vector.reader.AbstractColumnReader
-
Support lazy dictionary ids decode.
- supportLazyDecode() - Method in class org.apache.flink.formats.parquet.vector.reader.BooleanColumnReader
- supportLazyDecode() - Method in class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
T
- TIMESTAMP_TIME_UNIT - Static variable in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
- TimestampColumnReader - Class in org.apache.flink.formats.parquet.vector.reader
-
Timestamp
ColumnReader. - TimestampColumnReader(boolean, ColumnDescriptor, PageReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.TimestampColumnReader
- toString() - Method in class org.apache.flink.formats.parquet.vector.type.ParquetField
- TypesFromInt96PageReader(Dictionary, boolean) - Constructor for class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.TypesFromInt96PageReader
- TypesFromInt96PageReader(ValuesReader, boolean) - Constructor for class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.TypesFromInt96PageReader
U
- UTC_TIMEZONE - Static variable in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
V
- valuesReader - Variable in class org.apache.flink.formats.parquet.vector.reader.ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader
- ValuesReaderIntIterator(ValuesReader) - Constructor for class org.apache.flink.formats.parquet.vector.reader.NestedPrimitiveColumnReader.ValuesReaderIntIterator
W
- withoutExtraFields() - Static method in interface org.apache.flink.formats.parquet.vector.ColumnBatchFactory
- write(RowData) - Method in class org.apache.flink.formats.parquet.row.ParquetRowDataWriter
-
It writes a record to Parquet.
- WRITE_INT64_TIMESTAMP - Static variable in class org.apache.flink.formats.parquet.ParquetFileFormatFactory
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form