Interface ParquetDataColumnReader

All Known Implementing Classes:
ParquetDataColumnReaderFactory.DefaultParquetDataColumnReader, ParquetDataColumnReaderFactory.TypesFromInt96PageReader

public interface ParquetDataColumnReader
The interface to wrap the underlying Parquet dictionary and non dictionary encoded page reader.
  • Method Details

    • initFromPage

      void initFromPage(int valueCount, org.apache.parquet.bytes.ByteBufferInputStream in) throws IOException
      Initialize the reader by page data.
      Parameters:
      valueCount - value count
      in - page data
      Throws:
      IOException
    • readValueDictionaryId

      int readValueDictionaryId()
      Returns:
      the next Dictionary ID from the page
    • readBoolean

      boolean readBoolean()
      Returns:
      the next Boolean from the page
    • readTinyInt

      int readTinyInt()
      Returns:
      the next TinyInt from the page
    • readSmallInt

      int readSmallInt()
      Returns:
      the next SmallInt from the page
    • readInteger

      int readInteger()
      Returns:
      the next Integer from the page
    • readLong

      long readLong()
      Returns:
      the next Long from the page
    • readFloat

      float readFloat()
      Returns:
      the next Float from the page
    • readDouble

      double readDouble()
      Returns:
      the next Double from the page
    • readBytes

      byte[] readBytes()
      Returns:
      the next Bytes from the page
    • readTimestamp

      org.apache.flink.table.data.TimestampData readTimestamp()
      Returns:
      the next TimestampData from the page
    • getDictionary

      org.apache.parquet.column.Dictionary getDictionary()
      Returns:
      the underlying dictionary if current reader is dictionary encoded
    • readBoolean

      boolean readBoolean(int id)
      Parameters:
      id - in dictionary
      Returns:
      the Boolean from the dictionary by id
    • readTinyInt

      int readTinyInt(int id)
      Parameters:
      id - in dictionary
      Returns:
      the tiny int from the dictionary by id
    • readSmallInt

      int readSmallInt(int id)
      Parameters:
      id - in dictionary
      Returns:
      the Small Int from the dictionary by id
    • readInteger

      int readInteger(int id)
      Parameters:
      id - in dictionary
      Returns:
      the Integer from the dictionary by id
    • readLong

      long readLong(int id)
      Parameters:
      id - in dictionary
      Returns:
      the Long from the dictionary by id
    • readFloat

      float readFloat(int id)
      Parameters:
      id - in dictionary
      Returns:
      the Float from the dictionary by id
    • readDouble

      double readDouble(int id)
      Parameters:
      id - in dictionary
      Returns:
      the Double from the dictionary by id
    • readBytes

      byte[] readBytes(int id)
      Parameters:
      id - in dictionary
      Returns:
      the Bytes from the dictionary by id
    • readTimestamp

      org.apache.flink.table.data.TimestampData readTimestamp(int id)
      Parameters:
      id - in dictionary
      Returns:
      the TimestampData from the dictionary by id