com.nvidia.spark.rapids

ParquetCachedBatchSerializer

class ParquetCachedBatchSerializer extends GpuCachedBatchSerializer with Arm

This class assumes that the data is columnar and that the plugin is enabled. Note: this class should not be referenced directly in source code. It should be loaded by reflection using ShimLoader.newInstanceOf; see ./docs/dev/shims.md.
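
The reflection-based loading described above can be pictured with plain JVM reflection. This is a minimal sketch only: it is not ShimLoader's implementation and does not reproduce its signature. `ReflectiveLoader`, `instantiate`, `Serializer`, and `DemoSerializer` are hypothetical stand-ins so the example runs without the plugin on the classpath. It relies on the target class having a no-argument constructor, as the constructor listing on this page shows for ParquetCachedBatchSerializer.

```scala
// Hypothetical stand-ins: none of these names come from the plugin.
trait Serializer { def name: String }

class DemoSerializer extends Serializer {
  override def name: String = "demo"
}

object ReflectiveLoader {
  // Create an instance from a fully qualified class name using the
  // no-argument constructor, so the caller only needs the name as a string.
  def instantiate[T](className: String): T = {
    val cls = Class.forName(className)
    cls.getDeclaredConstructor().newInstance().asInstanceOf[T]
  }
}

object ReflectiveLoaderExample {
  def main(args: Array[String]): Unit = {
    // In real use the class name would be a string literal or a config value;
    // classOf is used here only to keep the example package-independent.
    val s = ReflectiveLoader.instantiate[Serializer](classOf[DemoSerializer].getName)
    println(s.name)
  }
}
```

Resolving the class by name at run time keeps the calling code free of a compile-time dependency on a version-specific class, which is the usual motivation for this pattern.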

Attributes
protected

Instance Constructors

  1. new ParquetCachedBatchSerializer()

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def buildFilter(predicates: Seq[Expression], cachedAttributes: Seq[Attribute]): (Int, Iterator[CachedBatch]) ⇒ Iterator[CachedBatch]
    Definition Classes
    ParquetCachedBatchSerializer → CachedBatchSerializer
  6. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native() @HotSpotIntrinsicCandidate()
  7. def closeOnExcept[T <: AutoCloseable, V](r: Option[T])(block: (Option[T]) ⇒ V): V

    Executes the provided code block, closing the resources only if an exception occurs

    Definition Classes
    Arm
  8. def closeOnExcept[T <: AutoCloseable, V](r: ArrayBuffer[T])(block: (ArrayBuffer[T]) ⇒ V): V

    Executes the provided code block, closing the resources only if an exception occurs

    Definition Classes
    Arm
  9. def closeOnExcept[T <: AutoCloseable, V](r: Array[T])(block: (Array[T]) ⇒ V): V

    Executes the provided code block, closing the resources only if an exception occurs

    Definition Classes
    Arm
  10. def closeOnExcept[T <: AutoCloseable, V](r: Seq[T])(block: (Seq[T]) ⇒ V): V

    Executes the provided code block, closing the resources only if an exception occurs

    Definition Classes
    Arm
  11. def closeOnExcept[T <: AutoCloseable, V](r: T)(block: (T) ⇒ V): V

    Executes the provided code block, closing the resource only if an exception occurs

    Definition Classes
    Arm
  12. def convertCachedBatchToColumnarBatch(input: RDD[CachedBatch], cacheAttributes: Seq[Attribute], selectedAttributes: Seq[Attribute], conf: SQLConf): RDD[ColumnarBatch]

    Convert the cached data into a ColumnarBatch taking the result data back to the host

    input

    the cached batches that should be converted.

    cacheAttributes

    the attributes of the data in the batch.

    selectedAttributes

    the fields that should be loaded from the data and the order they should appear in the output batch.

    conf

    the configuration for the job.

    returns

    an RDD of the input cached batches transformed into the ColumnarBatch format.

    Definition Classes
    ParquetCachedBatchSerializer → CachedBatchSerializer
  13. def convertCachedBatchToInternalRow(input: RDD[CachedBatch], cacheAttributes: Seq[Attribute], selectedAttributes: Seq[Attribute], conf: SQLConf): RDD[InternalRow]

    Convert the cached batch into InternalRows.

    input

    the cached batches that should be converted.

    cacheAttributes

    the attributes of the data in the batch.

    selectedAttributes

    the fields that should be loaded from the data and the order they should appear in the output rows.

    conf

    the configuration for the job.

    returns

    RDD of the rows that were stored in the cached batches.

    Definition Classes
    ParquetCachedBatchSerializer → CachedBatchSerializer
  14. def convertColumnarBatchToCachedBatch(input: RDD[ColumnarBatch], schema: Seq[Attribute], storageLevel: StorageLevel, conf: SQLConf): RDD[CachedBatch]

    Convert an RDD[ColumnarBatch] into an RDD[CachedBatch] in preparation for caching the data. This method uses the Parquet writer on the GPU to write the cached batch.

    input

    the input RDD to be converted.

    schema

    the schema of the data being stored.

    storageLevel

    where the data will be stored.

    conf

    the config for the query.

    returns

    The data converted into a format more suitable for caching.

    Definition Classes
    ParquetCachedBatchSerializer → CachedBatchSerializer
  15. def convertInternalRowToCachedBatch(input: RDD[InternalRow], schema: Seq[Attribute], storageLevel: StorageLevel, conf: SQLConf): RDD[CachedBatch]

    Convert an RDD[InternalRow] into an RDD[CachedBatch] in preparation for caching the data. This method uses the RowToColumnarIterator and converts one batch at a time.

    input

    the input RDD to be converted.

    schema

    the schema of the data being stored.

    storageLevel

    where the data will be stored.

    conf

    the config for the query.

    returns

    The data converted into a format more suitable for caching.

    Definition Classes
    ParquetCachedBatchSerializer → CachedBatchSerializer
  16. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  17. def equals(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  18. def freeOnExcept[T <: RapidsBuffer, V](r: T)(block: (T) ⇒ V): V

    Executes the provided code block, freeing the RapidsBuffer only if an exception occurs

    Definition Classes
    Arm
  19. def getBytesAllowedPerBatch(conf: SQLConf): Long
  20. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native() @HotSpotIntrinsicCandidate()
  21. def getParquetWriterOptions(useCompression: Boolean, schema: StructType): ParquetWriterOptions
  22. def gpuConvertCachedBatchToColumnarBatch(input: RDD[CachedBatch], cacheAttributes: Seq[Attribute], selectedAttributes: Seq[Attribute], conf: SQLConf): RDD[ColumnarBatch]

    This method decodes the CachedBatch leaving it on the GPU to avoid the extra copying back to the host

    input

    the cached batches that should be converted.

    cacheAttributes

    the attributes of the data in the batch.

    selectedAttributes

    the fields that should be loaded from the data and the order they should appear in the output batch.

    conf

    the configuration for the job.

    returns

    an RDD of the input cached batches transformed into the ColumnarBatch format.

    Definition Classes
    ParquetCachedBatchSerializer → GpuCachedBatchSerializer
  23. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @native() @HotSpotIntrinsicCandidate()
  24. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  25. def isSchemaSupportedByCudf(schema: Seq[Attribute]): Boolean
  26. def isSupportedByCudf(dataType: DataType): Boolean
  27. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  28. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native() @HotSpotIntrinsicCandidate()
  29. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native() @HotSpotIntrinsicCandidate()
  30. def supportsColumnarInput(schema: Seq[Attribute]): Boolean
    Definition Classes
    ParquetCachedBatchSerializer → CachedBatchSerializer
  31. def supportsColumnarOutput(schema: StructType): Boolean
    Definition Classes
    ParquetCachedBatchSerializer → CachedBatchSerializer
  32. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  33. def toString(): String
    Definition Classes
    AnyRef → Any
  34. def vectorTypes(attributes: Seq[Attribute], conf: SQLConf): Option[Seq[String]]
    Definition Classes
    CachedBatchSerializer
  35. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  36. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  37. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  38. def withResource[T <: AutoCloseable, V](h: CloseableHolder[T])(block: (CloseableHolder[T]) ⇒ V): V

    Executes the provided code block and then closes the resource

    Definition Classes
    Arm
  39. def withResource[T <: AutoCloseable, V](r: ArrayBuffer[T])(block: (ArrayBuffer[T]) ⇒ V): V

    Executes the provided code block and then closes the array buffer of resources

    Definition Classes
    Arm
  40. def withResource[T <: AutoCloseable, V](r: Array[T])(block: (Array[T]) ⇒ V): V

    Executes the provided code block and then closes the array of resources

    Definition Classes
    Arm
  41. def withResource[T <: AutoCloseable, V](r: Seq[T])(block: (Seq[T]) ⇒ V): V

    Executes the provided code block and then closes the sequence of resources

    Definition Classes
    Arm
  42. def withResource[T <: AutoCloseable, V](r: Option[T])(block: (Option[T]) ⇒ V): V

    Executes the provided code block and then closes the Option[resource]

    Definition Classes
    Arm
  43. def withResource[T <: AutoCloseable, V](r: T)(block: (T) ⇒ V): V

    Executes the provided code block and then closes the resource

    Definition Classes
    Arm
  44. def withResourceIfAllowed[T, V](r: T)(block: (T) ⇒ V): V

    Executes the provided code block and then closes the value if it is AutoCloseable

    Definition Classes
    Arm
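
The difference between the withResource and closeOnExcept helpers inherited from Arm is easiest to see side by side. The following is a simplified, single-resource sketch of the behavior described in the entries above, not the plugin's code. `ArmSketch` and `TrackedResource` are hypothetical names introduced for illustration, and attaching a close failure as a suppressed exception is an assumption of this sketch.

```scala
// Simplified sketch of the two helper families; not the plugin's implementation.
object ArmSketch {
  // Run the block, then always close the resource.
  def withResource[T <: AutoCloseable, V](r: T)(block: T => V): V = {
    try {
      block(r)
    } finally {
      r.close()
    }
  }

  // Run the block; close the resource only if the block throws.
  def closeOnExcept[T <: AutoCloseable, V](r: T)(block: T => V): V = {
    try {
      block(r)
    } catch {
      case t: Throwable =>
        try {
          r.close()
        } catch {
          // Assumption: keep the original failure and attach the close failure.
          case s: Throwable => t.addSuppressed(s)
        }
        throw t
    }
  }
}

// Hypothetical resource that records whether it has been closed.
class TrackedResource extends AutoCloseable {
  var closed = false
  override def close(): Unit = { closed = true }
}

object ArmSketchExample {
  def main(args: Array[String]): Unit = {
    val a = new TrackedResource
    ArmSketch.withResource(a)(_ => 1)
    println(a.closed) // closed once the block has returned

    val b = new TrackedResource
    ArmSketch.closeOnExcept(b)(_ => 2)
    println(b.closed) // left open: the caller still owns the resource

    val c = new TrackedResource
    try {
      ArmSketch.closeOnExcept(c)(_ => throw new IllegalStateException("boom"))
    } catch {
      case _: IllegalStateException => ()
    }
    println(c.closed) // closed because the block threw
  }
}
```

withResource fits resources whose lifetime ends with the block, while closeOnExcept fits resources that are handed on to the caller on success and must only be released on failure.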

Deprecated Value Members

  1. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] ) @Deprecated
    Deprecated
