Packages

t

com.nvidia.spark.rapids

GpuGenerator

trait GpuGenerator extends Expression with GpuUnevaluable

GPU overrides of Generator, corporate with GpuGenerateExec.

Linear Supertypes
GpuUnevaluable, GpuExpression, Arm, Expression, TreeNode[Expression], TreePatternBits, Product, Equals, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. GpuGenerator
  2. GpuUnevaluable
  3. GpuExpression
  4. Arm
  5. Expression
  6. TreeNode
  7. TreePatternBits
  8. Product
  9. Equals
  10. AnyRef
  11. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Abstract Value Members

  1. abstract def canEqual(that: Any): Boolean
    Definition Classes
    Equals
  2. abstract def children: Seq[Expression]
    Definition Classes
    TreeNode
  3. abstract def elementSchema: StructType

    The output element schema.

  4. abstract def generate(inputBatch: ColumnarBatch, generatorOffset: Int, outer: Boolean): ColumnarBatch

    Apply generator to produce result ColumnarBatch from input batch.

    Apply generator to produce result ColumnarBatch from input batch.

    This is a specialized method for GPU runtime, which is called by GpuGenerateExec who owns the generator. The reason of creating a new method rather than implementing columnarEval is that generator is an integrated Table transformer instead of column transformer in terms of cuDF.

    inputBatch

    projected input data, which ensures appending columns are ahead of generators' inputs. So, generators can distinguish them with an offset.

    generatorOffset

    column offset of generator's input columns in inputBatch

    outer

    when true, each input row will be output at least once, even if the output of the given generator is empty.

    returns

    result ColumnarBatch

  5. abstract def inputSplitIndices(inputBatch: ColumnarBatch, generatorOffset: Int, outer: Boolean, targetSizeBytes: Long): Array[Int]

    Compute split indices for generator's input batches.

    Compute split indices for generator's input batches.

    This is a specialized method for GPU runtime, which is called by GpuGenerateExec to split up input batches to reduce total memory cost during generating. It is necessary because most of generators may produce multiple records for each input record, which make output batch size much larger than input size.

    inputBatch

    projected input data, which ensures appending columns are ahead of generators' inputs. So, generators can distinguish them with an offset.

    generatorOffset

    column offset of generator's input columns in inputBatch

    outer

    when true, each input row will be output at least once, even if the output of the given generator is empty.

    targetSizeBytes

    the target number of bytes for a GPU batch, one of RapidsConf

    returns

    split indices of input batch

  6. abstract def productArity: Int
    Definition Classes
    Product
  7. abstract def productElement(n: Int): Any
    Definition Classes
    Product
  8. abstract def withNewChildrenInternal(newChildren: IndexedSeq[Expression]): Expression
    Attributes
    protected
    Definition Classes
    TreeNode

Concrete Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. def apply(number: Int): TreeNode[_]
    Definition Classes
    TreeNode
  5. def argString(maxFields: Int): String
    Definition Classes
    TreeNode
  6. def asCode: String
    Definition Classes
    TreeNode
  7. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  8. lazy val canonicalized: Expression
    Definition Classes
    GpuExpression → Expression
  9. def checkInputDataTypes(): TypeCheckResult
    Definition Classes
    Expression
  10. def childrenResolved: Boolean
    Definition Classes
    Expression
  11. def clone(): Expression
    Definition Classes
    TreeNode → AnyRef
  12. def closeOnExcept[T <: AutoCloseable, V](r: Option[T])(block: (Option[T]) ⇒ V): V

    Executes the provided code block, closing the resources only if an exception occurs

    Executes the provided code block, closing the resources only if an exception occurs

    Definition Classes
    Arm
  13. def closeOnExcept[T <: AutoCloseable, V](r: ArrayBuffer[T])(block: (ArrayBuffer[T]) ⇒ V): V

    Executes the provided code block, closing the resources only if an exception occurs

    Executes the provided code block, closing the resources only if an exception occurs

    Definition Classes
    Arm
  14. def closeOnExcept[T <: AutoCloseable, V](r: Array[T])(block: (Array[T]) ⇒ V): V

    Executes the provided code block, closing the resources only if an exception occurs

    Executes the provided code block, closing the resources only if an exception occurs

    Definition Classes
    Arm
  15. def closeOnExcept[T <: AutoCloseable, V](r: Seq[T])(block: (Seq[T]) ⇒ V): V

    Executes the provided code block, closing the resources only if an exception occurs

    Executes the provided code block, closing the resources only if an exception occurs

    Definition Classes
    Arm
  16. def closeOnExcept[T <: AutoCloseable, V](r: T)(block: (T) ⇒ V): V

    Executes the provided code block, closing the resource only if an exception occurs

    Executes the provided code block, closing the resource only if an exception occurs

    Definition Classes
    Arm
  17. def collect[B](pf: PartialFunction[Expression, B]): Seq[B]
    Definition Classes
    TreeNode
  18. def collectFirst[B](pf: PartialFunction[Expression, B]): Option[B]
    Definition Classes
    TreeNode
  19. def collectLeaves(): Seq[Expression]
    Definition Classes
    TreeNode
  20. final def columnarEval(batch: ColumnarBatch): Any

    Returns the result of evaluating this expression on the entire ColumnarBatch.

    Returns the result of evaluating this expression on the entire ColumnarBatch. The result of calling this may be a single GpuColumnVector or a scalar value. Scalar values typically happen if they are a part of the expression i.e. col("a") + 100. In this case the 100 is a literal that Add would have to be able to handle.

    By convention any GpuColumnVector returned by columnarEval is owned by the caller and will need to be closed by them. This can happen by putting it into a ColumnarBatch and closing the batch or by closing the vector directly if it is a temporary value.

    Definition Classes
    GpuUnevaluableGpuExpression
  21. final def containsAllPatterns(patterns: TreePattern*): Boolean
    Definition Classes
    TreePatternBits
  22. final def containsAnyPattern(patterns: TreePattern*): Boolean
    Definition Classes
    TreePatternBits
  23. lazy val containsChild: Set[TreeNode[_]]
    Definition Classes
    TreeNode
  24. final def containsPattern(t: TreePattern): Boolean
    Definition Classes
    TreePatternBits
    Annotations
    @inline()
  25. def convertToAst(numFirstTableColumns: Int): AstExpression

    Build an equivalent representation of this expression in a cudf AST.

    Build an equivalent representation of this expression in a cudf AST.

    numFirstTableColumns

    number of columns in the leftmost input table. Spark places the columns of all inputs in a single sequence, while cudf AST uses an explicit table reference to make column indices unique. This parameter helps translate input column references from Spark's single sequence into cudf's separate sequences.

    returns

    top node of the equivalent AST

    Definition Classes
    GpuExpression
  26. def copyTagsFrom(other: Expression): Unit
    Definition Classes
    TreeNode
  27. def dataType: DataType
    Definition Classes
    GpuGenerator → Expression
  28. lazy val deterministic: Boolean
    Definition Classes
    Expression
  29. def disableCoalesceUntilInput(): Boolean

    Override this if your expression cannot allow combining of data from multiple files into a single batch before it operates on them.

    Override this if your expression cannot allow combining of data from multiple files into a single batch before it operates on them. These are for things like getting the input file name. Which for spark is stored in a thread local variable which means we have to jump through some hoops to make this work.

    Definition Classes
    GpuExpression
  30. final def doGenCode(ctx: CodegenContext, ev: ExprCode): ExprCode
    Definition Classes
    GpuExpression → Expression
  31. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  32. def equals(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  33. final def eval(input: InternalRow = null): Any
    Definition Classes
    GpuExpression → Expression
  34. def fastEquals(other: TreeNode[_]): Boolean
    Definition Classes
    TreeNode
  35. def find(f: (Expression) ⇒ Boolean): Option[Expression]
    Definition Classes
    TreeNode
  36. def fixedLenLazyArrayGenerate(inputIterator: Iterator[ColumnarBatch], boundLazyProjectList: Seq[Expression], boundOthersProjectList: Seq[Expression], outputSchema: Array[DataType], outer: Boolean, numOutputRows: GpuMetric, numOutputBatches: GpuMetric, opTime: GpuMetric): Iterator[ColumnarBatch]

    Optimized lazy generation interface which is specialized for fixed length array input.

    Optimized lazy generation interface which is specialized for fixed length array input.

    For some generators (like explode), it is possible to improve performance through lazy evaluation when input schema is fixed length array.

    inputIterator

    input iterator from child plan

    boundLazyProjectList

    lazy expressions bounded with child outputs

    boundOthersProjectList

    other required expressions bounded with child outputs

    outputSchema

    result schema of GpuGenerateExec

    outer

    when true, each input row will be output at least once, even if the output of the given generator is empty.

    numOutputRows

    Gpu spark metric of output rows

    numOutputBatches

    Gpu spark metric of output batches

    opTime

    Gpu spark metric of time on GPU by GpuGenerateExec

    returns

    result iterator

  37. def fixedLenLazyExpressions: Seq[Expression]

    Extract lazy expressions from generator if exists.

    Extract lazy expressions from generator if exists.

    This is a specialized method for GPU runtime, which is called by GpuGenerateExec to determine whether current generation plan can be executed with optimized lazy array generation or not.

    returns

    fixed length lazy expressions for generation. Nil value means no lazy expressions to extract, which indicates fixed length lazy array generation is unavailable.

  38. def flatArguments: Iterator[Any]
    Attributes
    protected
    Definition Classes
    Expression
  39. def flatMap[A](f: (Expression) ⇒ TraversableOnce[A]): Seq[A]
    Definition Classes
    TreeNode
  40. def foldable: Boolean
    Definition Classes
    GpuGenerator → Expression
  41. def foreach(f: (Expression) ⇒ Unit): Unit
    Definition Classes
    TreeNode
  42. def foreachUp(f: (Expression) ⇒ Unit): Unit
    Definition Classes
    TreeNode
  43. def freeOnExcept[T <: RapidsBuffer, V](r: T)(block: (T) ⇒ V): V

    Executes the provided code block, freeing the RapidsBuffer only if an exception occurs

    Executes the provided code block, freeing the RapidsBuffer only if an exception occurs

    Definition Classes
    Arm
  44. def genCode(ctx: CodegenContext): ExprCode
    Definition Classes
    Expression
  45. def generateTreeString(depth: Int, lastChildren: Seq[Boolean], append: (String) ⇒ Unit, verbose: Boolean, prefix: String, addSuffix: Boolean, maxFields: Int, printNodeId: Boolean, indent: Int): Unit
    Definition Classes
    TreeNode
  46. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native() @HotSpotIntrinsicCandidate()
  47. def getDefaultTreePatternBits: BitSet
    Attributes
    protected
    Definition Classes
    TreeNode
  48. def getTagValue[T](tag: TreeNodeTag[T]): Option[T]
    Definition Classes
    TreeNode
  49. def hasSideEffects: Boolean

    Could evaluating this expression cause side-effects, such as throwing an exception?

    Could evaluating this expression cause side-effects, such as throwing an exception?

    Definition Classes
    GpuExpression
  50. def hashCode(): Int
    Definition Classes
    TreeNode → AnyRef → Any
  51. def innerChildren: Seq[TreeNode[_]]
    Definition Classes
    TreeNode
  52. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  53. def isRuleIneffective(ruleId: RuleId): Boolean
    Attributes
    protected
    Definition Classes
    TreeNode
  54. def jsonFields: List[JField]
    Attributes
    protected
    Definition Classes
    TreeNode
  55. final def legacyWithNewChildren(newChildren: Seq[Expression]): Expression
    Attributes
    protected
    Definition Classes
    TreeNode
  56. def makeCopy(newArgs: Array[AnyRef]): Expression
    Definition Classes
    TreeNode
  57. def map[A](f: (Expression) ⇒ A): Seq[A]
    Definition Classes
    TreeNode
  58. def mapChildren(f: (Expression) ⇒ Expression): Expression
    Definition Classes
    TreeNode
  59. def mapProductIterator[B](f: (Any) ⇒ B)(implicit arg0: ClassTag[B]): Array[B]
    Attributes
    protected
    Definition Classes
    TreeNode
  60. def markRuleAsIneffective(ruleId: RuleId): Unit
    Attributes
    protected
    Definition Classes
    TreeNode
  61. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  62. def nodeName: String
    Definition Classes
    TreeNode
  63. val nodePatterns: Seq[TreePattern]
    Attributes
    protected
    Definition Classes
    TreeNode
  64. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native() @HotSpotIntrinsicCandidate()
  65. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native() @HotSpotIntrinsicCandidate()
  66. def nullable: Boolean
    Definition Classes
    GpuGenerator → Expression
  67. def numberedTreeString: String
    Definition Classes
    TreeNode
  68. val origin: Origin
    Definition Classes
    TreeNode
  69. def otherCopyArgs: Seq[AnyRef]
    Attributes
    protected
    Definition Classes
    TreeNode
  70. def p(number: Int): Expression
    Definition Classes
    TreeNode
  71. def prettyJson: String
    Definition Classes
    TreeNode
  72. def prettyName: String
    Definition Classes
    Expression
  73. def productIterator: Iterator[Any]
    Definition Classes
    Product
  74. def productPrefix: String
    Definition Classes
    Product
  75. def references: AttributeSet
    Definition Classes
    Expression
  76. lazy val resolved: Boolean
    Definition Classes
    Expression
  77. final def semanticEquals(other: Expression): Boolean
    Definition Classes
    Expression
  78. def semanticHash(): Int
    Definition Classes
    Expression
  79. def setTagValue[T](tag: TreeNodeTag[T], value: T): Unit
    Definition Classes
    TreeNode
  80. def simpleString(maxFields: Int): String
    Definition Classes
    Expression → TreeNode
  81. def simpleStringWithNodeId(): String
    Definition Classes
    Expression → TreeNode
  82. def sql: String
    Definition Classes
    Expression
  83. def stringArgs: Iterator[Any]
    Attributes
    protected
    Definition Classes
    TreeNode
  84. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  85. def toJSON: String
    Definition Classes
    TreeNode
  86. def toString(): String
    Definition Classes
    Expression → TreeNode → AnyRef → Any
  87. def transform(rule: PartialFunction[Expression, Expression]): Expression
    Definition Classes
    TreeNode
  88. def transformDown(rule: PartialFunction[Expression, Expression]): Expression
    Definition Classes
    TreeNode
  89. def transformDownWithPruning(cond: (TreePatternBits) ⇒ Boolean, ruleId: RuleId)(rule: PartialFunction[Expression, Expression]): Expression
    Definition Classes
    TreeNode
  90. def transformUp(rule: PartialFunction[Expression, Expression]): Expression
    Definition Classes
    TreeNode
  91. def transformUpWithBeforeAndAfterRuleOnChildren(cond: (Expression) ⇒ Boolean, ruleId: RuleId)(rule: PartialFunction[(Expression, Expression), Expression]): Expression
    Definition Classes
    TreeNode
  92. def transformUpWithPruning(cond: (TreePatternBits) ⇒ Boolean, ruleId: RuleId)(rule: PartialFunction[Expression, Expression]): Expression
    Definition Classes
    TreeNode
  93. def transformWithPruning(cond: (TreePatternBits) ⇒ Boolean, ruleId: RuleId)(rule: PartialFunction[Expression, Expression]): Expression
    Definition Classes
    TreeNode
  94. lazy val treePatternBits: BitSet
    Definition Classes
    TreeNode → TreePatternBits
  95. def treeString(append: (String) ⇒ Unit, verbose: Boolean, addSuffix: Boolean, maxFields: Int, printOperatorId: Boolean): Unit
    Definition Classes
    TreeNode
  96. final def treeString(verbose: Boolean, addSuffix: Boolean, maxFields: Int, printOperatorId: Boolean): String
    Definition Classes
    TreeNode
  97. final def treeString: String
    Definition Classes
    TreeNode
  98. def unsetTagValue[T](tag: TreeNodeTag[T]): Unit
    Definition Classes
    TreeNode
  99. final def verboseString(maxFields: Int): String
    Definition Classes
    Expression → TreeNode
  100. def verboseStringWithSuffix(maxFields: Int): String
    Definition Classes
    TreeNode
  101. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  102. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  103. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  104. final def withNewChildren(newChildren: Seq[Expression]): Expression
    Definition Classes
    TreeNode
  105. def withResource[T <: AutoCloseable, V](h: CloseableHolder[T])(block: (CloseableHolder[T]) ⇒ V): V

    Executes the provided code block and then closes the resource

    Executes the provided code block and then closes the resource

    Definition Classes
    Arm
  106. def withResource[T <: AutoCloseable, V](r: ArrayBuffer[T])(block: (ArrayBuffer[T]) ⇒ V): V

    Executes the provided code block and then closes the array buffer of resources

    Executes the provided code block and then closes the array buffer of resources

    Definition Classes
    Arm
  107. def withResource[T <: AutoCloseable, V](r: Array[T])(block: (Array[T]) ⇒ V): V

    Executes the provided code block and then closes the array of resources

    Executes the provided code block and then closes the array of resources

    Definition Classes
    Arm
  108. def withResource[T <: AutoCloseable, V](r: Seq[T])(block: (Seq[T]) ⇒ V): V

    Executes the provided code block and then closes the sequence of resources

    Executes the provided code block and then closes the sequence of resources

    Definition Classes
    Arm
  109. def withResource[T <: AutoCloseable, V](r: Option[T])(block: (Option[T]) ⇒ V): V

    Executes the provided code block and then closes the Option[resource]

    Executes the provided code block and then closes the Option[resource]

    Definition Classes
    Arm
  110. def withResource[T <: AutoCloseable, V](r: T)(block: (T) ⇒ V): V

    Executes the provided code block and then closes the resource

    Executes the provided code block and then closes the resource

    Definition Classes
    Arm
  111. def withResourceIfAllowed[T, V](r: T)(block: (T) ⇒ V): V

    Executes the provided code block and then closes the value if it is AutoCloseable

    Executes the provided code block and then closes the value if it is AutoCloseable

    Definition Classes
    Arm

Deprecated Value Members

  1. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] ) @Deprecated
    Deprecated

Inherited from GpuUnevaluable

Inherited from GpuExpression

Inherited from Arm

Inherited from Expression

Inherited from TreeNode[Expression]

Inherited from TreePatternBits

Inherited from Product

Inherited from Equals

Inherited from AnyRef

Inherited from Any

Ungrouped