class CatalogFileIndex extends FileIndex

A FileIndex for a metastore catalog table.

Linear Supertypes
FileIndex, AnyRef, Any

Instance Constructors

  1. new CatalogFileIndex(sparkSession: SparkSession, table: CatalogTable, sizeInBytes: Long)

    sparkSession: a SparkSession
    table: the metadata of the table
    sizeInBytes: the table's data size in bytes
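
The constructor arguments can be assembled from an existing session, as in the minimal sketch below. It assumes the table already exists in the session catalog and that its size should come from catalog statistics when they are present, falling back to the session's default size otherwise. The object name, the `indexFor` helper, and the `tableName` parameter are hypothetical. `CatalogFileIndex` and `sessionState` are internal Spark APIs, so details may vary between Spark versions.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.catalyst.TableIdentifier
import org.apache.spark.sql.execution.datasources.CatalogFileIndex

object CatalogFileIndexSketch {
  // Builds an index for an existing metastore table.
  // `tableName` is a hypothetical parameter; the table must already be registered.
  def indexFor(spark: SparkSession, tableName: String): CatalogFileIndex = {
    // Look up the table metadata (a CatalogTable) in the session catalog.
    val table = spark.sessionState.catalog.getTableMetadata(TableIdentifier(tableName))

    // Assumption: prefer the size recorded in catalog statistics,
    // otherwise use the session's configured default size.
    val sizeInBytes = table.stats
      .map(_.sizeInBytes.toLong)
      .getOrElse(spark.sessionState.conf.defaultSizeInBytes)

    new CatalogFileIndex(spark, table, sizeInBytes)
  }
}
```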

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  5. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native() @HotSpotIntrinsicCandidate()
  6. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  7. def equals(o: Any): Boolean
    Definition Classes
    CatalogFileIndex → AnyRef → Any
  8. def filterPartitions(filters: Seq[Expression]): InMemoryFileIndex

    Returns an InMemoryFileIndex for this table restricted to the subset of partitions specified by the given partition-pruning filters.

    filters: partition-pruning filters

  9. final def getClass(): Class[_]
    Definition Classes
    AnyRef → Any
    Annotations
    @native() @HotSpotIntrinsicCandidate()
  10. val hadoopConf: Configuration
    Attributes
    protected
  11. def hashCode(): Int
    Definition Classes
    CatalogFileIndex → AnyRef → Any
  12. def inputFiles: Array[String]

    Returns the list of files that will be read when scanning this relation. This call may be very expensive for large tables.

    Definition Classes
    CatalogFileIndex → FileIndex
  13. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  14. def listFiles(partitionFilters: Seq[Expression], dataFilters: Seq[Expression]): Seq[PartitionDirectory]

    Returns all valid files grouped into partitions when the data is partitioned. If the data is unpartitioned, this will return a single partition with no partition values.

    partitionFilters: The filters used to prune which partitions are returned. These filters must only refer to partition columns and this method will only return files where these predicates are guaranteed to evaluate to true. Thus, these filters will not need to be evaluated again on the returned data.

    dataFilters: Filters that can be applied on non-partitioned columns. The implementation does not need to guarantee these filters are applied, i.e. the execution engine will ensure these filters are still applied on the returned files.

    Definition Classes
    CatalogFileIndex → FileIndex
  15. def metadataOpsTimeNs: Option[Long]

    Returns an optional metadata operation time, in nanoseconds, for listing files.

    File listing happens during query optimization (in order to get the proper statistics), and its cost should be reported as a metric during physical execution. To do that, some implementations save the file listing time, and physical execution calls this method to read it and update the metrics.

    Definition Classes
    FileIndex
  16. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  17. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native() @HotSpotIntrinsicCandidate()
  18. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @native() @HotSpotIntrinsicCandidate()
  19. def partitionSchema: StructType

    Schema of the partitioning columns, or the empty schema if the table is not partitioned.

    Definition Classes
    CatalogFileIndex → FileIndex
  20. def refresh(): Unit

    Refreshes any cached file listings.

    Definition Classes
    CatalogFileIndex → FileIndex
  21. def rootPaths: Seq[Path]

    Returns the list of root input paths from which the catalog will get files. There may be a single root path from which partitions are discovered, or individual partitions may be specified by each path.

    Definition Classes
    CatalogFileIndex → FileIndex
  22. val sizeInBytes: Long

    Sum of table file sizes, in bytes

    Definition Classes
    CatalogFileIndex → FileIndex
  23. final def synchronized[T0](arg0: ⇒ T0): T0
    Definition Classes
    AnyRef
  24. val table: CatalogTable
  25. def toString(): String
    Definition Classes
    AnyRef → Any
  26. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  27. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... ) @native()
  28. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
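
The partition-pruning members listed above (filterPartitions, listFiles, inputFiles) can be exercised together, as in the hedged sketch below. It assumes a table partitioned by a hypothetical integer column named `year`, and builds the filter directly from Catalyst expressions because these methods take `Seq[Expression]`. The object and method names are illustrative only, and these are internal APIs whose behaviour may differ between Spark versions.

```scala
import org.apache.spark.sql.catalyst.expressions.{AttributeReference, EqualTo, Expression, Literal}
import org.apache.spark.sql.execution.datasources.{CatalogFileIndex, InMemoryFileIndex}
import org.apache.spark.sql.types.IntegerType

object PartitionPruningSketch {
  // Hypothetical partition column: "year" of IntegerType.
  private def yearFilter(year: Int): Seq[Expression] = {
    val yearAttr = AttributeReference("year", IntegerType)()
    Seq(EqualTo(yearAttr, Literal(year)))
  }

  // Files that a scan restricted to one year would read.
  def filesForYear(index: CatalogFileIndex, year: Int): Array[String] = {
    // filterPartitions asks the catalog for the matching partitions only
    // and returns an InMemoryFileIndex over them.
    val pruned: InMemoryFileIndex = index.filterPartitions(yearFilter(year))
    pruned.inputFiles
  }

  // Number of partitions that survive pruning.
  def partitionCountForYear(index: CatalogFileIndex, year: Int): Int = {
    // No data filters are passed here; per the description above,
    // the engine re-applies data filters on the returned files anyway.
    index.listFiles(yearFilter(year), Nil).size
  }
}
```

Filtering first and then calling inputFiles on the pruned index avoids listing every partition. Calling inputFiles directly on the CatalogFileIndex covers the whole table, which, as noted above, may be very expensive for large tables.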

Deprecated Value Members

  1. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] ) @Deprecated
    Deprecated
