class SparkReadConf extends AnyRef
A class for common Iceberg configs for Spark reads. Derived from Apache Iceberg's SparkReadConf class.
If a config is set at multiple levels, the following order of precedence is used (top to bottom):
- Read options
- Session configuration
- Table metadata
The most specific value is set in read options and takes precedence over all other configs. If no read option is provided, this class checks the session configuration for any overrides. If no applicable value is found in the session configuration, this class uses the table metadata.
Note this class is NOT meant to be serialized and sent to executors.
- Alphabetic
- By Inheritance
- SparkReadConf
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Instance Constructors
- new SparkReadConf(spark: SparkSession, table: Table, readOptions: Map[String, String])
Value Members
-
final
def
!=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
##(): Int
- Definition Classes
- AnyRef → Any
-
final
def
==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
asInstanceOf[T0]: T0
- Definition Classes
- Any
- def asOfTimestamp(): Long
- def caseSensitive(): Boolean
-
def
clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native() @HotSpotIntrinsicCandidate()
- def endSnapshotId(): Long
-
final
def
eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
def
equals(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
- def fileScanTaskSetId(): String
-
final
def
getClass(): Class[_]
- Definition Classes
- AnyRef → Any
- Annotations
- @native() @HotSpotIntrinsicCandidate()
-
def
handleTimestampWithoutZone(): Boolean
Enables reading a timestamp without time zone as a timestamp with time zone.
Enables reading a timestamp without time zone as a timestamp with time zone.
Generally, this is not safe as a timestamp without time zone is supposed to represent the wall-clock time, i.e. no matter the reader/writer timezone 3PM should always be read as 3PM, but a timestamp with time zone represents instant semantics, i.e. the timestamp is adjusted so that the corresponding time in the reader timezone is displayed.
When set to false (default), an exception must be thrown while reading a timestamp without time zone.
- returns
boolean indicating if reading timestamps without timezone is allowed
-
def
hashCode(): Int
- Definition Classes
- AnyRef → Any
- Annotations
- @native() @HotSpotIntrinsicCandidate()
-
final
def
isInstanceOf[T0]: Boolean
- Definition Classes
- Any
- def localityEnabled(): Boolean
-
final
def
ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
final
def
notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native() @HotSpotIntrinsicCandidate()
-
final
def
notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native() @HotSpotIntrinsicCandidate()
- def orcBatchSize(): Int
- def orcVectorizationEnabled(): Boolean
- def parquetBatchSize(): Int
- def parquetVectorizationEnabled(): Boolean
- def snapshotId(): Long
- def splitLookback(): Int
- def splitLookbackOption(): Integer
- def splitOpenFileCost(): Long
- def splitOpenFileCostOption(): Long
- def splitSize(): Long
- def splitSizeOption(): Long
- def startSnapshotId(): Long
- def streamFromTimestamp(): Long
- def streamingSkipDeleteSnapshots(): Boolean
-
final
def
synchronized[T0](arg0: ⇒ T0): T0
- Definition Classes
- AnyRef
-
def
toString(): String
- Definition Classes
- AnyRef → Any
-
final
def
wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()
-
final
def
wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )