Class ContinuousFileMonitoringFunction<OUT>
- All Implemented Interfaces:
Serializable,org.apache.flink.api.common.functions.Function,org.apache.flink.api.common.functions.RichFunction,CheckpointedFunction,SourceFunction<TimestampedFileInputSplit>
FileInputFormat and,
depending on the FileProcessingMode and the FilePathFilter, it is responsible
for:
- Monitoring a user-provided path.
- Deciding which files should be further read and processed.
- Creating the
splitscorresponding to those files. - Assigning them to downstream tasks for further processing.
The splits to be read are forwarded to the downstream ContinuousFileReaderOperator
which can have parallelism greater than one.
IMPORTANT NOTE: Splits are forwarded downstream for reading in ascending modification time order, based on the modification time of the files they belong to.
- See Also:
-
Nested Class Summary
Nested classes/interfaces inherited from interface org.apache.flink.streaming.api.functions.source.legacy.SourceFunction
SourceFunction.SourceContext<T> -
Field Summary
FieldsModifier and TypeFieldDescriptionstatic final longDeprecated.The minimum interval allowed between consecutive path scans. -
Constructor Summary
ConstructorsConstructorDescriptionContinuousFileMonitoringFunction(org.apache.flink.api.common.io.FileInputFormat<OUT> format, FileProcessingMode watchType, int readerParallelism, long interval) Deprecated. -
Method Summary
Modifier and TypeMethodDescriptionvoidcancel()Deprecated.Cancels the source.voidclose()Deprecated.longDeprecated.voidDeprecated.This method is called when the parallel function instance is created during distributed execution.voidopen(org.apache.flink.api.common.functions.OpenContext openContext) Deprecated.voidDeprecated.Starts the source.voidsnapshotState(FunctionSnapshotContext context) Deprecated.This method is called when a snapshot for a checkpoint is requested.Methods inherited from class org.apache.flink.api.common.functions.AbstractRichFunction
getIterationRuntimeContext, getRuntimeContext, setRuntimeContext
-
Field Details
-
MIN_MONITORING_INTERVAL
public static final long MIN_MONITORING_INTERVALDeprecated.The minimum interval allowed between consecutive path scans.NOTE: Only applicable to the
PROCESS_CONTINUOUSLYmode.- See Also:
-
-
Constructor Details
-
ContinuousFileMonitoringFunction
public ContinuousFileMonitoringFunction(org.apache.flink.api.common.io.FileInputFormat<OUT> format, FileProcessingMode watchType, int readerParallelism, long interval) Deprecated.
-
-
Method Details
-
getGlobalModificationTime
@VisibleForTesting public long getGlobalModificationTime()Deprecated. -
initializeState
Deprecated.Description copied from interface:CheckpointedFunctionThis method is called when the parallel function instance is created during distributed execution. Functions typically set up their state storing data structures in this method.- Specified by:
initializeStatein interfaceCheckpointedFunction- Parameters:
context- the context for initializing the operator- Throws:
Exception- Thrown, if state could not be created ot restored.
-
open
Deprecated.- Specified by:
openin interfaceorg.apache.flink.api.common.functions.RichFunction- Overrides:
openin classorg.apache.flink.api.common.functions.AbstractRichFunction- Throws:
Exception
-
run
Deprecated.Description copied from interface:SourceFunctionStarts the source. Implementations use theSourceFunction.SourceContextto emit elements. Sources that checkpoint their state for fault tolerance should use thecheckpoint lockto ensure consistency between the bookkeeping and emitting the elements.Sources that implement
CheckpointedFunctionmust lock on thecheckpoint lockcheckpoint lock (using a synchronized block) before updating internal state and emitting elements, to make both an atomic operation.Refer to the
top-level class docsfor an example.- Specified by:
runin interfaceSourceFunction<OUT>- Parameters:
context- The context to emit elements to and for accessing locks.- Throws:
Exception
-
close
Deprecated.- Specified by:
closein interfaceorg.apache.flink.api.common.functions.RichFunction- Overrides:
closein classorg.apache.flink.api.common.functions.AbstractRichFunction- Throws:
Exception
-
cancel
public void cancel()Deprecated.Description copied from interface:SourceFunctionCancels the source. Most sources will have a while loop inside theSourceFunction.run(SourceContext)method. The implementation needs to ensure that the source will break out of that loop after this method is called.A typical pattern is to have an
"volatile boolean isRunning"flag that is set tofalsein this method. That flag is checked in the loop condition.In case of an ungraceful shutdown (cancellation of the source operator, possibly for failover), the thread that calls
SourceFunction.run(SourceContext)will also beinterrupted) by the Flink runtime, in order to speed up the cancellation (to ensure threads exit blocking methods fast, like I/O, blocking queues, etc.). The interruption happens strictly after this method has been called, so any interruption handler can rely on the fact that this method has completed (for example to ignore exceptions that happen after cancellation).During graceful shutdown (for example stopping a job with a savepoint), the program must cleanly exit the
SourceFunction.run(SourceContext)method soon after this method was called. The Flink runtime will NOT interrupt the source thread during graceful shutdown. Source implementors must ensure that no thread interruption happens on any thread that emits records through theSourceContextfrom theSourceFunction.run(SourceContext)method; otherwise the clean shutdown may fail when threads are interrupted while processing the final records.Because the
SourceFunctioncannot easily differentiate whether the shutdown should be graceful or ungraceful, we recommend that implementors refrain from interrupting any threads that interact with theSourceContextat all. You can rely on the Flink runtime to interrupt the source thread in case of ungraceful cancellation. Any additionally spawned threads that directly emit records through theSourceContextshould use a shutdown method that does not rely on thread interruption.- Specified by:
cancelin interfaceSourceFunction<OUT>
-
snapshotState
Deprecated.Description copied from interface:CheckpointedFunctionThis method is called when a snapshot for a checkpoint is requested. This acts as a hook to the function to ensure that all state is exposed by means previously offered throughFunctionInitializationContextwhen the Function was initialized, or offered now byFunctionSnapshotContextitself.- Specified by:
snapshotStatein interfaceCheckpointedFunction- Parameters:
context- the context for drawing a snapshot of the operator- Throws:
Exception- Thrown, if state could not be created ot restored.
-
SourceFunctionAPI, which is due to be removed. Use the newSourceAPI instead.