Class AbstractAsyncStateStreamOperator<OUT>
- All Implemented Interfaces:
Serializable,org.apache.flink.api.common.state.CheckpointListener,KeyContext,KeyContextHandler,StreamOperator<OUT>,StreamOperatorStateHandler.CheckpointedStreamOperator,YieldingOperator<OUT>,AsyncStateProcessing,AsyncStateProcessingOperator
- Direct Known Subclasses:
AbstractAsyncStateUdfStreamOperator
AbstractStreamOperator the ability to
perform AsyncStateProcessing. The aim is to make any subclass of AbstractStreamOperator could manipulate async state with only a change of base class.- See Also:
-
Field Summary
FieldsFields inherited from class org.apache.flink.streaming.api.operators.AbstractStreamOperator
combinedWatermark, config, lastRecordAttributes1, lastRecordAttributes2, latencyStats, metrics, output, processingTimeService, stateHandler, stateKeySelector1, stateKeySelector2, timeServiceManager -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescription<K> voidasyncProcessWithKey(K key, org.apache.flink.util.function.ThrowingRunnable<Exception> processing) Asynchronously process a code with a key provided.final voidInitialize necessary state components forAbstractStreamOperator.voidclose()This method is called at the very end of the operator's life, both in the case of a successful completion of the operation, and in the case of a failure and canceling.voidvoidfinish()This method is called at the end of data processing.<K> AsyncKeyedStateBackend<K>final DeclarationManagerGet the declaration manager for user-logic declaring.Get theElementOrderof this operator.<K,N> InternalTimerService<N> getInternalTimerService(String name, org.apache.flink.api.common.typeutils.TypeSerializer<N> namespaceSerializer, Triggerable<K, N> triggerable) Returns aInternalTimerServicethat can be used to query current processing time and event time and to set timers.<N,S extends org.apache.flink.api.common.state.v2.State, T>
SgetOrCreateKeyedState(N defaultNamespace, org.apache.flink.api.common.typeutils.TypeSerializer<N> namespaceSerializer, org.apache.flink.api.common.state.v2.StateDescriptor<T> stateDescriptor) Create new state (v2) based on new state descriptor.final <T> org.apache.flink.util.function.ThrowingConsumer<StreamRecord<T>,Exception> getRecordProcessor(int inputId) Get the record processor that could process record from input, which is the only entry for async processing.booleanIndicates whether this operator is enabling the async state.voidnewKeySelected(Object newKey) A hook that will be invoked after a new key is selected.final voidA callback that will be triggered after an element finishesprocessElement.voidpostProcessWatermark(Watermark watermark) A hook that will be invoked after finishing advancing the watermark.voidprepareSnapshotPreBarrier(long checkpointId) This method is called when the operator should do a snapshot, before it emits its own checkpoint barrier.preProcessWatermark(Watermark watermark) A hook that will be triggered when receiving a watermark.final voidpreserveRecordOrderAndProcess(org.apache.flink.util.function.ThrowingRunnable<Exception> processing) Check the order of same-key record, and then process the record.voidprocessRecordAttributes(RecordAttributes recordAttributes) voidprocessRecordAttributes1(RecordAttributes recordAttributes) voidprocessRecordAttributes2(RecordAttributes recordAttributes) voidprocessWatermark(WatermarkEvent watermark) final voidprocessWatermark(Watermark mark) Process a watermark when receiving it.voidprocessWatermark1(WatermarkEvent watermark) voidprocessWatermark1Internal(WatermarkEvent watermark) voidprocessWatermark2(WatermarkEvent watermark) voidprocessWatermark2Internal(WatermarkEvent watermark) voidprocessWatermarkInternal(WatermarkEvent watermark) voidprocessWatermarkStatus(WatermarkStatus watermarkStatus) protected voidprocessWatermarkStatus(WatermarkStatus watermarkStatus, int index) protected voidfinal <T> voidsetAsyncKeyedContextElement(StreamRecord<T> record, org.apache.flink.api.java.functions.KeySelector<T, ?> keySelector) Set key context for async state processing.voidsetKeyContextElement1(StreamRecord record) voidsetKeyContextElement2(StreamRecord record) Methods inherited from class org.apache.flink.streaming.api.operators.AbstractStreamOperator
getContainingTask, getExecutionConfig, getKeyedStateBackend, getKeyedStateStore, getMetricGroup, getOperatorConfig, getOperatorID, getOperatorName, getOperatorStateBackend, getOrCreateKeyedState, getPartitionedState, getPartitionedState, getProcessingTimeService, getRuntimeContext, getStateKeySelector1, getStateKeySelector2, getTimeServiceManager, getUserCodeClassloader, hasKeyContext1, hasKeyContext2, initializeState, initializeState, isUsingCustomRawKeyedState, notifyCheckpointAborted, notifyCheckpointComplete, open, processLatencyMarker, processLatencyMarker1, processLatencyMarker2, processWatermark1, processWatermark2, processWatermarkStatus1, processWatermarkStatus2, setCurrentKey, setMailboxExecutor, setProcessingTimeService, setup, snapshotState, snapshotState, useSplittableTimersMethods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface org.apache.flink.streaming.api.operators.KeyContextHandler
hasKeyContextMethods inherited from interface org.apache.flink.streaming.api.operators.StreamOperator
getOperatorAttributes
-
Field Details
-
declarationManager
-
-
Constructor Details
-
AbstractAsyncStateStreamOperator
public AbstractAsyncStateStreamOperator()
-
-
Method Details
-
beforeInitializeStateHandler
public final void beforeInitializeStateHandler()Initialize necessary state components forAbstractStreamOperator.- Overrides:
beforeInitializeStateHandlerin classAbstractStreamOperator<OUT>
-
isAsyncStateProcessingEnabled
public boolean isAsyncStateProcessingEnabled()Description copied from class:AbstractStreamOperatorIndicates whether this operator is enabling the async state. Can be overridden by subclasses.- Specified by:
isAsyncStateProcessingEnabledin interfaceAsyncStateProcessing- Overrides:
isAsyncStateProcessingEnabledin classAbstractStreamOperator<OUT>- Returns:
- ture if async state processing is enabled.
-
getElementOrder
Description copied from interface:AsyncStateProcessingOperatorGet theElementOrderof this operator.- Specified by:
getElementOrderin interfaceAsyncStateProcessingOperator
-
setAsyncKeyedContextElement
public final <T> void setAsyncKeyedContextElement(StreamRecord<T> record, org.apache.flink.api.java.functions.KeySelector<T, ?> keySelector) throws ExceptionDescription copied from interface:AsyncStateProcessingOperatorSet key context for async state processing.- Specified by:
setAsyncKeyedContextElementin interfaceAsyncStateProcessingOperator- Type Parameters:
T- the type of the record.- Parameters:
record- the record.keySelector- the key selector to select a key from record.- Throws:
Exception
-
newKeySelected
A hook that will be invoked after a new key is selected. It is not recommended to perform async state here. Only some synchronous logic is suggested.- Parameters:
newKey- the new key selected.
-
postProcessElement
public final void postProcessElement()Description copied from interface:AsyncStateProcessingOperatorA callback that will be triggered after an element finishesprocessElement.- Specified by:
postProcessElementin interfaceAsyncStateProcessingOperator
-
preserveRecordOrderAndProcess
public final void preserveRecordOrderAndProcess(org.apache.flink.util.function.ThrowingRunnable<Exception> processing) Description copied from interface:AsyncStateProcessingOperatorCheck the order of same-key record, and then process the record. Mainly used when theAsyncStateProcessingOperator.getElementOrder()returnsElementOrder.RECORD_ORDER.- Specified by:
preserveRecordOrderAndProcessin interfaceAsyncStateProcessingOperator- Parameters:
processing- the record processing logic.
-
asyncProcessWithKey
public <K> void asyncProcessWithKey(K key, org.apache.flink.util.function.ThrowingRunnable<Exception> processing) Description copied from interface:AsyncStateProcessingOperatorAsynchronously process a code with a key provided.- Specified by:
asyncProcessWithKeyin interfaceAsyncStateProcessingOperator- Type Parameters:
K- the type of key.- Parameters:
key- the specified key.processing- the process logic.
-
getDeclarationManager
Description copied from interface:AsyncStateProcessingOperatorGet the declaration manager for user-logic declaring.- Specified by:
getDeclarationManagerin interfaceAsyncStateProcessingOperator
-
getRecordProcessor
public final <T> org.apache.flink.util.function.ThrowingConsumer<StreamRecord<T>,Exception> getRecordProcessor(int inputId) Description copied from interface:AsyncStateProcessingGet the record processor that could process record from input, which is the only entry for async processing.- Specified by:
getRecordProcessorin interfaceAsyncStateProcessing- Parameters:
inputId- the input identifier, start from 1. Borrow the design fromorg.apache.flink.streaming.api.operators.AbstractInput#inputId. This is only relevant if there is multiple inputs for the instance.
-
getOrCreateKeyedState
public <N,S extends org.apache.flink.api.common.state.v2.State, S getOrCreateKeyedStateT> (@Nonnull N defaultNamespace, @Nonnull org.apache.flink.api.common.typeutils.TypeSerializer<N> namespaceSerializer, @Nonnull org.apache.flink.api.common.state.v2.StateDescriptor<T> stateDescriptor) throws Exception Create new state (v2) based on new state descriptor.- Throws:
Exception
-
prepareSnapshotPreBarrier
Description copied from interface:StreamOperatorThis method is called when the operator should do a snapshot, before it emits its own checkpoint barrier.This method is intended not for any actual state persistence, but only for emitting some data before emitting the checkpoint barrier. Operators that maintain some small transient state that is inefficient to checkpoint (especially when it would need to be checkpointed in a re-scalable way) but can simply be sent downstream before the checkpoint. An example are opportunistic pre-aggregation operators, which have small the pre-aggregation state that is frequently flushed downstream.
Important: This method should not be used for any actual state snapshot logic, because it will inherently be within the synchronous part of the operator's checkpoint. If heavy work is done within this method, it will affect latency and downstream checkpoint alignments.
- Specified by:
prepareSnapshotPreBarrierin interfaceStreamOperator<OUT>- Overrides:
prepareSnapshotPreBarrierin classAbstractStreamOperator<OUT>- Parameters:
checkpointId- The ID of the checkpoint.- Throws:
Exception- Throwing an exception here causes the operator to fail and go into recovery.
-
getInternalTimerService
public <K,N> InternalTimerService<N> getInternalTimerService(String name, org.apache.flink.api.common.typeutils.TypeSerializer<N> namespaceSerializer, Triggerable<K, N> triggerable) Returns aInternalTimerServicethat can be used to query current processing time and event time and to set timers. An operator can have several timer services, where each has its own namespace serializer. Timer services are differentiated by the string key that is given when requesting them, if you call this method with the same key multiple times you will get the same timer service instance in subsequent requests.Timers are always scoped to a key, the currently active key of a keyed stream operation. When a timer fires, this key will also be set as the currently active key.
Each timer has attached metadata, the namespace. Different timer services can have a different namespace type. If you don't need namespace differentiation you can use
VoidNamespaceSerializeras the namespace serializer.- Overrides:
getInternalTimerServicein classAbstractStreamOperator<OUT>- Type Parameters:
N- The type of the timer namespace.- Parameters:
name- The name of the requested timer service. If no service exists under the given name a new one will be created and returned.namespaceSerializer-TypeSerializerfor the timer namespace.triggerable- TheTriggerablethat should be invoked when timers fire
-
setKeyContextElement1
- Specified by:
setKeyContextElement1in interfaceStreamOperator<OUT>- Overrides:
setKeyContextElement1in classAbstractStreamOperator<OUT>- Throws:
Exception
-
setKeyContextElement2
- Specified by:
setKeyContextElement2in interfaceStreamOperator<OUT>- Overrides:
setKeyContextElement2in classAbstractStreamOperator<OUT>- Throws:
Exception
-
getCurrentKey
- Specified by:
getCurrentKeyin interfaceKeyContext- Overrides:
getCurrentKeyin classAbstractStreamOperator<OUT>
-
reportOrForwardLatencyMarker
- Overrides:
reportOrForwardLatencyMarkerin classAbstractStreamOperator<OUT>
-
preProcessWatermark
A hook that will be triggered when receiving a watermark. Some async state can safely go within this method. Return the watermark that should be normally processed.- Parameters:
watermark- the receiving watermark.- Returns:
- the watermark that should be processed. Null if there is no need for following processing.
- Throws:
Exception
-
postProcessWatermark
A hook that will be invoked after finishing advancing the watermark. It is not recommended to perform async state here. Only some synchronous logic is suggested.- Parameters:
watermark- the advanced watermark.- Throws:
Exception
-
processWatermark
Process a watermark when receiving it. Do not override this method since the async processing is difficult to write. Please override the hooks, seepreProcessWatermark(Watermark)andpostProcessWatermark(Watermark). The basic logic of processWatermark with hooks in sync form would be:Watermark watermark = preProcessWatermark(mark); if (watermark != null) { super.processWatermark(watermark); postProcessWatermark(watermark); }- Overrides:
processWatermarkin classAbstractStreamOperator<OUT>- Throws:
Exception
-
processWatermarkStatus
- Overrides:
processWatermarkStatusin classAbstractStreamOperator<OUT>- Throws:
Exception
-
processWatermarkStatus
- Overrides:
processWatermarkStatusin classAbstractStreamOperator<OUT>- Throws:
Exception
-
processRecordAttributes
@Experimental public void processRecordAttributes(RecordAttributes recordAttributes) throws Exception - Overrides:
processRecordAttributesin classAbstractStreamOperator<OUT>- Throws:
Exception
-
processRecordAttributes1
- Overrides:
processRecordAttributes1in classAbstractStreamOperator<OUT>
-
processRecordAttributes2
- Overrides:
processRecordAttributes2in classAbstractStreamOperator<OUT>
-
processWatermarkInternal
- Throws:
Exception
-
processWatermark1Internal
- Throws:
Exception
-
processWatermark2Internal
- Throws:
Exception
-
processWatermark
- Overrides:
processWatermarkin classAbstractStreamOperator<OUT>- Throws:
Exception
-
processWatermark1
- Overrides:
processWatermark1in classAbstractStreamOperator<OUT>- Throws:
Exception
-
processWatermark2
- Overrides:
processWatermark2in classAbstractStreamOperator<OUT>- Throws:
Exception
-
getAsyncKeyedStateBackend
-
drainStateRequests
public void drainStateRequests() -
finish
Description copied from interface:StreamOperatorThis method is called at the end of data processing.The method is expected to flush all remaining buffered data. Exceptions during this flushing of buffered data should be propagated, in order to cause the operation to be recognized as failed, because the last data items are not processed properly.
After this method is called, no more records can be produced for the downstream operators.
WARNING: It is not safe to use this method to commit any transactions or other side effects! You can use this method to flush any buffered data that can later on be committed e.g. in a
CheckpointListener.notifyCheckpointComplete(long).NOTE:This method does not need to close any resources. You should release external resources in the
StreamOperator.close()method.- Specified by:
finishin interfaceStreamOperator<OUT>- Overrides:
finishin classAbstractStreamOperator<OUT>- Throws:
Exception- An exception in this method causes the operator to fail.
-
close
Description copied from interface:StreamOperatorThis method is called at the very end of the operator's life, both in the case of a successful completion of the operation, and in the case of a failure and canceling.This method is expected to make a thorough effort to release all resources that the operator has acquired.
NOTE:It can not emit any records! If you need to emit records at the end of processing, do so in the
StreamOperator.finish()method.- Specified by:
closein interfaceStreamOperator<OUT>- Overrides:
closein classAbstractStreamOperator<OUT>- Throws:
Exception
-