Class AsyncSinkWriter<InputT,RequestEntryT extends Serializable>
- All Implemented Interfaces:
AutoCloseable,org.apache.flink.api.connector.sink2.SinkWriter<InputT>,org.apache.flink.api.connector.sink2.StatefulSinkWriter<InputT,BufferedRequestState<RequestEntryT>>
At least once semantics is supported through prepareCommit as outstanding requests are
flushed or completed prior to checkpointing.
Designed to be returned at createWriter time by an AsyncSinkBase.
There are configuration options to customize the buffer size etc.
-
Nested Class Summary
Nested classes/interfaces inherited from interface org.apache.flink.api.connector.sink2.SinkWriter
org.apache.flink.api.connector.sink2.SinkWriter.Context -
Constructor Summary
ConstructorsConstructorDescriptionAsyncSinkWriter(ElementConverter<InputT, RequestEntryT> elementConverter, org.apache.flink.api.connector.sink2.WriterInitContext context, AsyncSinkWriterConfiguration configuration, Collection<BufferedRequestState<RequestEntryT>> states) -
Method Summary
Modifier and TypeMethodDescriptionvoidclose()voidflush(boolean flush) In flight requests will be retried if the sink is still healthy.protected abstract longgetSizeInBytes(RequestEntryT requestEntry) This method allows the getting of the size of aRequestEntryTin bytes.snapshotState(long checkpointId) All in-flight requests that are relevant for the snapshot have been completed, but there may still be request entries in the internal buffers that are yet to be sent to the endpoint.protected voidsubmitRequestEntries(List<RequestEntryT> requestEntries, ResultHandler<RequestEntryT> resultHandler) This method specifies how to persist buffered request entries into the destination.voidMethods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface org.apache.flink.api.connector.sink2.SinkWriter
writeWatermark
-
Constructor Details
-
AsyncSinkWriter
public AsyncSinkWriter(ElementConverter<InputT, RequestEntryT> elementConverter, org.apache.flink.api.connector.sink2.WriterInitContext context, AsyncSinkWriterConfiguration configuration, Collection<BufferedRequestState<RequestEntryT>> states)
-
-
Method Details
-
submitRequestEntries
protected void submitRequestEntries(List<RequestEntryT> requestEntries, ResultHandler<RequestEntryT> resultHandler) This method specifies how to persist buffered request entries into the destination. It is implemented when support for a new destination is added.The method is invoked with a set of request entries according to the buffering hints (and the valid limits of the destination). The logic then needs to create and execute the request asynchronously against the destination (ideally by batching together multiple request entries to increase efficiency). The logic also needs to identify individual request entries that were not persisted successfully and resubmit them using the
requestToRetrycallback.From a threading perspective, the mailbox thread will call this method and initiate the asynchronous request to persist the
requestEntries. NOTE: The client must support asynchronous requests and the method called to persist the records must asynchronously execute and return a future with the results of that request. A thread from the destination client thread pool should complete the request and trigger theresultHandlerto complete the processing of the request entries. TheresultHandleractions will run on the mailbox thread.An example implementation of this method is included:
@Override protected void submitRequestEntries (List<RequestEntryT> records, ResultHandler<RequestEntryT> resultHandler) { Future<Response> response = destinationClient.putRecords(records); response.whenComplete( (response, error) -> { if(error != null && isFatal(error)){ resultHandler.completeExceptionally(error); }else if(error != null){ List<RequestEntryT> retryableFailedRecords = getRetryableFailed(response); resultHandler.retryForEntries(retryableFailedRecords); }else{ resultHandler.complete(); } } ); }During checkpointing, the sink needs to ensure that there are no outstanding in-flight requests.
- Parameters:
requestEntries- a set of request entries that should be sent to the destinationresultHandler- thecompletemethod should be called on this ResultHandler once the processing of therequestEntriesare complete. Any entries that encountered difficulties in persisting should be re-queued throughretryForEntriesby including that element in the collection ofRequestEntryTs passed to theretryForEntriesmethod. All other elements are assumed to have been successfully persisted. In case of encountering fatal exceptions, thecompleteExceptionallymethod should be called.
-
getSizeInBytes
This method allows the getting of the size of aRequestEntryTin bytes. The size in this case is measured as the total bytes that is written to the destination as a result of persisting this particularRequestEntryTrather than the serialized length (which may be the same).- Parameters:
requestEntry- the requestEntry for which we want to know the size- Returns:
- the size of the requestEntry, as defined previously
-
write
public void write(InputT element, org.apache.flink.api.connector.sink2.SinkWriter.Context context) throws IOException, InterruptedException - Specified by:
writein interfaceorg.apache.flink.api.connector.sink2.SinkWriter<InputT>- Throws:
IOExceptionInterruptedException
-
flush
In flight requests will be retried if the sink is still healthy. But if in-flight requests fail after a checkpoint has been triggered and Flink needs to recover from the checkpoint, the (failed) in-flight requests are gone and cannot be retried. Hence, there cannot be any outstanding in-flight requests when a commit is initialized.To this end, all in-flight requests need to completed before proceeding with the commit.
- Specified by:
flushin interfaceorg.apache.flink.api.connector.sink2.SinkWriter<InputT>- Throws:
InterruptedException
-
snapshotState
All in-flight requests that are relevant for the snapshot have been completed, but there may still be request entries in the internal buffers that are yet to be sent to the endpoint. These request entries are stored in the snapshot state so that they don't get lost in case of a failure/restart of the application.- Specified by:
snapshotStatein interfaceorg.apache.flink.api.connector.sink2.StatefulSinkWriter<InputT,RequestEntryT extends Serializable>
-
close
public void close()- Specified by:
closein interfaceAutoCloseable
-
getFatalExceptionCons
-