Interface VertexParallelismAndInputInfosDecider
- All Known Implementing Classes:
DefaultVertexParallelismAndInputInfosDecider
public interface VertexParallelismAndInputInfosDecider
VertexParallelismAndInputInfosDecider is responsible for deciding the parallelism and
JobVertexInputInfos of a job vertex, based on the information of the consumed blocking
results.-
Method Summary
Modifier and TypeMethodDescriptionintcomputeSourceParallelismUpperBound(JobVertexID jobVertexId, int maxParallelism) Compute source parallelism upper bound for the source vertex.decideParallelismAndInputInfosForVertex(JobVertexID jobVertexId, List<BlockingInputInfo> consumedResults, int vertexInitialParallelism, int vertexMinParallelism, int vertexMaxParallelism) Decide the parallelism andJobVertexInputInfos for this job vertex.longGet the average size of data volume to expect each task instance to process.
-
Method Details
-
decideParallelismAndInputInfosForVertex
ParallelismAndInputInfos decideParallelismAndInputInfosForVertex(JobVertexID jobVertexId, List<BlockingInputInfo> consumedResults, int vertexInitialParallelism, int vertexMinParallelism, int vertexMaxParallelism) Decide the parallelism andJobVertexInputInfos for this job vertex.- Parameters:
jobVertexId- The job vertex idconsumedResults- The information of consumed blocking resultsvertexInitialParallelism- The initial parallelism of the job vertex. If it's a positive number, it will be respected. If it's not set(equals toExecutionConfig.PARALLELISM_DEFAULT), a parallelism will be automatically decided for the vertex.vertexMinParallelism- The min parallelism of the job vertex.vertexMaxParallelism- The max parallelism of the job vertex.- Returns:
- the parallelism and vertex input infos.
-
computeSourceParallelismUpperBound
Compute source parallelism upper bound for the source vertex.- Parameters:
jobVertexId- The job vertex idmaxParallelism- The max parallelism of the job vertex.- Returns:
- the upper bound parallelism for the source vertex.
-
getDataVolumePerTask
long getDataVolumePerTask()Get the average size of data volume to expect each task instance to process.- Returns:
- the data volume.
-