Class PredictionServiceGrpc.PredictionServiceFutureStub (3.64.0)

public static final class PredictionServiceGrpc.PredictionServiceFutureStub extends AbstractFutureStub.PredictionServiceFutureStub>

A stub to allow clients to do ListenableFuture-style rpc calls to service PredictionService.

A service for online predictions and explanations.

Inheritance

java.lang.Object > io.grpc.stub.AbstractStub > io.grpc.stub.AbstractFutureStub > PredictionServiceGrpc.PredictionServiceFutureStub

Methods

build(Channel channel, CallOptions callOptions)

protected PredictionServiceGrpc.PredictionServiceFutureStub build(Channel channel, CallOptions callOptions)
Parameters
Name Description
channel io.grpc.Channel
callOptions io.grpc.CallOptions
Returns
Type Description
PredictionServiceGrpc.PredictionServiceFutureStub
Overrides
io.grpc.stub.AbstractStub.build(io.grpc.Channel,io.grpc.CallOptions)

countTokens(CountTokensRequest request)

public ListenableFuture countTokens(CountTokensRequest request)

Perform a token counting.

Parameter
Name Description
request CountTokensRequest
Returns
Type Description
com.google.common.util.concurrent.ListenableFuture<CountTokensResponse>

directPredict(DirectPredictRequest request)

public ListenableFuture directPredict(DirectPredictRequest request)

Perform an unary online prediction request to a gRPC model server for Vertex first-party products and frameworks.

Parameter
Name Description
request DirectPredictRequest
Returns
Type Description
com.google.common.util.concurrent.ListenableFuture<DirectPredictResponse>

directRawPredict(DirectRawPredictRequest request)

public ListenableFuture directRawPredict(DirectRawPredictRequest request)

Perform an unary online prediction request to a gRPC model server for custom containers.

Parameter
Name Description
request DirectRawPredictRequest
Returns
Type Description
com.google.common.util.concurrent.ListenableFuture<DirectRawPredictResponse>

explain(ExplainRequest request)

public ListenableFuture explain(ExplainRequest request)

Perform an online explanation. If deployed_model_id is specified, the corresponding DeployModel must have explanation_spec populated. If deployed_model_id is not specified, all DeployedModels must have explanation_spec populated.

Parameter
Name Description
request ExplainRequest
Returns
Type Description
com.google.common.util.concurrent.ListenableFuture<ExplainResponse>

generateContent(GenerateContentRequest request)

public ListenableFuture generateContent(GenerateContentRequest request)

Generate content with multimodal inputs.

Parameter
Name Description
request GenerateContentRequest
Returns
Type Description
com.google.common.util.concurrent.ListenableFuture<GenerateContentResponse>

predict(PredictRequest request)

public ListenableFuture predict(PredictRequest request)

Perform an online prediction.

Parameter
Name Description
request PredictRequest
Returns
Type Description
com.google.common.util.concurrent.ListenableFuture<PredictResponse>

rawPredict(RawPredictRequest request)

public ListenableFuture rawPredict(RawPredictRequest request)

Perform an online prediction with an arbitrary HTTP payload. The response includes the following HTTP headers:

  • X-Vertex-AI-Endpoint-Id: ID of the Endpoint that served this prediction.
  • X-Vertex-AI-Deployed-Model-Id: ID of the Endpoint's DeployedModel that served this prediction.
Parameter
Name Description
request RawPredictRequest
Returns
Type Description
com.google.common.util.concurrent.ListenableFuture<com.google.api.HttpBody>