Web single pattern

Use case

Architecture

The web single pattern is an architecture that packs all the artifacts for a prediction model into a single web server. Because the REST (or gRPC) interface, the preprocessing, and the trained model all live in one place, you can build and deploy it as a simple predictor.
If you want to deploy multiple replicas, you need to put them behind a load balancer or proxy. If you use gRPC for the interface, you need to consider client-side load balancing or a layer-7 load balancer, because gRPC multiplexes requests over long-lived HTTP/2 connections and a connection-level (layer-4) balancer would keep sending all of a client's requests to the same replica.
To build your model into the web server, you can apply either the model-in-image pattern or the model-load pattern.
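
As a rough illustration of the architecture described above, the following sketch puts a REST endpoint, a preprocessing step, and a model in one process, using only the Python standard library. Everything named here is an assumption for illustration: the `/predict` path, the `{"data": [...]}` request shape, the `make_server` and `request_prediction` helpers, and the hard-coded `WEIGHTS` and `BIAS`, which stand in for a trained model that would normally be baked into the image or loaded at startup. A real deployment would likely use a production web framework instead.

```python
import json
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical "trained model": fixed weights for a linear scorer.
# In practice these would come from a model file (model-in-image or model-load).
WEIGHTS = [0.5, -0.25]
BIAS = 0.1


def preprocess(raw):
    """Stand-in preprocessing: convert raw input values to floats."""
    return [float(x) for x in raw]


def predict(features):
    """Stand-in inference: weighted sum of the features plus a bias."""
    return sum(w * x for w, x in zip(WEIGHTS, features)) + BIAS


class PredictHandler(BaseHTTPRequestHandler):
    """REST interface, preprocessing, and model served from one process."""

    def do_POST(self):
        if self.path != "/predict":
            self.send_error(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length))
        score = predict(preprocess(payload["data"]))
        body = json.dumps({"prediction": score}).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        # Silence the default per-request logging to stderr.
        pass


def make_server(port=8000):
    """Create the single web server; the caller decides how to run it."""
    return HTTPServer(("127.0.0.1", port), PredictHandler)


def request_prediction(port, data):
    """Small client helper: POST the data and return the predicted value."""
    req = urllib.request.Request(
        f"http://127.0.0.1:{port}/predict",
        data=json.dumps({"data": data}).encode(),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["prediction"]
```

Calling `make_server().serve_forever()` starts the predictor on port 8000. To run several replicas, each one would be started as its own process or container, with a load balancer or proxy in front of them.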

Diagram

diagram

Pros

Cons

Needs consideration