Model State

A deployed model version can assume a number of different states. To check the status of a model version, select the Versions tab from the Model Overview page in the console and view the Status field.

The model version can be in one of the following states:

  • STARTING: Resources are being provisioned.
  • ACTIVE: The replica/replicas are servicing requests.
  • SCALING: The replica count is adjusting to traffic.
  • IDLE: No replicas are handling requests, triggering a scale down.
  • OFFLINE: No active replicas, a cold start will be initiated on the next request.
  • ERROR: The model version failed to deploy.

where ACTIVE and IDLE directly refer to the activity of the model version replicas.

INFO

If an IDLE model does not receive any requests for 30 minutes a scale-down will be triggered, and if the replica count falls to 0, the model will go OFFLINE.

The diagram below illustrates the full model lifecycle, showing the relationship and transitions between these states when a model version is deployed.

Model State Diagram
Model State Diagram
TIP

If the model state is ERROR, there is likely an error in the model preparation. Please refer to the Prepare Model section and carefully ensure you have correctly followed all of the required steps.

Last updated: 6/19/2024, 9:44:03 PM