GetCustomModelDeployment
Retrieves information about a custom model deployment, including its status, configuration, and metadata. Use this operation to monitor the deployment status and retrieve details needed for inference requests.
The following actions are related to the GetCustomModelDeployment operation:
Request Syntax
GET /model-customization/custom-model-deployments/customModelDeploymentIdentifier HTTP/1.1
URI Request Parameters
The request uses the following URI parameters.
- customModelDeploymentIdentifier
-
The Amazon Resource Name (ARN) or name of the custom model deployment to retrieve information about.
Length Constraints: Minimum length of 1. Maximum length of 93.
Pattern:
(arn:aws(|-us-gov|-cn|-iso|-iso-b):bedrock:[a-z0-9-]{1,20}:[0-9]{12}:custom-model-deployment/[a-z0-9]{12})|^([0-9a-zA-Z][_-]?){1,63}
Required: Yes
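The constraints above can be checked on the client before a request is sent. The following is a minimal sketch, not part of the service or any SDK: the helper names `is_valid_identifier` and `build_request_path` are hypothetical, and percent-encoding the identifier in the path is an assumption about how an ARN (which contains `:` and `/`) would be carried in a URI segment.

```python
import re
from urllib.parse import quote

# Pattern and length limits copied from the parameter description above.
IDENTIFIER_PATTERN = re.compile(
    r"(arn:aws(|-us-gov|-cn|-iso|-iso-b):bedrock:[a-z0-9-]{1,20}:[0-9]{12}"
    r":custom-model-deployment/[a-z0-9]{12})|^([0-9a-zA-Z][_-]?){1,63}"
)
MIN_LENGTH, MAX_LENGTH = 1, 93


def is_valid_identifier(identifier: str) -> bool:
    """Return True if the identifier satisfies the documented constraints."""
    if not MIN_LENGTH <= len(identifier) <= MAX_LENGTH:
        return False
    # fullmatch requires the whole string to match one of the two alternatives.
    return IDENTIFIER_PATTERN.fullmatch(identifier) is not None


def build_request_path(identifier: str) -> str:
    """Hypothetical helper: build the request path for a validated identifier."""
    if not is_valid_identifier(identifier):
        raise ValueError(f"invalid deployment identifier: {identifier!r}")
    # Assumption: reserved characters in an ARN are percent-encoded in the path.
    return "/model-customization/custom-model-deployments/" + quote(identifier, safe="")
```

Validating locally only avoids an obviously malformed request; the service remains the authority and can still return ValidationException.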
Request Body
The request does not have a request body.
Response Syntax
HTTP/1.1 200
Content-type: application/json
{
"createdAt": "string",
"customModelDeploymentArn": "string",
"description": "string",
"failureMessage": "string",
"lastUpdatedAt": "string",
"modelArn": "string",
"modelDeploymentName": "string",
"status": "string"
}
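A response with this shape can be mapped onto a typed record. The sketch below rests on two assumptions that this page does not confirm: that the timestamp fields arrive as ISO 8601 strings, and that `description`, `failureMessage`, and `lastUpdatedAt` may be absent. The class and function names are hypothetical.

```python
import json
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass
class CustomModelDeployment:
    created_at: datetime
    custom_model_deployment_arn: str
    model_arn: str
    model_deployment_name: str
    status: str
    description: Optional[str] = None
    failure_message: Optional[str] = None
    last_updated_at: Optional[datetime] = None


def _parse_timestamp(value: str) -> datetime:
    # Assumption: timestamps are ISO 8601 strings. A trailing "Z" is rewritten
    # to "+00:00" so that Python versions before 3.11 can parse it.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def parse_response(body: str) -> CustomModelDeployment:
    """Convert the JSON response body into a CustomModelDeployment record."""
    data = json.loads(body)
    last_updated = data.get("lastUpdatedAt")
    return CustomModelDeployment(
        created_at=_parse_timestamp(data["createdAt"]),
        custom_model_deployment_arn=data["customModelDeploymentArn"],
        model_arn=data["modelArn"],
        model_deployment_name=data["modelDeploymentName"],
        status=data["status"],
        # Fields assumed optional fall back to None when missing.
        description=data.get("description"),
        failure_message=data.get("failureMessage"),
        last_updated_at=_parse_timestamp(last_updated) if last_updated else None,
    )
```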
Response Elements
If the action is successful, the service sends back an HTTP 200 response.
The following data is returned in JSON format by the service.
- createdAt
-
The date and time when the custom model deployment was created.
Type: Timestamp
- customModelDeploymentArn
-
The Amazon Resource Name (ARN) of the custom model deployment.
Type: String
Length Constraints: Minimum length of 0. Maximum length of 1011.
Pattern:
arn:aws(-[^:]+)?:bedrock:[a-z0-9-]{1,20}:[0-9]{12}:custom-model-deployment/[a-z0-9]{12}
- description
-
The description of the custom model deployment.
Type: String
Length Constraints: Minimum length of 1. Maximum length of 2048.
Pattern:
.*
- failureMessage
-
If the deployment status is FAILED, this field contains a message describing the failure reason.
Type: String
Length Constraints: Minimum length of 0. Maximum length of 2048.
- lastUpdatedAt
-
The date and time when the custom model deployment was last updated.
Type: Timestamp
- modelArn
-
The Amazon Resource Name (ARN) of the custom model associated with this deployment.
Type: String
Length Constraints: Minimum length of 20. Maximum length of 1011.
Pattern:
arn:aws(-[^:]+)?:bedrock:[a-z0-9-]{1,20}:[0-9]{12}:custom-model/(imported|[a-z0-9-]{1,63}[.]{1}[a-z0-9-]{1,63}([a-z0-9-]{1,63}[.]){0,2}[a-z0-9-]{1,63}([:][a-z0-9-]{1,63}){0,2})/[a-z0-9]{12}
- modelDeploymentName
-
The name of the custom model deployment.
Type: String
Length Constraints: Minimum length of 1. Maximum length of 63.
Pattern:
([0-9a-zA-Z][_-]?){1,63}
- status
-
The status of the custom model deployment. Possible values are:
- CREATING: The deployment is being set up and prepared for inference.
- ACTIVE: The deployment is ready and available for inference requests.
- FAILED: The deployment failed to be created or became unavailable.
Type: String
Valid Values:
Creating | Active | Failed
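Because this operation is meant for monitoring deployment status, a caller will typically poll it until the deployment leaves the creating state. The sketch below shows one way to do that. It takes a caller-supplied `fetch` function that returns the parsed response as a dictionary, so it makes no claim about any particular SDK call; the function name, poll interval, and attempt limit are illustrative assumptions.

```python
import time
from typing import Callable, Dict


def wait_for_deployment(
    fetch: Callable[[], Dict[str, str]],
    poll_seconds: float = 30.0,
    max_attempts: int = 40,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, str]:
    """Poll fetch() until the deployment is active; raise if it fails."""
    for attempt in range(max_attempts):
        deployment = fetch()
        # Compared case-insensitively because the status description writes
        # CREATING while the Valid Values line writes Creating.
        status = deployment["status"].upper()
        if status == "ACTIVE":
            return deployment
        if status == "FAILED":
            raise RuntimeError(deployment.get("failureMessage", "deployment failed"))
        if status != "CREATING":
            raise ValueError(f"unexpected status: {deployment['status']!r}")
        # Still creating: wait before the next poll, except after the last one.
        if attempt < max_attempts - 1:
            sleep(poll_seconds)
    raise TimeoutError(f"deployment still creating after {max_attempts} polls")
```

Injecting `fetch` and `sleep` keeps the loop testable without network access and lets the caller decide how the request is actually made.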
Errors
For information about the errors that are common to all actions, see Common Errors.
- AccessDeniedException
-
The request is denied because of missing access permissions.
HTTP Status Code: 403
- InternalServerException
-
An internal server error occurred. Retry your request.
HTTP Status Code: 500
- ResourceNotFoundException
-
The specified resource Amazon Resource Name (ARN) was not found. Check the Amazon Resource Name (ARN) and try your request again.
HTTP Status Code: 404
- ThrottlingException
-
The number of requests exceeds the limit. Resubmit your request later.
HTTP Status Code: 429
- ValidationException
-
Input validation failed. Check your request parameters and retry the request.
HTTP Status Code: 400
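The error descriptions above suggest which failures are worth retrying unchanged: InternalServerException ("Retry your request") and ThrottlingException ("Resubmit your request later"). The sketch below encodes that reading as a lookup table with a capped exponential backoff. The retry classification and the delay values are assumptions drawn from the wording on this page, not documented service behavior, and the names are hypothetical.

```python
from typing import Dict, Tuple

# Error name -> (HTTP status code from this page, retry unchanged?).
# The retry flag is an interpretation of the error descriptions above.
ERRORS: Dict[str, Tuple[int, bool]] = {
    "AccessDeniedException": (403, False),
    "InternalServerException": (500, True),
    "ResourceNotFoundException": (404, False),
    "ThrottlingException": (429, True),
    "ValidationException": (400, False),
}


def is_retryable(error_name: str) -> bool:
    """Return True if the same request may reasonably be sent again."""
    # Unknown error names are treated as not retryable.
    return ERRORS.get(error_name, (0, False))[1]


def backoff_delay(attempt: int, base: float = 1.0, cap: float = 30.0) -> float:
    """Delay in seconds before retry number `attempt` (0-based), capped."""
    return min(cap, base * (2 ** attempt))
```

The remaining errors call for a change to the request or to permissions first, so resending the same request is unlikely to help.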
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: