Get Dedicated Deployment
Get deployment details and all replicas.
Accepts either of the following credentials:
- User Bearer token: the JWT is validated and user_id is taken from the token.
- INFERENCE_PROXY_SERVICE_TOKEN: a service token; the caller must supply the X-User-Id header.
In both cases ownership is enforced: the deployment must belong to the resolved user_id.
If include_terminated is false (the default), the endpoint returns 404 for deployments whose status is stopped. Set include_terminated=true to retrieve a deployment regardless of its status.
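The request described above could be assembled roughly as follows. This is a minimal sketch, not the official client: the host `api.example.com`, the `deployment_id` query parameter name, and sending the service token in an `Authorization: Bearer` header are all assumptions for illustration. Only the path, the X-User-Id header, and the include_terminated parameter come from this page.

```python
import urllib.parse
import urllib.request

BASE_URL = "https://api.example.com"  # hypothetical host, replace with the real one
PATH = "/api/v2/external/inference/get"  # path as stated in the response description


def build_get_deployment_request(deployment_id, token, user_id=None,
                                 include_terminated=False):
    """Build (but do not send) a GET request for a single deployment.

    Pass user_id only when `token` is the service token; with a user
    Bearer token the server resolves user_id from the JWT instead.
    """
    query = urllib.parse.urlencode({
        "deployment_id": deployment_id,  # assumed parameter name
        "include_terminated": "true" if include_terminated else "false",
    })
    # Assumption: both token types travel in the Authorization header.
    headers = {"Authorization": f"Bearer {token}"}
    if user_id is not None:
        headers["X-User-Id"] = user_id
    return urllib.request.Request(
        f"{BASE_URL}{PATH}?{query}", headers=headers, method="GET"
    )
```

To send the request, pass the result to `urllib.request.urlopen`. A 404 (for example, a stopped deployment requested without include_terminated=true) surfaces as `urllib.error.HTTPError`.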
GET /api/v2/external/inference/get
Response
Successful Response
Response body for GET /api/v2/external/inference/get.

