Replicate allows users to run AI models via a cloud API without managing infrastructure.
Toolkit · replicate
v1
01Cancel Predictiontool
02Cancel Trainingtool
03Create Deploymenttool
04Create Filetool
API Key · verified
31T · 0TR
replicate
Supported tools and triggers
Each tool is a typed function your agent can call. Each trigger fires an agent run when something changes upstream.
Cancel Prediction
Tool to cancel a prediction that is still running. Use when you need to stop an in-progress prediction to free up resources or halt execution.
TOOL
Cancel Training
Tool to cancel an ongoing training operation in Replicate. Use when you need to stop a training job that is in progress.
TOOL
Create Deployment
Tool to create a new deployment with specified model, version, hardware, and scaling parameters. Use when you need to deploy a model for production use with auto-scaling.
TOOL
Create File
Tool to create or upload a file to Replicate. Use when you need to upload file content with optional metadata.
TOOL
Create Model
Tool to create a new Replicate model with specified owner, name, visibility, and hardware. Use when you need to create a destination model before launching LoRA/fine-tune training.
TOOL
Create Model Prediction
Tool to create a prediction using an official Replicate model. Use when you need to run inference with a specific model using its owner and name. Supports synchronous waiting (up to 60 seconds) and webhooks for async notifications.
TOOL
Create Prediction
Tool to create a prediction for a Replicate Deployment. IMPORTANT: This action ONLY works with Replicate Deployments (persistent instances you create and manage), NOT public models. Deployments are created via REPLICATE_DEPLOYMENTS_CREATE. To run public models (e.g., 'meta/llama-2-70b-chat', 'stability-ai/sdxl'), use REPLICATE_MODELS_PREDICTIONS_CREATE instead. Use 'wait_for' to wait until the prediction completes.
TOOL
Create Prediction
Tool to create a prediction to run a model by version ID. Use when you have a specific model version identifier and need to run inference with provided inputs. Supports synchronous waiting and webhook notifications.
TOOL
Create Training Job
Tool to create a training job for a specific model version. Use when you need to fine-tune a model with custom training data. Supports webhook notifications for training status updates.
TOOL
Delete Deployment
Tool to delete a deployment from your account. Use when you need to remove a deployment. Deployments must be offline and unused for at least 15 minutes before deletion.
TOOL
Delete File
Tool to delete a file by its ID. Use when you need to remove a file from storage. Returns 204 No Content on success.
TOOL
Get Account Information
Tool to get authenticated account information. Use when you need to retrieve details about the account associated with the API token.
TOOL
Get Deployment Details
Tool to get deployment details by owner and name. Use when you need information about a specific deployment including its release configuration and hardware settings.
TOOL
Get File Details
Tool to get details of a file by its ID. Use when you need to inspect uploaded file information before further operations. Returned URLs may be short-lived; download or persist needed files promptly after retrieval.
TOOL
Get model collection
Tool to get a specific collection of models by its slug. Use when you need detailed information about a collection and its models.
TOOL
Get Model Details
Tool to get details of a specific model by owner and name. Consult the returned input schema before constructing any prediction request — each model defines its own required/optional fields (e.g., `prompt`, `aspect_ratio`, `version`); missing or unknown keys cause validation errors. Model schemas and available versions may change over time; recheck before production use.
TOOL
Get Model README
Tool to get the README content for a model in Markdown format. Consult alongside REPLICATE_MODELS_EXAMPLES_LIST before calling REPLICATE_CREATE_PREDICTION — Replicate enforces strict JSON schemas on model inputs and returns 422 errors for incorrect keys or types. Use after retrieving model details when you want to view its documentation.
TOOL
Get Model Version
Tool to get a specific version of a model. Use when you need details about a particular model version including its schema and metadata.
TOOL
Get Prediction
Tool to get the status and output of a prediction by its ID. Use when you need to check on a running prediction or retrieve the results of a completed prediction.
TOOL
Get Webhook Signing Secret
Tool to get the signing secret for the default webhook. Use when you need to retrieve the secret key used to verify webhook authenticity.
TOOL
List All Predictions
Tool to list all predictions for the authenticated user or organization with pagination. Use when you need to retrieve prediction history or filter predictions by creation date.
TOOL
List Available Hardware
Tool to list available hardware SKUs for models and deployments. Use when you need to see what hardware options are available on the Replicate platform.
TOOL
List deployments
Tool to list all deployments associated with the account. Use when you need to retrieve deployment configurations and their latest releases.
TOOL
List Files
Tool to retrieve a paginated list of uploaded files. Use to view all files created by the authenticated user or organization. Files are sorted with most recent first. Pagination is cursor-based: follow the next cursor until empty to retrieve all files. Limit requests to 1–2/second to avoid 429 Too Many Requests errors. Use to validate current file_ids before passing to prediction tools, as stale file_ids cause runtime errors.
TOOL
List model collections
Tool to list all collections of models. Use when you need to retrieve available model collections. Collections are curated groupings of related models. Response includes only collection metadata (name, slug, description), not individual models within each collection; use REPLICATE_MODELS_GET for per-model details. Response may include a non-null `next` field indicating additional pages; follow it to enumerate all collections.
TOOL
List model examples
Tool to list example predictions for a specific model. Use when you want to retrieve author-provided illustrative examples after identifying the model. Returned examples are minimal working payloads; cross-reference with REPLICATE_MODELS_README_GET before calling REPLICATE_CREATE_PREDICTION to satisfy strict input validation.
TOOL
List Model Versions
Tool to list all versions of a specific model. Use when you need to see all available versions of a model, sorted by newest first.
TOOL
List Public Models
Tool to list public models with pagination and sorting. Use when you need to browse available models or find models sorted by creation date.
TOOL
List Training Jobs
Tool to list all training jobs for the authenticated user or organization with pagination. Use when you need to retrieve training history or check the status of training jobs.
TOOL
Search Models and Collections
Tool to search for models, collections, and docs using text queries (beta). Use when you need to find relevant models or collections based on keywords or descriptions.
TOOL
Update Model Metadata
Tool to update metadata for a model including description, URLs, and README. Use when you need to modify a model's visibility, documentation, or associated links.