Replicate
Run AI models in the cloud with a simple API
LLM APIs & InferencePay-per-useWorks with OpenClaw
About Replicate
Replicate is a cloud platform for running open-source AI models via a simple API. You can run Stable Diffusion, LLMs, video models, and more without managing infrastructure — just pay per prediction.
Features
Model hosting
API access
Fine-tuning
Community models
Streaming
The tally
FOR
- +Simple API for any model
- +No infrastructure management
- +Pay only for what you use
- +Community model sharing
- +Easy fine-tuning
AGAINST
- −Can be expensive at scale
- −Cold start latency
- −Dependent on cloud availability
- −Limited customization
Related concepts
Kept nearby
Vercel AI Gateway
Unified API gateway for routing app calls across hundreds of AI models
Free monthly credits; pay-as-you-go at provider list price with no markup
Hugging Face
The AI community platform with 500K+ models and datasets
Free + Pro $9/mo + Enterprise · ★ 140,000
Fireworks AI
Fast and efficient LLM inference platform
Pay-per-use
Together AI
Fast inference and fine-tuning for open-source models
Pay-per-use
Browse all LLM APIs & Inference tools →