Getting Started
Learn how to install and set up Inference Gateway.
Installation
Using Docker
bash
docker pull ghcr.io/inference-gateway/inference-gateway:latest
docker run --rm -it -p 8080:8080 -e OPENAI_API_KEY=your_key_here ghcr.io/inference-gateway/inference-gateway:latestUsing Docker Compose
Checkout the examples in the Docker Compose examples.
Using Kubernetes
Deploy to Kubernetes with the Kubernetes Operator - the recommended path. It manages the gateway and related resources declaratively as Custom Resources. Follow the Operator quick start to apply your first Gateway, or browse the runnable Kubernetes examples.
Basic Usage
Send a request to the Inference Gateway:
bash
curl -X POST http://localhost:8080/v1/chat/completions \
-d '{
"model": "deepseek/deepseek-v4-flash",
"messages": [
{
"role": "system",
"content": "You are a helpful assistant."
},
{
"role": "user",
"content": "Hello, world!"
}
]
}Next steps
- Explore the Architecture Overview to see how requests flow through the gateway.
- Connect tools and data sources with MCP Integration, or coordinate specialized agents with A2A Integration.
- Define an agent as code with the Agent Definition Language (ADL), then scaffold a Go or Rust A2A server using the ADL CLI.
