Getting Started

Learn how to install and set up Inference Gateway.

Installation

Using Docker

bash

docker pull ghcr.io/inference-gateway/inference-gateway:latest
docker run --rm -it -p 8080:8080 -e OPENAI_API_KEY=your_key_here ghcr.io/inference-gateway/inference-gateway:latest

Using Docker Compose

Checkout the examples in the Docker Compose examples.

Using Kubernetes

Deploy to Kubernetes with the Kubernetes Operator - the recommended path. It manages the gateway and related resources declaratively as Custom Resources. Follow the Operator quick start to apply your first Gateway, or browse the runnable Kubernetes examples.

Basic Usage

Send a request to the Inference Gateway:

bash

curl -X POST http://localhost:8080/v1/chat/completions \
-d '{
"model": "deepseek/deepseek-v4-flash",
"messages": [
        {
            "role": "system",
            "content": "You are a helpful assistant."
        },
        {
            "role": "user",
            "content": "Hello, world!"
        }
    ]
}

Next steps

Explore the Architecture Overview to see how requests flow through the gateway.
Connect tools and data sources with MCP Integration, or coordinate specialized agents with A2A Integration.
Define an agent as code with the Agent Definition Language (ADL), then scaffold a Go or Rust A2A server using the ADL CLI.

Getting Started ​

Installation ​

Using Docker ​

Using Docker Compose ​

Using Kubernetes ​

Basic Usage ​

Next steps ​