Models

Open-weight models, serverless

All models use OpenAI-compatible IDs. Drop them into your existing SDK and start paying per token — no GPU provisioning needed.

Quick start
// Works with any OpenAI-compatible SDK
const client = new OpenAI({
  baseURL: "https://api.lexora.network/v1",
  apiKey: process.env.LEXORA_API_KEY,
});

// Chat completion
await client.chat.completions.create({
  model: "Qwen/Qwen3-8B",  // ← use any model ID below
  messages: [{ role: "user", content: "Hello!" }],
});

// Image generation
await client.images.generate({
  model: "black-forest-labs/FLUX.1-schnell",
  prompt: "A futuristic city at night",
});
In-House Models
Same price for all plans

Run directly on Lexora's distributed worker network. Flat per-token pricing — no markup, no plan differences.

Live

Qwen3 8B

Qwen/Qwen3-8B

$0.10 / 1M tokens

Fast reasoning model on the network. Optimized for chat, code, classification, and high-throughput integrations.

ChatReasoningOpenAI-compatible128K tokens

Common use cases

ChatbotsClassificationSummarizationRAG
Live

FLUX.1 schnell

black-forest-labs/FLUX.1-schnell

$0.002 / image

Guidance-free 4-step diffusion model. Best speed-to-quality ratio for programmatic image generation at scale.

Image4-stepFast1024×1024 px

Common use cases

Image appsE-commerceCreative toolsAvatars
In the Pipeline
In Pipeline

Llama 3.1 8B Instruct

meta-llama/Llama-3.1-8B-Instruct

Balanced performance and cost. Strong on reasoning, code, and complex instruction following.

ChatReasoningCode128K tokens
In Pipeline

Llama 3.1 70B Instruct

meta-llama/Llama-3.1-70B-Instruct

Flagship open-weight LLM. Near-GPT-4 quality for complex reasoning and long-context tasks.

LargeReasoningComplex tasks128K tokens
In Pipeline

Stable Diffusion XL

stabilityai/stable-diffusion-xl-base-1.0

Industry-standard image model with a huge ecosystem of LoRAs, ControlNets, and refiners.

ImageSDXLEcosystem1024×1024 px
Partnership Models
Pro: +2% · PAYG: +10%

Served via provider partnerships. Pricing is market rate plus a small markup to cover routing and support.

Partnership

DeepSeek V4 Flash

deepseek/deepseek-v4-flash

in $0.143
out $0.29

per 1M tokens · Pro

Fast variant of DeepSeek V4. High-throughput reasoning at low cost, ideal for production pipelines.

ChatReasoningFast128K tokens
Partnership

DeepSeek V4 Pro

deepseek/deepseek-v4-pro

in $0.444
out $0.89

per 1M tokens · Pro

Full DeepSeek V4. Top-tier reasoning and coding performance, competitive with frontier models.

ChatReasoningCode128K tokens
Partnership

Kimi K2.6

moonshot/kimi-k2.6

in $0.969
out $4.08

per 1M tokens · Pro

Moonshot's flagship long-context model. Exceptional multilingual reasoning and document understanding.

ChatLong contextMultilingual200K tokens
Partnership

Kimi K2.7 Code

moonshot/kimi-k2.7-code

in $0.969
out $4.08

per 1M tokens · Pro

Code-specialized Kimi model. Strong at code generation, debugging, and technical documentation.

CodeDebugLong context200K tokens
Partnership

Kimi K2.7 Code Highspeed

moonshot/kimi-k2.7-code-highspeed

in $1.938
out $8.16

per 1M tokens · Pro

High-throughput variant of Kimi K2.7 Code. Lower latency for latency-sensitive coding pipelines.

CodeFastLong context200K tokens
Partnership

FLUX.1 dev

black-forest-labs/FLUX.1-dev

TBD

Guidance-distilled FLUX for higher prompt adherence and richer detail. Tunable CFG scale.

ImageGuidedHD1024×1024 px

Partnership model prices shown at Pro rate (+2%). Pay As You Go users are charged +10% on top of market rate. Upgrade to Pro →

Need a specific model?

We add models based on demand. If you need a model that isn't listed — a specific Mistral variant, a multimodal model, a code model, or a custom deployment — let us know and we'll prioritize it.

Request a model →

Start using these models

No credit card required to get started.