From first principles.
We build the smallest piece that solves the actual problem. Fewer abstractions, more measurements, and a clear paper trail from input to output.
DuguetLabs designs, trains, and ships AI systems that hold up under production load — sovereign by architecture, transparent by default, assembled from first principles.
Five lines of work, sequenced. One available today; the rest released as they pass internal evaluation.
An OpenAI- and OpenRouter-compatible endpoint serving frontier open-source and proprietary models on dedicated, sovereign infrastructure. Drop-in for existing SDKs; no migration.
Real-time 3-D scene understanding for indoor and outdoor capture — object detection, pose, depth, and segmentation in a single pass.
High-fidelity diffusion models with controllable composition, consistent characters, and edit-grade outputs. Trained on permissibly-licensed corpora.
Long-horizon planning, tool use, and multi-step orchestration with observability and human-in-the-loop checkpoints — built for systems that take consequential action.
End-user tools where the agent disappears into the work — research, ops, and back-office assistants delivered as finished products.
$ curl https://api.duguetlabs.com/v1/chat/completions \
-H "Authorization: Bearer dg_…" \
-H "Content-Type: application/json" \
-d '{
"model": "duguet-ai/llama-3.3-70b",
"messages": [
{ "role": "user",
"content": "Compose a haiku about sovereignty." }
],
"stream": true
}'
> data: {"id":"chatcmpl-…","provider":"duguet-ai",
> data: "choices":[{"delta":{"content":"Quiet servers hum,"}}]}
> data: {"choices":[{"delta":{"content":" data stays where it was born—"}}]}
> data: {"choices":[{"delta":{"content":" sovereignty in code."}}]}
> data: [DONE]
Transparent per-million-token pricing. No top-up fee, no hidden request surcharge. Billed by the token, paid from prepaid credit. First $5 on us.
| Model | Context | Input / MTok | Output / MTok | Backend |
|---|---|---|---|---|
Llama 3.1 8B duguet-ai/llama-3.1-8b | 131k | $0.05 | $0.08 | Azure EU |
Qwen 3 8B duguet-ai/qwen3-8b | 32k | $0.05 | $0.08 | Self-hosted |
Qwen 2.5 Coder 7B duguet-ai/qwen2.5-coder-7b | 32k | $0.05 | $0.08 | Self-hosted |
Mistral Nemo 12B duguet-ai/mistral-nemo-12b | 128k | $0.10 | $0.15 | Self-hosted |
GPT-OSS 120B duguet-ai/gpt-oss-120b | 128k | $0.15 | $0.60 | Azure |
Llama 3.3 70B duguet-ai/llama-3.3-70b | 131k | $0.30 | $0.40 | Azure |
DeepSeek V3.2 duguet-ai/deepseek-v3-2 | 164k | $0.40 | $0.60 | Azure |
Mistral Large 3 duguet-ai/mistral-large-3 | 131k | $0.40 | $1.20 | Azure |
DeepSeek R1 reasoning duguet-ai/deepseek-r1 | 164k | $0.50 | $2.00 | Azure |
| — Proprietary — | ||||
GPT-5 duguet-ai/gpt-5 | 400k | $2.50 | $10.00 | Azure OpenAI |
GPT-5.2 duguet-ai/gpt-5.2 | 400k | $2.50 | $10.00 | Azure OpenAI |
GPT-5.3 Codex duguet-ai/gpt-5.3-codex | 400k | $2.50 | $10.00 | Azure OpenAI |
| — Embeddings — | ||||
Nomic Embed duguet-ai/nomic-embed | 8k | $0.02 | — | Self-hosted |
A note on price: we aren't chasing the absolute floor. We undercut the major aggregators by 20–40 % on most SKUs and don't charge a top-up fee. If you need a bespoke quote for higher throughput, write.
We build the smallest piece that solves the actual problem. Fewer abstractions, more measurements, and a clear paper trail from input to output.
Compute, weights, and data stay where you can audit them. Self-host the entire stack, or use ours under a contract that forbids egress.
We prefer open weights and open formats; we contribute the generalisable parts back. Proprietary work earns its keep on results, not on lock-in.
Every release ships with throughput numbers, latency p50/p99, failure modes, and a runbook. If it cannot be operated, it is not finished.
$5 in free credit, no card. Drop-in replacement for
OpenAI or OpenRouter clients — point base URL at
api.duguetlabs.com/v1 and go.