Ship AI your customers' lawyers approve of.

Production inference and MCP hosting on Kubernetes you can audit. EU-hosted, GDPR-clean, no vendor lock-in.

Book a 30-minute scope call

// EVIDENCE

I run a Hetzner-hosted K3s cluster that powers every product I own — UnionStack (EU WordPress hosting), Peptalytix (peptide protocol tracking), this site. The same patterns I deployed for myself are what I deliver to you.

UnionStack

EU agency hosting on K3s — the platform that runs every product I own.

Read the case study →

K8s Hardening Sweep

20-issue write-up — public artifact, no client data, the opinions that matter.

Read the write-up →

// THE OFFER

One product, two tiers.

Audit & Roadmap

2 weeks · €15 – 25k

Current-state assessment: cluster, AI posture, compliance
Target architecture for cluster, inference, MCP, ops
Sizing + cost model — three throughput scenarios
GDPR + customer-contract gap analysis
Phase-2 statement of work — bid it elsewhere if you want
30 days of follow-up access

Output: written report + 90-minute readout call.

Full Build

6 weeks · €50 – 90k

Hardened K3s cluster: NetworkPolicies, RBAC, sealed-secrets, backups
vLLM (prod) + Ollama (dev) behind an OpenAI-compatible gateway
MCP gateway: per-tenant auth, rate limits, audit log
IaC in your repo: Terraform (or Pulumi on request)
Prometheus + Grafana + Loki with three pre-built dashboards
Markdown docs + 10-scenario runbook + 4-hour handover + 90-day Q&A

Output: running cluster in your repo + docs + handover.

Audit fee credited in full against the Build if commenced within 90 days.

Book a 30-minute scope call

// HOW IT WORKS

How it works

01
Scope call
Book a 30-minute scope call. I map your AI ambitions and compliance constraints. If we're not a fit I'll say so.
02
Audit & Roadmap
2 weeks. You walk away with an architecture, a sizing model, a GDPR gap analysis, and a Phase-2 statement of work.
03
Decision point
You proceed to the Build, take the report and bid it elsewhere, or shelve it. The audit deliverable stands on its own.
04
Full Build
6 weeks. Cluster, inference, MCP, IaC, docs. Handover with your on-call engineer.

// ABOUT

Solo, Berlin-based, EU-only.

I'm Lionel Pedro. I run my own K3s cluster in production — five live workloads, 99.97% uptime over 90 days — and synkraft is how I deliver the same thing to your team. EN and DE, peer-to-peer, no fluff.

// FAQ

FAQ

Why not just use OpenAI?

If your customers' contracts don't block it and your data isn't sensitive, OpenAI is often the right answer. I'll tell you that on the scope call if it applies. This offer exists for teams where 'just use OpenAI' isn't an option — GDPR, customer subprocessor lists, regulated industry, or data your team isn't comfortable streaming to a US vendor.

What if we don't have GPUs?

Hetzner GPU instances are the default recommendation — EU-hosted, monthly pricing, cancellable. The audit costs out the right GPU SKU for your workload. Client-owned hardware is supported with a small build surcharge.

What if our team can't run K8s after handover?

Handover includes a 10-scenario runbook and 90 days of async follow-up. After that, an optional retainer covers 4 hours/month for ops questions. You won't get locked in — the IaC lives in your repo, the cluster is in your account, you can move it or fire me at any time.

What models can you actually run?

Two bands. Self-hosted open weights — Llama, Mistral, Qwen, DeepSeek, Phi, Gemma, Command R, Mixtral — anything with downloadable weights and a permissive license, served via vLLM or Ollama. The specific model gets picked in the audit based on your workload and budget. Closed-weight commercial models (Claude, the proprietary Bedrock catalogue) aren't self-hostable, but I wire your existing Bedrock or Anthropic-direct subscription through the same OpenAI-compatible gateway — you get one unified API, one audit log, and self-hosting for the parts that can be self-hosted.

What if we already have a cluster?

The audit absorbs your existing cluster as Phase 0. The Build either hardens what you have or rebuilds it cleanly — whichever the audit recommends, with your input. We don't bin work that's already done.

How is this different from Cohere, Together, Replicate?

They run inference for you on their cluster. You're getting your own cluster, in your AWS, Hetzner, or colo account, that you own. They're a service. This is a build.

One product, two tiers.

Audit & Roadmap

2 weeks · €15 – 25k

Current-state assessment: cluster, AI posture, compliance
Target architecture for cluster, inference, MCP, ops
Sizing + cost model — three throughput scenarios
GDPR + customer-contract gap analysis
Phase-2 statement of work — bid it elsewhere if you want
30 days of follow-up access

Output: written report + 90-minute readout call.

Full Build

6 weeks · €50 – 90k

Hardened K3s cluster: NetworkPolicies, RBAC, sealed-secrets, backups
vLLM (prod) + Ollama (dev) behind an OpenAI-compatible gateway
MCP gateway: per-tenant auth, rate limits, audit log
IaC in your repo: Terraform (or Pulumi on request)
Prometheus + Grafana + Loki with three pre-built dashboards
Markdown docs + 10-scenario runbook + 4-hour handover + 90-day Q&A

Output: running cluster in your repo + docs + handover.

Audit fee credited in full against the Build if commenced within 90 days.

How it works

Scope call

Book a 30-minute scope call. I map your AI ambitions and compliance constraints. If we're not a fit I'll say so.

Audit & Roadmap

2 weeks. You walk away with an architecture, a sizing model, a GDPR gap analysis, and a Phase-2 statement of work.

Decision point

You proceed to the Build, take the report and bid it elsewhere, or shelve it. The audit deliverable stands on its own.

Full Build

6 weeks. Cluster, inference, MCP, IaC, docs. Handover with your on-call engineer.

FAQ

Why not just use OpenAI?

What if we don't have GPUs?

What if our team can't run K8s after handover?

What models can you actually run?

What if we already have a cluster?

How is this different from Cohere, Together, Replicate?

They run inference for you on their cluster. You're getting your own cluster, in your AWS, Hetzner, or colo account, that you own. They're a service. This is a build.