AI BUILD STUDIO - EST. 2026

Bring us the problem.
We'll build the AI.

ZeroTwo Labs designs, builds, and operates the AI, and the systems, apps, and workflows around it. End-to-end, for teams that want working products, not research papers. Tell us what you're stuck on; we ship it.

Data agents - AI insights - Model infrastructure

Operating since 2026

Status: Open for client work

02 / agent.runtime LIVE

-> pull: 1,240 rows - postgres.db

-> embed: ranked - deduped - scored

-> write: insight.md - 6 pages

IN NUMBERS

4+ yrs

Building AI in production environments

4+ yrs

Engineering applications that scale

Practice areas: agents, insights, infra

100%

Senior, founder-led: no outsourced work

USE CASES - WHAT WE BUILD

What we work on.

Four shapes of AI problem we take end-to-end: strategy, build, deploy, operate. If your problem looks AI-shaped, we should talk.

01 / 04

Data agents

Agents that connect to your databases, tools, and SaaS apps, reason over what they find, and take action, with evals and human review where it counts.

e.g. nightly invoice reconciliation across three systems

02 / 04

AI insights from your data

We process surveys, transcripts, tickets, and calls at scale, then hand stakeholders the briefs and dashboards they'll actually read.

e.g. 80,000 survey responses -> a 6-page leadership brief

03 / 04

Open-source model infrastructure

We deploy and run open-weights models on your own infrastructure, with routing, autoscaling, and monitoring included, at a fraction of API cost.

e.g. Llama 3.3-70B in production under $0.40 / M tokens

04 / 04

Anything else AI-shaped

Most problems don't fit a neat category. If it can be solved, or made meaningfully better, with AI, that's our favorite kind of project.

e.g. tell us what you're stuck on

SERVICES - WHAT WE SHIP

End-to-end, from first call to ongoing operations.

Four years of engagement notes, distilled into a short list of things we do well. We don't write reports; we ship and operate.

01 / 04
[ INFRA ]
Inference infrastructure
Multi-region GPU fleets, request routing, observability, autoscaling. We run the boring layer so your team ships the interesting one.

02 / 04
[ AGENTS ]
Agentic workflows
Production agent loops with evals, guardrails, and human-in-the-loop. Built for tasks your support and ops teams are already doing.

03 / 04
[ MODELS ]
Bespoke model work
Fine-tunes, distillations, and from-scratch training for teams whose problem doesn't fit a general-purpose model.

04 / 04
[ OPERATE ]
Operations & on-call
We stay on after launch: versioning, regressions, capacity planning, cost. Weekly reports your CFO will actually read.

WRITING - FROM THE LAB

Notes from active engagements.

Short, technical write-ups of what we're shipping. No thought leadership: just the parts that took us longer than they should have.

// 2026-04-22 [ AGENTS ] 9 min
Notes on building data agents that don't lie.
How we wire retrieval, evals, and human review so agents stay tethered to source data, and how we know when they're drifting.

// 2026-03-10 [ INFRA ] 12 min
Hosting Llama 3.3-70B for under $0.40 per million tokens.
A walkthrough of the routing, batching, and quantization tricks we use to run open-weights models on client GPU fleets without blowing budget.

// 2026-02-04 [ INSIGHTS ] 7 min
Reading 80,000 employee survey responses in a weekend.
How a small pipeline of classifiers, clustering, and LLM summarization turned a year-end HR survey into a 6-page brief leadership actually read.

JOIN US

A lean team, building what we'd want to use.

We're researchers, engineers, and operators who would rather ship one thing that works than ten that don't. If our mission resonates, we'd love to talk, whether about client work, a research role, or just a coffee.

~/zerotwolabs - careers

$ mail --to=hello@zerotwolabs
$ echo "I want to "
