LAB ONLINE · RACK A4 · 8× H100 · DRAW 2.4kW

Where the machines live.

An operational AI lab, wide open. GPUs spinning in a London data centre, a fleet of agents running experiments, every kernel launch and cache hit tailed in public. This page is the front door. Everything below is live.

Tools → Open console →

LINK · STABLE PING · 21 ms REGION · EU-WEST ASN · 13335 UPTIME · 18.4 d ALERT · NOMINAL FPS · 60

CLUSTER UPTIME 0 d since last warm reboot

JOBS RUNNING 0 across 4 pools

NEURONS / SEC 0 Workers AI · 7d avg

DATA WRITTEN 0 TB artefacts past 30d

EDGE LATENCY 0 ms median p50

ALERT LEVEL NOMINAL 0 paged in 14d

// 00 · primary rack

Rack A4 · 8× H100

The pool that runs the inference batch. Each card pulls ~312 W under load. The rack peaks at 2.4 kW with all eight at full tilt. Direct-to-chip liquid cooling, A + B redundant feeds.

GPU-POOL-A · 8× H100 · 80GB HBM3 RACK · A4 · ONLINE

GPU00 61°

GPU01 75°

GPU02 76°

GPU03 68°

GPU04 82°

GPU05 70°

GPU06 62°

GPU07 64°

// 01 · provisioned hardware

The metal underneath

What the lab runs on. Not stock photos. These are the actual specs the inference pool is sized against.

PROVISIONED HARDWARE · DC-LON-04 SPEC · v2.4

COMPUTE 12.8 PF FP8 sparse tensor

MEMORY 5.12 TB HBM3e · 8 GB/s pin

BANDWIDTH 900 GB/s NVLink C2C

INTERCONNECT 400 Gbit InfiniBand NDR

POWER 57.6 kW rack peak

COOLING D2C direct-to-chip liquid

// 02 · the floor

Data centre · LON-04 · floor 2

6 rows · 30 racks · 240 U usable. Cold aisle 18°C, hot aisle 32°C. PDUs at each end run A+B redundant feeds.

BUSY WARN CRIT IDLE · DC-LON-04 · floor 2 · cold-aisle 18°C

PARAMS70.2B CTX131k QUANTFP8 TTFT168ms TOK/s312

// 03 · live telemetry

Inference signals

Six core readouts from the inference pipeline. Refresh cadence ≈ 1s. Bands tuned to a typical 8-GPU pool under normal demand.

GPU temp NOM

64.0 °C

neurons / sec NOM

6400

req / min NOM

188

p50 latency NOM

168 ms

queue depth NOM

3

tokens / sec NOM

268

// 04 · runtime trace

System output, tailed

A continuous read on what the lab is doing right now: agent turns, GPU jobs, edge cache, moderation review.

Live console

TAILING — lines/sec

72270 [ops] Boot sequence initiated. Pid 1 = supervisord.

72270 [k8s] Reconciled 14 / 14 desired pods. Drift OK.

72271 [fs] R2 bucket `ai-uploads` mounted (4MB used / quota 5GB).

72272 [ai] Model registry warmed: Llama-3.3-70B · SDXL · Whisper-large-v3.

72273 [gpu] Probe complete · 8 GPUs available · avg 62°C · 0 throttled.

72274 [net] Edge fanout 308/308 colos. Median RTT 21ms (p95 38ms).

72274 [tls] Cert rotation lock acquired. Chain valid for 78 days.

72275 [mod] Moderation pipeline ready. Threshold 0.82 · blocklist 12.4k entries.

72276 [agt] Agent fleet check-in: 6/6 online · 0 quarantined.

72277 [rt] WebSocket upgrade pool ready. Hold capacity 1024.

72278 [sec] Turnstile binding live. Bot pass-rate target 99.2%.

72278 [ai] Llama-3.3-70B warm-up complete. First-token latency 168ms.

72279 [ops] D1 migration check: 0 pending.

72280 [k8s] HPA scaled gpu-pool · 6 → 8 replicas (q-depth above threshold).

72281 [agt] agt://researcher initialised. Tools: web_search, code_run, files.

72282 [gpu] Job queued · id 7c4a · sdxl · 4 images · est 18s.

72282 [ai] Job 7c4a complete · 17.4s wall · neurons 21,840.

72283 [mod] Reviewed 1 image · score 0.04 · pass.

// 05 · compute fleet

32 node host map

Per-node temperature and load. Click any tile for full diagnostics. Critical-state cells pulse red and auto-page the on-call console.

Compute fleet

32 nodes · 88% utilised

28 busy 5 warn 2 crit 2 idle

// 06 · network

Edge topology & flow

Inference traffic fans out across 308 Cloudflare colos. Live RTT, packet flow, and active session count.

FLOW 3.2 Gbps

RTT 21 ms

COLOS 308 live

PKTS/s —

colo · 15 markers edge cache lane — sessions live

// 07 · experiments

Active research

Long-running experiments inside the lab. Each runs on a dedicated GPU slice. Results stream to the artefact store.

EX-014 RUNNING

Recursive critique tower

Two-model debate loop. Researcher proposes, critic refutes, three rounds, judge ranks. Tracking inter-rater agreement vs. round depth.

64%

#agents#judging#rlhf

EX-013 RUNNING

SDXL stylebank explorer

Sweeping 64 controlnet conditioning combos × 12 schedulers. Sampling 4 images per combo. Saving thumbnail grids for human review.

28%

#vision#sweep#styles

EX-012 PAUSED

Whisper diarisation eval

Comparing v3-large turbo against pyannote-3.1 on a 14-speaker meeting corpus. Measuring DER, JER, and word-attribution accuracy.

42%

#audio#eval

EX-011 DONE

Embedding cache half-life

Production trace replay against a TTL'd embed cache. Looking at hit-rate vs. evict policy: LRU, LFU, ARC, random.

100%

#embeddings#cache

EX-010 QUEUED

Tool-router fine-tune

LoRA on Llama-8b → tool-call routing classifier. Train on 1.2M synthetic dispatches. Eval on hand-labelled 4k set.

0%

#fine-tune#routing

EX-009 FAILED

Lab-wide drift watch

Continuous K-S test on response-length and tool-choice distributions. Alerts if today's drift exceeds 2σ from a 30-day baseline.

89%

#ops#monitoring

// 08 · agent fleet

Autonomous workers

Six long-running agents share a memory store and a tool registry. Each one owns a slice of recurring work.

agt://researcher

Researcher

THINKING

Hunts source material across web, papers, and the org's own docs, then produces digestible briefs with citations.

// last action tool_call(web_search) → 4 results

agt://editor

Editor

ACTIVE

Takes drafts and applies the house style: tone, hierarchy, claim-checking. Flags every unsourced sentence.

// last action rewriting section 3, clarity pass

agt://planner

Planner

IDLE

Breaks an arbitrary goal into a runnable plan with explicit dependencies. Owns the kanban for the lab itself.

// last action awaiting next dispatch

agt://critic

Critic

ACTIVE

Tries to break whatever the rest of the fleet ships. Adversarial probes, regression replays, sanity-checks.

// last action probing edge cases on EX-014 outputs

agt://librarian

Librarian

THINKING

Owns memory hygiene. Compacts the long-term store, deduplicates, tags new entries, evicts stale ones.

// last action compacting last 7d into a summary

agt://forecaster

Forecaster

PAUSED

Time-series brain. Watches the metrics and predicts trouble: thermal events, quota burn, traffic spikes.

// last action awaiting baseline refresh

// engage

Pick a surface to open up.

This is a working lab, not a brochure. Every section above is a live readout. The deeper systems live behind their own dashboards.

06 AGENT FLEET → 14 COMPUTE NODES → 06 EXPERIMENTS → 308 EDGE COLOS → LIVE CONSOLE →