LAB ONLINE · RACK A4 · 8× H100 · DRAW 2.4kW

Where the machines live.

An operational AI lab, wide open. GPUs spinning in a London data centre, a fleet of agents running experiments, every kernel launch and cache hit tailed in public. This page is the front door. Everything below is live.

LINK · STABLE PING · 21 ms REGION · EU-WEST ASN · 13335 UPTIME · 18.4 d ALERT · NOMINAL FPS · 60
CLUSTER UPTIME 0 d since last warm reboot
JOBS RUNNING 0 across 4 pools
NEURONS / SEC 0 Workers AI · 7d avg
DATA WRITTEN 0 TB artefacts past 30d
EDGE LATENCY 0 ms median p50
ALERT LEVEL NOMINAL 0 paged in 14d

// 00 · primary rack

Rack A4 · 8× H100

The pool that runs the inference batch. Each card pulls ~312 W under load. The rack peaks at 2.4 kW with all eight at full tilt. Direct-to-chip liquid cooling, A + B redundant feeds.

GPU-POOL-A · 8× H100 · 80GB HBM3 RACK · A4 · ONLINE
GPU00 61°
GPU01 75°
GPU02 76°
GPU03 68°
GPU04 82°
GPU05 70°
GPU06 62°
GPU07 64°
DRAW2.4kW FANS4280RPM INLET22°C PUE1.12

// 01 · provisioned hardware

The metal underneath

What the lab runs on. Not stock photos. These are the actual specs the inference pool is sized against.

PROVISIONED HARDWARE · DC-LON-04 SPEC · v2.4
COMPUTE 12.8 PF FP8 sparse tensor
MEMORY 5.12 TB HBM3e · 8 GB/s pin
BANDWIDTH 900 GB/s NVLink C2C
INTERCONNECT 400 Gbit InfiniBand NDR
POWER 57.6 kW rack peak
COOLING D2C direct-to-chip liquid

// 02 · the floor

Data centre · LON-04 · floor 2

6 rows · 30 racks · 240 U usable. Cold aisle 18°C, hot aisle 32°C. PDUs at each end run A+B redundant feeds.

BUSY WARN CRIT IDLE · DC-LON-04 · floor 2 · cold-aisle 18°C
PARAMS70.2B CTX131k QUANTFP8 TTFT168ms TOK/s312

// 03 · live telemetry

Inference signals

Six core readouts from the inference pipeline. Refresh cadence ≈ 1s. Bands tuned to a typical 8-GPU pool under normal demand.

GPU temp NOM
64.0 °C
min 56 · max 78
neurons / sec NOM
6400
min 3200 · max 9400
req / min NOM
188
min 80 · max 340
p50 latency NOM
168 ms
min 110 · max 260
queue depth NOM
3
min 0 · max 14
tokens / sec NOM
268
min 140 · max 480

// 04 · runtime trace

System output, tailed

A continuous read on what the lab is doing right now: agent turns, GPU jobs, edge cache, moderation review.

Live console
TAILING lines/sec
72270 [ops] Boot sequence initiated. Pid 1 = supervisord.
72270 [k8s] Reconciled 14 / 14 desired pods. Drift OK.
72271 [fs] R2 bucket `ai-uploads` mounted (4MB used / quota 5GB).
72272 [ai] Model registry warmed: Llama-3.3-70B · SDXL · Whisper-large-v3.
72273 [gpu] Probe complete · 8 GPUs available · avg 62°C · 0 throttled.
72274 [net] Edge fanout 308/308 colos. Median RTT 21ms (p95 38ms).
72274 [tls] Cert rotation lock acquired. Chain valid for 78 days.
72275 [mod] Moderation pipeline ready. Threshold 0.82 · blocklist 12.4k entries.
72276 [agt] Agent fleet check-in: 6/6 online · 0 quarantined.
72277 [rt] WebSocket upgrade pool ready. Hold capacity 1024.
72278 [sec] Turnstile binding live. Bot pass-rate target 99.2%.
72278 [ai] Llama-3.3-70B warm-up complete. First-token latency 168ms.
72279 [ops] D1 migration check: 0 pending.
72280 [k8s] HPA scaled gpu-pool · 6 → 8 replicas (q-depth above threshold).
72281 [agt] agt://researcher initialised. Tools: web_search, code_run, files.
72282 [gpu] Job queued · id 7c4a · sdxl · 4 images · est 18s.
72282 [ai] Job 7c4a complete · 17.4s wall · neurons 21,840.
72283 [mod] Reviewed 1 image · score 0.04 · pass.

// 05 · compute fleet

32 node host map

Per-node temperature and load. Click any tile for full diagnostics. Critical-state cells pulse red and auto-page the on-call console.

Compute fleet

32 nodes · 88% utilised

28 busy 5 warn 2 crit 2 idle

// 06 · network

Edge topology & flow

Inference traffic fans out across 308 Cloudflare colos. Live RTT, packet flow, and active session count.

FLOW 3.2 Gbps
RTT 21 ms
COLOS 308 live
PKTS/s
colo · 15 markers edge cache lane sessions live

// 07 · experiments

Active research

Long-running experiments inside the lab. Each runs on a dedicated GPU slice. Results stream to the artefact store.

EX-014 RUNNING

Recursive critique tower

Two-model debate loop. Researcher proposes, critic refutes, three rounds, judge ranks. Tracking inter-rater agreement vs. round depth.

64%
model llama-3.3-70b ETA 00:08:42
#agents#judging#rlhf
EX-013 RUNNING

SDXL stylebank explorer

Sweeping 64 controlnet conditioning combos × 12 schedulers. Sampling 4 images per combo. Saving thumbnail grids for human review.

28%
model sdxl-refiner-1.0 ETA 01:47:11
#vision#sweep#styles
EX-012 PAUSED

Whisper diarisation eval

Comparing v3-large turbo against pyannote-3.1 on a 14-speaker meeting corpus. Measuring DER, JER, and word-attribution accuracy.

42%
model whisper-large-v3
#audio#eval
EX-011 DONE

Embedding cache half-life

Production trace replay against a TTL'd embed cache. Looking at hit-rate vs. evict policy: LRU, LFU, ARC, random.

100%
model bge-large-en-v1.5
#embeddings#cache
EX-010 QUEUED

Tool-router fine-tune

LoRA on Llama-8b → tool-call routing classifier. Train on 1.2M synthetic dispatches. Eval on hand-labelled 4k set.

0%
model llama-3.1-8b ETA 03:00:00
#fine-tune#routing
EX-009 FAILED

Lab-wide drift watch

Continuous K-S test on response-length and tool-choice distributions. Alerts if today's drift exceeds 2σ from a 30-day baseline.

89%
model qwen-72b
#ops#monitoring

// 08 · agent fleet

Autonomous workers

Six long-running agents share a memory store and a tool registry. Each one owns a slice of recurring work.

agt://researcher

Researcher

THINKING

Hunts source material across web, papers, and the org's own docs, then produces digestible briefs with citations.

// last action tool_call(web_search) → 4 results
turns 1,842 mem 4,820KB tools 6
agt://editor

Editor

ACTIVE

Takes drafts and applies the house style: tone, hierarchy, claim-checking. Flags every unsourced sentence.

// last action rewriting section 3, clarity pass
turns 998 mem 2,140KB tools 4
agt://planner

Planner

IDLE

Breaks an arbitrary goal into a runnable plan with explicit dependencies. Owns the kanban for the lab itself.

// last action awaiting next dispatch
turns 612 mem 3,380KB tools 5
agt://critic

Critic

ACTIVE

Tries to break whatever the rest of the fleet ships. Adversarial probes, regression replays, sanity-checks.

// last action probing edge cases on EX-014 outputs
turns 1,276 mem 1,980KB tools 3
agt://librarian

Librarian

THINKING

Owns memory hygiene. Compacts the long-term store, deduplicates, tags new entries, evicts stale ones.

// last action compacting last 7d into a summary
turns 3,401 mem 9,210KB tools 2
agt://forecaster

Forecaster

PAUSED

Time-series brain. Watches the metrics and predicts trouble: thermal events, quota burn, traffic spikes.

// last action awaiting baseline refresh
turns 428 mem 1,502KB tools 3

// engage

Pick a surface to open up.

This is a working lab, not a brochure. Every section above is a live readout. The deeper systems live behind their own dashboards.