Skip to main content

Deploy Page

Route: /deploy/:tab — Source: ui/src/pages/DeployPage.tsx

Promote trained models to a serving endpoint, configure named endpoints, run side-by-side comparisons, watch live metrics, and manage edge devices. Five tabs in one page.

Deploy — overview tab

Tabbing through Deploy: models → endpoints → test → monitor → edge.

Tabs

TabRoutePurpose
Models/deploy/modelsThe model registry — every trained run with metrics, aliases, and the active flag
Endpoints/deploy/endpointsNamed serving endpoints (/endpoints/<slug>/predict)
Test/deploy/testDrag-and-drop demo: send an image to the active model and see the prediction
Monitor/deploy/monitorLive throughput, latency, and error rate per endpoint
Edge/deploy/edgeRegistered edge devices, last heartbeat, on-device model version

Models

Deploy — models

Click the star icon next to any model in the registry to mark it active. The default /predict/* endpoints will use it from the next request. Promote a model to a labeled alias (production, staging, canary) so endpoints pinned to that alias pick up the new version automatically.

Endpoints

Deploy — endpoints

Create endpoints with their own slug, model pin, and routing rules. Useful for:

  • Blue/green deploys — run a new model on /endpoints/staging/predict until you're confident, then promote.
  • Per-customer models/endpoints/customer-x/predict pinned to a custom-trained variant.
  • A/B testing — see Experiments.

Test

Deploy — test

A scratchpad for sanity-checking the active model. Drop an image; the page POSTs it to /api/v1/predict/image and renders the boxes + scores side-by-side with the source.

Monitor

Deploy — monitor

Live charts powered by GET /api/v1/metrics/summary:

  • Requests per second per endpoint
  • p50 / p95 / p99 latency
  • Error rate (HTTP 5xx + inference exceptions)
  • Class distribution of predictions

Edge

Deploy — edge devices

Registered Jetson and Pi+TPU devices. Each row shows last heartbeat, battery / temperature, on-device model version, and recent inference rate. Promoting a new model triggers an OTA download on the next heartbeat — see Edge deployment.