Use Case · Edge & Perception AI

Computer vision that works in production.

We take CV systems from proof of concept to production deployment: inference pipelines, edge optimization, annotation and retraining pipelines, and drift monitoring. Reference deployments: Hayden AI (smart-city perception at municipal scale) and Cargomatic (dock and container CV for logistics).

→ Why CV projects stall before production

Computer vision POCs succeed in controlled conditions and fail in the real world. The model performs on the test set. It degrades on live camera feeds with variable lighting. It works on the GPU-equipped lab machine and can't run on the edge device in the field. It hits 95% accuracy on the initial dataset and slowly drifts as the physical environment changes — and nobody knows until users start reporting incorrect outputs.

The production gap in CV is almost never the model architecture. It's the surrounding infrastructure: the inference pipeline that can sustain the required throughput, the edge optimization that makes the model run on the target hardware, the annotation pipeline that creates a feedback loop from production errors, and the drift monitoring that catches degradation before it causes failures.

How we take CV to production

1. Data and annotation pipeline

Training data audit and gap analysis — what's in the dataset vs what the model will see in production. Annotation pipeline design for the specific object classes and edge cases that matter for your use case. Active learning integration to prioritize annotation effort on high-value examples from the production stream.
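Active learning at this stage can be as simple as uncertainty sampling over the production stream. A minimal sketch, assuming each production frame record carries the model's top detection confidence (the field names and confidence band are illustrative, not a fixed schema):

```python
# Minimal sketch of uncertainty-based sampling for annotation triage.
# Frame fields and the [0.35, 0.65] "uncertain" band are illustrative.

from dataclasses import dataclass

@dataclass
class Frame:
    frame_id: str
    top_confidence: float  # highest class score among detections in the frame

def select_for_annotation(frames, budget, low=0.35, high=0.65):
    """Pick up to `budget` frames the model is least sure about.

    Uncertain frames carry the most labeling value per annotation hour;
    confidently-handled frames are sampled separately for regression checks.
    """
    uncertain = [f for f in frames if low <= f.top_confidence <= high]
    # Most uncertain first: smallest distance from the 0.5 decision point.
    uncertain.sort(key=lambda f: abs(f.top_confidence - 0.5))
    return uncertain[:budget]

if __name__ == "__main__":
    stream = [Frame("cam3_0012", 0.41), Frame("cam3_0013", 0.97),
              Frame("cam7_0440", 0.58), Frame("cam7_0441", 0.12)]
    print([f.frame_id for f in select_for_annotation(stream, budget=2)])
```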

2. Model development and optimization

Architecture selection for the latency/accuracy/hardware tradeoff your deployment requires. Training, validation, and evaluation against production-representative data. Model optimization for the target environment: quantization, pruning, or distillation for edge; batch inference optimization for cloud.
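As one concrete illustration of the optimization step, the sketch below applies PyTorch post-training dynamic quantization to a stand-in detection head. Dynamic quantization only covers layers such as Linear; convolutional backbones typically go through static quantization or a target-specific toolchain (TensorRT, TFLite, vendor NPU SDKs), so treat this as a picture of the precision/size tradeoff rather than the full edge export path.

```python
# Hedged sketch: post-training dynamic quantization with PyTorch.
# The "head" module is a stand-in; a real deployment would export the
# full network through the target hardware's toolchain.

import torch
import torch.nn as nn

def quantize_for_edge(model: nn.Module) -> nn.Module:
    """Convert eligible layers to int8 weights; activations are quantized at runtime."""
    model.eval()
    return torch.quantization.quantize_dynamic(
        model, {nn.Linear}, dtype=torch.qint8
    )

if __name__ == "__main__":
    head = nn.Sequential(nn.Linear(1024, 512), nn.ReLU(), nn.Linear(512, 80))
    quantized = quantize_for_edge(head)
    print(quantized)  # Linear layers replaced by their dynamically quantized versions
```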

3. Inference pipeline

Production inference infrastructure designed for your throughput, latency, and hardware requirements. Streaming inference for real-time applications (Kafka-backed, sub-100ms). Batch inference for high-volume post-processing. Edge inference with OTA model update infrastructure.
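The skeleton of the streaming path, sketched with the confluent-kafka client; topic names, the broker address, and run_model() are placeholders, and batching, backpressure, and error handling are omitted:

```python
# Minimal sketch of a Kafka-backed streaming inference loop.
# Broker address, topics, and run_model() are assumptions for illustration.

import json
import time
from confluent_kafka import Consumer, Producer

def run_model(frame_bytes: bytes) -> dict:
    # Placeholder for the actual inference call (TensorRT, ONNX Runtime, ...).
    return {"detections": []}

consumer = Consumer({
    "bootstrap.servers": "broker:9092",
    "group.id": "cv-inference",
    "auto.offset.reset": "latest",
})
producer = Producer({"bootstrap.servers": "broker:9092"})
consumer.subscribe(["camera-frames"])

try:
    while True:
        msg = consumer.poll(timeout=0.05)
        if msg is None or msg.error():
            continue
        start = time.monotonic()
        result = run_model(msg.value())
        result["inference_ms"] = (time.monotonic() - start) * 1000.0
        producer.produce("detections", value=json.dumps(result).encode())
        producer.poll(0)  # serve delivery callbacks without blocking the loop
finally:
    consumer.close()
```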

4. Integration with downstream systems

Output routing to the systems that act on CV results — control signals, alert pipelines, evidence packaging, analytics databases. The integration layer determines whether CV outputs are actually useful in the operational context.
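In its simplest form the routing layer is a fan-out keyed on event class and confidence. The sketch below is illustrative only; the channel names, thresholds, and event fields stand in for whatever the downstream systems actually accept:

```python
# Illustrative fan-out for CV outputs. Channel names, thresholds, and event
# fields are assumptions; real handlers would call the alerting, evidence,
# and analytics systems the deployment actually uses.

ROUTES = {
    "alert": [],       # real-time alert pipeline / control signals
    "evidence": [],    # package frame + metadata for review or audit
    "analytics": [],   # append to the analytics database
}

def register(channel, handler):
    ROUTES[channel].append(handler)

def route(event):
    """Send one detection event to every channel it qualifies for."""
    channels = ["analytics"]                     # everything lands in analytics
    if event["score"] >= 0.9 and event["class"] in {"restricted_zone", "obstruction"}:
        channels += ["alert", "evidence"]        # high-confidence, actionable classes
    for channel in channels:
        for handler in ROUTES[channel]:
            handler(event)

if __name__ == "__main__":
    register("analytics", lambda e: print("analytics <-", e["class"], round(e["score"], 2)))
    register("alert", lambda e: print("ALERT:", e))
    route({"class": "obstruction", "score": 0.93, "camera": "dock_cam_07"})
```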

5. Drift monitoring and retraining

Confidence score and detection rate monitoring in production. Golden dataset evaluation on a scheduled cadence. Retraining pipelines that incorporate production corrections. The feedback loop that determines whether the CV system improves or degrades over time.
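One common drift signal is a population stability index (PSI) computed between the confidence distribution captured at deployment and a recent production window. A minimal sketch; the 0.25 alert threshold is a conventional rule of thumb, not a fixed value, and the simulated distributions are purely illustrative:

```python
# Hedged sketch of one drift signal: PSI between a baseline confidence
# distribution and a recent production window. Bin count and thresholds
# are common defaults, not values from any specific deployment.

import numpy as np

def psi(baseline: np.ndarray, current: np.ndarray, bins: int = 10) -> float:
    edges = np.linspace(0.0, 1.0, bins + 1)        # confidence scores live in [0, 1]
    p, _ = np.histogram(baseline, bins=edges)
    q, _ = np.histogram(current, bins=edges)
    p = np.clip(p / p.sum(), 1e-6, None)           # avoid log(0) and divide-by-zero
    q = np.clip(q / q.sum(), 1e-6, None)
    return float(np.sum((q - p) * np.log(q / p)))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    baseline = rng.beta(8, 2, 50_000)              # healthy: confidences clustered high
    recent = rng.beta(4, 3, 5_000)                 # shifted toward uncertainty
    score = psi(baseline, recent)
    status = "investigate / retrain" if score > 0.25 else "ok"
    print(f"PSI={score:.3f} -> {status}")
```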

[ The production requirement ]

CV that survives the real world.

Edge optimization
Target hardware
Model runs on the edge device that exists in the field — not the GPU workstation it was trained on. Quantization and pruning are built into every edge deployment.
Drift monitoring
Continuous
Degradation is caught by monitoring before users report incorrect outputs — confidence distributions and detection rates tracked against baseline in production
Retraining loop
Production-fed
Production errors flow back into the annotation pipeline — the model improves from what it gets wrong in the real world, not just on static test sets

[ FAQ ]

Production CV — common questions.

What's the gap between a CV proof of concept and a production deployment?
A CV POC works on clean, labeled test data in controlled conditions. A production system works on real-world data under variable lighting, occlusion, and hardware heterogeneity — and doesn't degrade silently when conditions change. The production gap is: inference infrastructure (latency and throughput), annotation and retraining pipelines (so the model improves from production errors), drift monitoring (so degradation is caught before failures), and integration with downstream systems (control signals, alert routing, evidence packaging).
How do you handle model drift in production CV deployments?
Drift is the most common failure mode in production CV, and the one teams most often fail to design for upfront. Causes: seasonal lighting changes, new object classes, hardware replacement, dataset shift from environmental changes. Our standard approach: production monitoring that tracks confidence score distributions and detection rate changes, golden dataset evaluation on a scheduled cadence, and a retraining pipeline that incorporates production corrections. Drift is detected before it reaches the failure threshold, not after users report incorrect outputs.
What does edge CV deployment require that cloud deployment doesn't?
Edge CV adds constraints: power budget (limits model size), intermittent connectivity (requires on-device inference for the critical path), hardware heterogeneity (different device generations with different GPU/NPU capabilities), thermal management, and OTA update infrastructure. The model optimization work — quantization, pruning, or distillation to fit the edge hardware — is typically 30–50% of total CV engineering effort on an edge deployment.
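The OTA piece is often the most underestimated of those constraints. A minimal sketch of the device-side update check, assuming a hypothetical manifest endpoint with version, url, and sha256 fields and a local model directory; a real rollout would add signing, staged rollout, and health checks:

```python
# Hedged sketch of the edge-side half of OTA model updates: poll a manifest,
# download a new artifact only when the checksum differs, keep the previous
# file for rollback. The URL, manifest fields, and file layout are placeholders.

import hashlib
import json
import shutil
import urllib.request
from pathlib import Path

MANIFEST_URL = "https://updates.example.com/models/manifest.json"  # placeholder
MODEL_DIR = Path("/opt/cv/models")

def sha256(path: Path) -> str:
    return hashlib.sha256(path.read_bytes()).hexdigest()

def check_for_update() -> bool:
    with urllib.request.urlopen(MANIFEST_URL, timeout=10) as resp:
        manifest = json.load(resp)
    current = MODEL_DIR / "current.onnx"
    if current.exists() and sha256(current) == manifest["sha256"]:
        return False                              # already on the published version
    staged = MODEL_DIR / f"model-{manifest['version']}.onnx"
    urllib.request.urlretrieve(manifest["url"], staged)
    if sha256(staged) != manifest["sha256"]:
        staged.unlink()                           # corrupted download; keep the old model
        return False
    if current.exists():
        shutil.copy2(current, MODEL_DIR / "previous.onnx")  # rollback copy
    staged.replace(current)                       # atomic swap on the same filesystem
    return True
```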

CV system that needs to move from POC to production?

Tell us about your use case, the hardware environment, and where the current system breaks down. We'll tell you what the path to production actually looks like.