vi
ma.
spatial intelligence · CII ledger · no lidar

v i m a.

Video intelligence for construction sites. No lidar.

Hardhat video becomes an inspectable evidence chain. Depth-delta filtering drops 57% of bad frames, MASt3R reconstruction grounds the rest, episodic memory binds events to time and zone.

video → depth-delta filter → MASt3R reconstruction → episodic memory → zone-aware claim
inspect the proof chainopen dashboard
scroll

evidence chain · one scroll, one proof path

every payout starts as a frame you can inspect.

vima turns egocentric video into timestamped work claims, then binds those claims to spatial zones before settlement logic sees them.

sampled frames
30
wrench time
86.7%
mean P-confidence
0.939
depth-drop rate
57%

confidence stream

CII frame certaintysampled receipt confidence before settlement logic sees the claim.
proof railframe / claim / zone / payout
image evidenceinteractive chain
temporal safety score timeline from the vima evidence run
01 · frame evidence30 frames
01 · frame evidence

timestamped bodycam frames become an inspectable CII trail before anything touches payout logic.

sourcebodycam video
claimCII label stream
auditframe-level review

frame stream · drag to inspect

Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item
Showcase item

episode taxonomy · five shapes from the paper

every claim is one of five shapes.

Vima's episodic memory emits one of five episode types per detected event. The shape makes the claim reviewable in seconds. A human swipes, the claim resolves, the reviewed evidence enters the audit trail.

masonry_work_candidate — observed work in progress on the masonry plane

scaffold_zone_visible — scaffolding plank or rail in the active frame

safety_edge_context — exposed edge or fall hazard in view

foreground_worker_present — primary subject in motion or labor

frame_001
frame_004
frame_007
frame_010
frame_013
frame_015
frame_001
frame_004
frame_007
frame_010
frame_013
frame_015
frame_001
frame_004
frame_007
frame_010
frame_013
frame_015
frame_001
frame_004
frame_007
frame_010
frame_013
frame_015

ledger · settlement receipt

this is where a frame becomes inspectable evidence.

the ledger is the audit handoff: frame labels stay visible while the payout gate scores productive time, blocks idle work, and preserves the receipt.

eligible frames26 / 30
wrench time86.7%
reward weight0.867
audit hash9f2c...81a
frame receiptevidence gate · review pending
f-00000.0s
Pblock alignmentzone a
confidence0.95
weight1.00x
settles
f-183183.2s
Cmaterial stagingzone b
confidence0.88
weight0.34x
context
f-808808.1s
NCidle walkingzone b
confidence0.82
weight0.00x
blocked
f-12341234.0s
Psite setupzone c
confidence0.91
weight1.00x
settles
receipt hashframes:30 · zones:3 · evidence:ready
review pathopen frame trail before review

verify · human review of episodic memory claims

every claim earns its truth.

Centralized labeling pipelines move at a worker's reading speed. Vima reduces the unit of work to one tap on a phone. The model proposes its episodic memory claims, humans confirm or reject. Right to accept, left to reject, up to skip. The reviewed claims become the audit trail every safety incident depends on.

  • · tinder-style swipe deck
  • · accept · reject · skip per claim
  • · episode types match the paper's five
  • · human review feeds the audit trail
  • · every reviewed frame stays inspectable
  • · reviewer decisions become ground truth
vima · claim 17 of 30
masonry_work_candidate
worker raises trowel at frame 0:42
confidence 0.95 · zone a
swipe → accept · review pending
frame · 17 / 30session · 2:14

pipeline

video in. auditable work claims out.

open dashboard
frame crop
zone map
payout trace
route fieldvideo → claim → zone → payout

ready for the field

turn the demo stream into an auditable evidence chain.