Eventual Logo
Eventual

MultiBase

Your training data, ready when you wake up.

Understand your video. Serve it to your GPUs.

Why teams are stuck
01

Slow iteration

Teams iterate on model improvement once a week. Most of that time is waiting for data, not improving models.

02

Blind to your own data

Petabytes of video with no way to search by content. Finding failure modes means annotation sweeps that take days.

03

Starving GPUs

20–40% of training time lost to data loading. The path from storage to GPU is its own engineering project.

How we solve it

We're building the video-native index for Physical AI, turning weeks of data wrangling into hours.

Understand

Describe what you need. Get results in minutes.

  • VLM-annotate every clip at corpus scale, for a fraction of annotation vendor cost
  • Search by what's inside the video, not just metadata
  • Conversational curation agent: describe, inspect, execute
  • Managed inference, so you don’t run the models yourself
Serve

Curated data to your GPUs at line rate.

  • Video-native PyTorch dataloader: dict[str, Tensor] on GPU
  • Video + sensor co-loading as one unit
  • Rank-aware, NVMe-cached, byte-range random access
  • Close the 20–40% GPU idle time lost to data loading
Warehouse

One data warehouse, not five tools wired together.

  • Video and sensors on the same row, aligned on timecode
  • Dataset versioning and row-level provenance
  • Schema evolution to add embeddings, labels, and new sensor streams without rewriting data
Built for Physical AI Teams
Robotics labs
Video generation teams
Defense autonomy
Any team where the data pipeline (not compute) is the bottleneck
Why us
  • Built on Daft, an open-source data engine processing petabytes daily at companies like Amazon and Mobileye.
  • Founding team spent years building PB-scale sensor pipelines for autonomous vehicles. Same class of problem, at the same scale.
  • We build and operate the VLMs that power curation. Customers don’t stand up their own model-serving stack.

Design Partners

We're shaping MultiBase with leading robotics labs and GPU infrastructure providers.

We'll reach out to set up a brief discovery call.

Backed by

Y CombinatorFelicisCRVarray.vc