Perceptron
Specialty Lab · USA

Perceptron

Spatial intelligence for the physical world.

Founded 2025
HQ Bay Area
Country USA
Models tracked 1
Status Specialty

The Lab

A newer entrant founded in 2025, Perceptron focuses on spatial intelligence — AI that understands the physical world rather than just text. Perceptron Mk1 (Mark One) is the flagship vision-reasoning model, trained on multimodal data with heavy weighting toward 3D scene understanding, robotics simulation trajectories, and embodied environments where the model has to reason about object permanence, depth, and physics.

The pitch is AI for the layer below chatbots. Where general-purpose multimodal models like Gemini and GPT-5.5 do okay on visual tasks, Mk1 is purpose-built for spatial relationships — depth estimation from a single image, converting visual scenes into structured plans, video understanding with action-prediction baked in. The natural customer is robotics, AR/VR, and any system that needs to map the physical world rather than describe it.

The catch is that Perceptron is early. The dataset, the benchmarks, the product surface, the customer base — all are at early-stage scale. Whether spatial intelligence ends up being its own category or just a capability that frontier multimodal models absorb is the existential question. For now the technical lead in spatial-specific tasks is real, and the use cases that require it are growing.

Recent Coverage

Models

Perceptron Mk1

Flagship

Perceptron's highest-quality vision-reasoning model. Strong at spatial relationships, depth estimation from single images, and converting visual scenes into structured plans.

Context 35K
Released 2026-04
Input $0.15 / 1M
Output $1.5 / 1M
TextImageVideo

Best for

  • spatial reasoning
  • robotics planning
  • 3D scene understanding

When to Pick Perceptron · When to Pick Someone Else

✓ Pick Perceptron when

  • Spatial reasoning and 3D scene understanding
  • Robotics planning and embodied AI applications
  • Depth estimation from single images without dedicated sensors
  • Converting visual scenes into structured plans for robotic execution
  • Video understanding with spatial focus rather than narrative focus

Explore the other 11 labs

Each lab in the atlas comes with its own positioning, model line, and use cases. The point of organising the AI landscape by lab is that the answer to "which model should I use" almost always starts with "which lab is closest to what I'm trying to do."