Specialty Lab · USA

Perceptron

Spatial intelligence for the physical world.

All Labs Jump to models

Founded 2025

HQ Bay Area

Country USA

Models tracked 1

Status Specialty

The Lab

A newer entrant founded in 2025, Perceptron focuses on spatial intelligence — AI that understands the physical world rather than just text. Perceptron Mk1 (Mark One) is the flagship vision-reasoning model, trained on multimodal data with heavy weighting toward 3D scene understanding, robotics simulation trajectories, and embodied environments where the model has to reason about object permanence, depth, and physics.

The pitch is AI for the layer below chatbots. Where general-purpose multimodal models like Gemini and GPT-5.5 do okay on visual tasks, Mk1 is purpose-built for spatial relationships — depth estimation from a single image, converting visual scenes into structured plans, video understanding with action-prediction baked in. The natural customer is robotics, AR/VR, and any system that needs to map the physical world rather than describe it.

The catch is that Perceptron is early. The dataset, the benchmarks, the product surface, the customer base — all are at early-stage scale. Whether spatial intelligence ends up being its own category or just a capability that frontier multimodal models absorb is the existential question. For now the technical lead in spatial-specific tasks is real, and the use cases that require it are growing.

Recent Coverage

Agents

Broken Agents? It's Not the Model. It's the Harness.

Models

Perceptron Mk1

Flagship

Perceptron's highest-quality vision-reasoning model. Strong at spatial relationships, depth estimation from single images, and converting visual scenes into structured plans.

Context 35K

Released 2026-04

Input $0.15 / 1M

Output $1.5 / 1M

Best for

spatial reasoning
robotics planning
3D scene understanding

When to Pick Perceptron · When to Pick Someone Else

✓ Pick Perceptron when

Spatial reasoning and 3D scene understanding
Robotics planning and embodied AI applications
Depth estimation from single images without dedicated sensors
Converting visual scenes into structured plans for robotic execution
Video understanding with spatial focus rather than narrative focus

✕ Look elsewhere for

Conversational AI — OpenAI or Anthropic
Long-context text analysis — Anthropic Claude
Code generation — DeepSeek Coder or Claude
Image generation — OpenAI GPT-Image-2
General multimodal chat — Gemini Pro

Explore the other 11 labs

Each lab in the atlas comes with its own positioning, model line, and use cases. The point of organising the AI landscape by lab is that the answer to "which model should I use" almost always starts with "which lab is closest to what I'm trying to do."

Back to all labs Next: Nous Research →

Perceptron

The Sensor Gap Killing Physical AI

The House in the Photo Doesn't Exist

The Agentic Development Loop

Broken Agents? It's Not the Model. It's the Harness.

Perceptron Mk1

✓ Pick Perceptron when

✕ Look elsewhere for

Explore the other 11 labs