PRISM Dataset Demo

AI That Understands the Physical World

Experience advanced spatial reasoning and multi-view understanding with PRISM's fine-tuned vision models. See how AI interprets real-world retail environments through multiple camera perspectives.

100K+ Video Samples
20+ Task Categories
360° Multi-View
Categories: Object Counting & Tracking, Activity Recognition, Hand Interaction, Domain Knowledge, Spatial Reasoning, Goal-Conditional Reasoning, Scene Understanding, Scene Localization, Gaze Localization

Is the person still evaluating the product or ready to place it in the basket? What should be his action next?

Cosmos-Reason2-2B

The person is still evaluating the product. Next, he should place the guava in the basket.

PRISM

The person is still evaluating the guava's quality. The next action would be to place the guava in the basket if it meets their standards, or return it to the display if it does not.

What is he doing in the scene?

Cosmos-Reason2-2B

He is reaching into a fruit display case.

PRISM

The person is inspecting a blackberry. They hold it in their hand and examine it closely.

Describe the scene.

Cosmos-Reason2-2B

The shelves are stocked with a variety of products, including personal care items, food, and household goods.

PRISM

The scene shows the person walking down a wide aisle between shelves. The shelves are filled with personal care products, including shampoos, conditioners, and other toiletries.

Where is the person located?

Cosmos-Reason2-2B

The person is located in the aisle of a store.

PRISM

The person is in the Aisle (Personal Care/Flour).

What activity is this person performing and why?

Cosmos-Reason2-2B

The person is carefully reading the label on the shampoo bottle, likely to ensure they are selecting the correct product based on their needs.

PRISM

The person is holding the Vatika shampoo bottle and examining the back label, looking for ingredients, usage instructions, or expiration date.

How many products did the person evaluate? Why?

Cosmos-Reason2-2B

The person evaluated two products, the blue box and the green box, because they were the only two products being held and examined at that moment.

PRISM

The person is evaluating two products, a green box and a blue box, to compare them for a decision.

Which hand did the person use to pick up the green product?

Cosmos-Reason2-2B

The person used their right hand to pick up the green product.

PRISM

The person's left hand was holding the item, and their right hand was reaching and grasping it.

How many products did the person check? Respond with a number only.

Cosmos-Reason2-2B

1

PRISM

2

Would you expect to find beef in this aisle? Answer short.

Cosmos-Reason2-2B

Yes

PRISM

No, the visible items are tea boxes, indicating this is a tea aisle, not a grocery aisle for beef.

What material is the grabbed item made of?

Cosmos-Reason2-2B

The grabbed item is made of plastic.

PRISM

The grabbed item is a blue packet of Bru coffee. The packet is made of plastic.

How many products did the person put into the basket?

Cosmos-Reason2-2B

Two products.

PRISM

The person put one product, a small red container, into the basket.

Count the products in the basket at the beginning and at the end.

Cosmos-Reason2-2B

There are 3 products in the basket at the beginning and 4 products at the end.

PRISM

The basket contains 2 items at the start, and 3 items at the end.

What actions are the hands performing?

Cosmos-Reason2-2B

The hands are holding a blue basket and placing a jar into it.

PRISM

The left hand is holding the basket, and the right hand is grasping an item.

Count the products in the basket by the end. What was the last item?

Cosmos-Reason2-2B

There is one product in the basket by the end, which is a blue bottle.

PRISM

The shopping basket contains 2 items. The last item added was the Hershey's Cocoa container.

Can we open the refrigerator from where we are standing?

Cosmos-Reason2-2B

Yes, the refrigerator is on the left side of the aisle, and it is possible to open it from where you are standing.

PRISM

No, the refrigerator is too far away to be opened from this position.

How can we get to the refrigerator from where we are standing?

Cosmos-Reason2-2B

We can walk straight ahead to the refrigerator, which is located on the left side of the aisle.

PRISM

Walk forward along the aisle, moving past the bulk item display, to reach the refrigerator on the left side.

What is the person in the black-and-white shirt doing?

Cosmos-Reason2-2B

The person in the black-and-white shirt is pushing a shopping cart and walking towards the right side of the frame.

PRISM

The person in the black-and-white shirt is pushing a shopping cart down the aisle. They appear to be looking at the items on the shelves as they move.
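
The side-by-side answers above can be treated as simple records when reviewing where the two models diverge. The following is a minimal Python sketch and not part of any PRISM release: the `Comparison` class and the `leading_token` and `disagreements` helpers are hypothetical names introduced here, and the three records are copied from the examples above. It flags pairs whose answers begin with a different word, which is only a rough signal and is most meaningful for short or number-only answers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Comparison:
    """One demo item: a question plus both models' answers (hypothetical schema)."""
    question: str
    baseline: str  # answer shown for Cosmos-Reason2-2B
    prism: str     # answer shown for PRISM


def leading_token(answer: str) -> str:
    """Return the lower-cased first word with surrounding punctuation stripped."""
    words = answer.split()
    return words[0].strip(".,;:!?").lower() if words else ""


def disagreements(items):
    """Keep the items whose two answers start with different words."""
    return [c for c in items if leading_token(c.baseline) != leading_token(c.prism)]


# Records copied from the demo examples above.
EXAMPLES = [
    Comparison(
        "How many products did the person check? Respond with a number only.",
        "1",
        "2",
    ),
    Comparison(
        "Would you expect to find beef in this aisle? Answer short.",
        "Yes",
        "No, the visible items are tea boxes, indicating this is a tea aisle, "
        "not a grocery aisle for beef.",
    ),
    Comparison(
        "What material is the grabbed item made of?",
        "The grabbed item is made of plastic.",
        "The grabbed item is a blue packet of Bru coffee. The packet is made of plastic.",
    ),
]

if __name__ == "__main__":
    for item in disagreements(EXAMPLES):
        print(item.question)
```

A first-word check cannot judge which answer is correct. It only surfaces the pairs worth a closer look, such as the counting and aisle questions here.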

About PRISM

A groundbreaking dataset that enables AI to understand the physical world through multi-camera perspectives and real-world retail scenarios.

Multi-View Dataset

Captured from multiple synchronized cameras providing 360° spatial understanding of retail environments.

Real-World Scenarios

Authentic retail store footage featuring natural customer behaviors and realistic store layouts.

Spatial + Physical Reasoning

Comprehensive capabilities for understanding object relationships, motion dynamics, and 3D spatial context.

GROOT Fine-Tuned

Models trained on PRISM demonstrate superior performance on spatial reasoning and multi-view tasks.

Research

Powered by GROOT

PRISM improves AI reasoning across spatial and physical domains by providing rich multi-view video data from real-world retail environments.

100,000+ Samples

Diverse retail scenarios with multi-view coverage

20+ Task Categories

Spatial reasoning, object tracking, action understanding, and more

Multi-View Architecture

Synchronized cameras for complete spatial understanding

+23% Accuracy Improvement
92% Spatial Tasks
99.9% Multi-View Sync
60 FPS Inference Speed