We humans live in a 3D world and our training set is a continuous stereo stream ... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		ozgung on Feb 24, 2024 \| parent \| context \| favorite \| on: Generative Models: What do they know? Do they know... We humans live in a 3D world and our training set is a continuous stereo stream of a constant scene from different angles. Sora, on the other hand, learned the world by watching TV. It needs to play more video games, in order to learn 3D scenes (implicit representation of a world) and taking their pictures (rendering). Maybe that was the case I don’t know.

DinaCoder99 on Feb 24, 2024 [–]

> It needs to play more video games, in order to learn 3D scenes (implicit representation of a world) and taking their pictures (rendering).

This can be extrapolated from TV, too.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact