Comment by porphyra 16 hours ago

> if they have anything figured out besides "collect spatial data" like imagenet

I mean, she launched her whole career with ImageNet, so you can hardly blame her for thinking that way. On the other hand, there's something bitter-lesson-pilled about letting a model "figure out" spatial relationships just by looking at tons of data. And tbh the recent progress [1] of worldlabs.ai (Dr. Fei-Fei Li's startup) looks quite promising for a model that understands stuff including reflections and stuff.

[1] https://www.worldlabs.ai/blog/rtfm

godelski 16 hours ago

> looks quite promising for a model that understands stuff including reflections and stuff.

I got the opposite impression when trying their demo [0]. Some of these issues show up even in their own examples, like objects staying a constant size despite moving, as if the parallax or depth information were missing. Not to mention that they show it walking on water lol

As for reflections, I don't get that impression either. They seem extremely brittle to movement.

[0] http://0x0.st/K95T.png