Comment by kevingadd 2 days ago

I don't understand the purpose of effectively hard-coding things like edge detection and attention weight maps into the image. There are various ways to do edge detection and various ways to focus attention, so having that fixed and encoded into the image instead of synthesizing it on demand to suit your particular ends seems suboptimal.

Wouldn't the most useful metadata be the kind that can't be synthesized from the pixels, like labels or (for AI-generated images) the prompt used to generate the image?
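To make that concrete, here is a rough sketch of what carrying non-synthesizable metadata could look like, using nothing but standard PNG `tEXt` chunks and the Python standard library. This is only an illustration and not how the project under discussion stores anything; the helper names (`make_chunk`, `iter_chunks`, `add_text`, `read_text`, `tiny_png`) and the `Label` / `Prompt` keywords are all hypothetical choices.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def make_chunk(chunk_type: bytes, data: bytes) -> bytes:
    """Serialize one PNG chunk: length, type, data, CRC-32 over type + data."""
    crc = zlib.crc32(chunk_type + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + chunk_type + data + struct.pack(">I", crc)


def iter_chunks(png: bytes):
    """Yield (type, data) for every chunk in a PNG byte string."""
    if not png.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    pos = len(PNG_SIGNATURE)
    while pos < len(png):
        (length,) = struct.unpack(">I", png[pos:pos + 4])
        chunk_type = png[pos + 4:pos + 8]
        data = png[pos + 8:pos + 8 + length]
        yield chunk_type, data
        pos += 12 + length  # 4 (length) + 4 (type) + data + 4 (CRC)


def add_text(png: bytes, keyword: str, text: str) -> bytes:
    """Return a copy of the PNG with a tEXt chunk inserted right after IHDR."""
    # tEXt payload is: Latin-1 keyword, a NUL separator, Latin-1 text.
    payload = keyword.encode("latin-1") + b"\x00" + text.encode("latin-1")
    out = [PNG_SIGNATURE]
    for chunk_type, data in iter_chunks(png):
        out.append(make_chunk(chunk_type, data))
        if chunk_type == b"IHDR":
            out.append(make_chunk(b"tEXt", payload))
    return b"".join(out)


def read_text(png: bytes) -> dict:
    """Collect all tEXt chunks into a {keyword: text} dictionary."""
    result = {}
    for chunk_type, data in iter_chunks(png):
        if chunk_type == b"tEXt":
            keyword, _, text = data.partition(b"\x00")
            result[keyword.decode("latin-1")] = text.decode("latin-1")
    return result


def tiny_png() -> bytes:
    """Build a valid 1x1, 8-bit grayscale PNG to experiment on."""
    ihdr = struct.pack(">IIBBBBB", 1, 1, 8, 0, 0, 0, 0)
    idat = zlib.compress(b"\x00\x80")  # filter byte 0, then one gray pixel
    return (PNG_SIGNATURE + make_chunk(b"IHDR", ihdr)
            + make_chunk(b"IDAT", idat) + make_chunk(b"IEND", b""))


if __name__ == "__main__":
    tagged = add_text(tiny_png(), "Label", "cat")
    tagged = add_text(tagged, "Prompt", "a cat on a sofa")
    print(read_text(tagged))
```

The label and prompt survive in the file itself, while something like an edge map is left out because any consumer can recompute it with whatever detector suits them.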

kuberwastaken a day ago

Totally fair points, but the idea isn’t to stop at edge maps or simple overlays. This was meant as an early step toward expanding what an image can carry with it for AI workflows.

It’s definitely not finished; right now it’s more of a proof of concept (PoC) for storing richer, AI-relevant metadata in a portable way. I appreciate you taking the time to check it out.