Comment by a2128
You're adding metadata, but what problems does this added metadata solve exactly? If your converter can automatically compute these image features, then AI training and inference pipelines can trivially do the same, so I don't see the point in needing a new file format that contains these.
Moreover, models and techniques get better over time, so these stored precomputed features are guaranteed to become obsolete. Even if the features are present, easy to consume, and everyone adopts the format, pipelines still won't use features that were precomputed years ago when state-of-the-art techniques can produce more accurate ones.
The answer may be in your question.
- This is currently solved by inference pipelines.
- Models and techniques improve over time.
The ability for different agents with different specialties to add additional content while being able to take advantage of existing context is what makes the pipeline work.
Storing that content in the format could let us keep refining the information we get from the image. Each tool that touches the image can add new context or improve existing context, so the image becomes more useful with each pass.
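One way to reconcile this with the staleness concern is to record provenance alongside each entry, so a pipeline can see which tool and version produced a feature and decide whether to trust it or recompute. A minimal sketch of that idea, assuming a simple JSON-style metadata block (all field names and functions here are hypothetical, not part of any real format):

```python
from datetime import datetime, timezone

def add_feature(metadata, name, value, tool, tool_version):
    """Append a feature entry without overwriting earlier ones,
    tagging it with the tool and version that produced it."""
    metadata.setdefault("features", []).append({
        "name": name,
        "value": value,
        "tool": tool,
        "tool_version": tool_version,
        "computed_at": datetime.now(timezone.utc).isoformat(),
    })
    return metadata

def latest_feature(metadata, name):
    """Return the most recently added entry for a feature, if any."""
    entries = [f for f in metadata.get("features", []) if f["name"] == name]
    return entries[-1] if entries else None

meta = {}
add_feature(meta, "caption", "a dog on a beach", "captioner", "1.0")
# A newer tool refines the same feature later; the old entry is kept
# for provenance rather than overwritten.
add_feature(meta, "caption",
            "a golden retriever running on a beach", "captioner", "2.3")

print(latest_feature(meta, "caption")["value"])
```

With something like this, a consumer that distrusts old entries can check `tool_version` or `computed_at` and recompute only when needed, while still benefiting from whatever context earlier tools left behind.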
I like the idea.