Comment by leetrout
Quick feedback if you're still monitoring the thread:
I did /imagine cheeseburger and /imagine a fire extinguisher and both were correctly generated but the agent has no context. when I ask what they are holding in both cases they ramble about not holding anything and referencing lemons and lemon trees.
I expected it to retain the context as the chat continues. If I ask it what it imagined it just tells me I can use /imagine.
Good idea. We need to do that. I'm also excited to push the /imagine stuff further and have B-roll interspersed with the talking (like a documentary) or even follow the character around as they move (like a video game)