Comment by tifa2up
I'm building https://github.com/agentset-ai/agentset, RAG as a service that works quite well out of the box.
We achieve this performance by baking in the best practices before any tweaking
I'm building https://github.com/agentset-ai/agentset, RAG as a service that works quite well out of the box.
We achieve this performance by baking in the best practices before any tweaking
So retrieve once on the first message, and then use that context for the rest of the conversation?
How does it handle retrieval in a multi-turn conversation? Is there an intent graph involved?
Does it summarize past context or keep it all?