Comment by 76SlashDolphin
Comment by 76SlashDolphin 10 hours ago
That's a good question! We do use an LLM to categorise the MCP tools but that is at "add" or "configure" time, not at the time they are called. As such we don't actively run an LLM while the gateway is up, all the rules are already set and requests are blocked based on the hard-set rules. Plus, at this point we don't actually look at the data that is passed around, so even if we change the rules for the trifecta, there's no way for any LLM to be poisoned by a malicious actor feeding bad data.
couldnt the configuring LLM be poisoned by tool descriptions to grant the lethal trifecta to the run time LLM?