Comment by PaulHoule
The real problem with LLMs is that you can't get a probability estimate out of "Is {sentence_a} a plausible answer to {sentence_b}?"
See https://www.sbert.net/examples/applications/cross-encoder/RE...
Absolutely you can. Rip off the last layer, add a regression layer in its place, and fine-tune.
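To make that concrete, here is a minimal sketch of just the "new regression layer" part: a single sigmoid output trained with gradient descent on top of fixed feature vectors. Everything here is an assumption for illustration: the toy `features` stand in for whatever the truncated model would output for a (sentence_a, sentence_b) pair, and `train_head` / `score` are hypothetical names. A real setup would use a deep learning framework and would usually update the underlying model too, not only the head.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def score(weights, bias, x):
    # The new "regression layer": one linear unit followed by a sigmoid.
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)

def train_head(features, labels, lr=0.5, epochs=500):
    # Plain SGD on binary cross-entropy; only the head's parameters are trained.
    weights = [0.0] * len(features[0])
    bias = 0.0
    for _ in range(epochs):
        for x, y in zip(features, labels):
            err = score(weights, bias, x) - y  # d(loss)/d(pre-activation)
            weights = [w - lr * err * xi for w, xi in zip(weights, x)]
            bias -= lr * err
    return weights, bias

# Toy stand-ins (assumed, not real model outputs) for pooled features of
# (sentence_a, sentence_b) pairs; label 1.0 means "plausible answer".
features = [[1.0, 0.2], [0.9, 0.1], [0.1, 0.8], [0.2, 1.0]]
labels = [1.0, 1.0, 0.0, 0.0]

weights, bias = train_head(features, labels)
plausible = score(weights, bias, [1.0, 0.1])
implausible = score(weights, bias, [0.1, 0.9])
```

The output of `score` is a number between 0 and 1 that can be read as the plausibility estimate, which is the thing the parent comment says you can't get.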
With an open model, you can read the next-token probabilities straight from the output logits and probably get that probability estimate from them.
Something like: "Is {sentence_a} a plausible answer to {sentence_b}? Respond only with a single yes/no token" and then compare the probabilities assigned to the "yes" and "no" tokens.
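A rough sketch of the last step, assuming you already have the model's next-token logits for that prompt as a plain dict. The `example_logits` values are made up, `yes_probability` is a hypothetical helper, and the token strings "yes" and "no" are assumptions, since a real tokenizer may use variants such as " Yes". The idea is to renormalize over just the two answer tokens and ignore everything else in the vocabulary.

```python
import math

def yes_probability(token_logits, yes_token="yes", no_token="no"):
    """Return P(yes) after renormalizing over only the yes/no tokens."""
    diff = token_logits[yes_token] - token_logits[no_token]
    # A two-way softmax reduces to a sigmoid of the logit difference;
    # the branch keeps exp() from overflowing on large differences.
    if diff >= 0:
        return 1.0 / (1.0 + math.exp(-diff))
    e = math.exp(diff)
    return e / (1.0 + e)

# Made-up logits for illustration; a real model returns one per vocabulary entry.
example_logits = {"yes": 2.0, "no": 0.0, "maybe": -1.0}
p_yes = yes_probability(example_logits)
```

Whether that number is well calibrated is a separate question; this only shows how to turn the two token scores into a single estimate.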