Comment by kovek

Comment by kovek 3 days ago

View on Hacker News

What if you can check if the user responds positively/negatively to the output, and then you train the LLM on the input it got and the output it produced?