Comment by Onavo Comment by Onavo a day ago 1 reply Copy Link View on Hacker News Q learning is great as a hello world RL project for teaching undergraduates.
Copy Link msgodel a day ago Collapse Comment - I feel like the null TD algorithm is much better if you want a "hello world." Reply View | 0 replies
I feel like the null TD algorithm is much better if you want a "hello world."