Comment by Ericson2314 2 days ago
https://news.ycombinator.com/item?id=44280505 I think that thread might help?
Total layman here, but maybe some tasks are "uniform" despite being "deep" in such a way that poor samples still suffice? I would call those "ergodic" tasks. But surely there are other tasks where this is not the case?
Good clarification. I have edited my post accordingly.
There are situations where the number of states grows much more slowly than exponentially.
Those situations are a good fit for Q-learning.
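To make that concrete, here is a rough sketch of tabular Q-learning on a toy chain where the state count grows only linearly with the length of the task. The chain environment, the function names (`q_learning_chain`, `greedy_policy`), and all hyperparameters are my own assumptions for illustration, not anything from the linked thread.

```python
import random


def q_learning_chain(n_states=10, episodes=500, alpha=0.5, gamma=0.9,
                     epsilon=0.2, seed=0):
    """Tabular Q-learning on a hypothetical 1-D chain.

    States are 0..n_states-1, the agent starts at 0, and the only reward
    (1.0) is given on reaching the last state. Actions: 0 = left, 1 = right.
    The Q-table has n_states * 2 entries, so it grows linearly with depth.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    goal = n_states - 1

    for _ in range(episodes):
        s = 0
        for _ in range(100 * n_states):  # step cap so an episode always ends
            # Epsilon-greedy action choice, breaking ties at random.
            if rng.random() < epsilon or q[s][0] == q[s][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[s][0] > q[s][1] else 1

            # Deterministic transition; moving left from state 0 stays put.
            s_next = max(0, s - 1) if a == 0 else s + 1
            done = s_next == goal
            reward = 1.0 if done else 0.0

            # Standard Q-learning update toward the bootstrapped target.
            target = reward if done else gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])

            s = s_next
            if done:
                break
    return q


def greedy_policy(q):
    """Greedy action for every non-terminal state (ties go right)."""
    return [0 if left > right else 1 for left, right in q[:-1]]


if __name__ == "__main__":
    table = q_learning_chain()
    print(greedy_policy(table))
```

The point of the toy is only that the table stays small: doubling the chain length doubles the number of entries rather than squaring or exponentiating it, which is the regime where a plain lookup table is still workable.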