Comment by andy_xor_andrew 2 days ago
The article mentions that AlphaGo/Mu/Zero were not based on Q-Learning. I'm no expert, but I thought AlphaGo was based on DeepMind's "Deep Q-Learning". Is that not right?
DeepMind's earlier success with Atari was based on off-policy Q-Learning (DQN).