Submitted by verbigratia t3_zsvsic in MachineLearning
Every lunar lander tutorial or example I've found so far uses deep RL. Is classical Q learning such an obviously bad idea that no-one bothers with it? I've had some success recently applying Q learning to lunar lander (converting the continuous observations into discrete values) and am surprised there aren't more tutorials about this approach. Am I missing something?
[deleted] t1_j1ahe6n wrote
[deleted]