Off-policy actor-critic with linear state features. Includes eligibility traces.
Horde: A Scalable Real-time Architecture for Learning Knowledge from Unsupervised Sensorimotor Interaction
Uses RL value functions to encode semantic knowledge, specifically knowledge learned by a robot from its own sensorimotor interaction.
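To make the idea of a value function as a piece of knowledge concrete, here is a minimal sketch of one prediction learner. It assumes linear features and uses plain on-policy TD(λ) with accumulating eligibility traces, which is a simplification: the Horde paper relies on off-policy gradient-TD learners. The function name `td_lambda_gvf`, the terms "cumulant" and "continuation" as parameter names, and the step-size and trace-decay defaults are all illustrative choices, not taken from the paper.

```python
import numpy as np

def td_lambda_gvf(features, cumulants, gammas, alpha=0.1, lam=0.9):
    """Learn weights w so that w @ x_t predicts the discounted sum of
    future cumulants (a value-function-style prediction).

    features:  (T, n) array, feature vector x_t for each time step
    cumulants: (T,) array, signal being predicted (hypothetical name)
    gammas:    (T,) array, per-step continuation / discount factors
    """
    n = features.shape[1]
    w = np.zeros(n)  # prediction weights
    e = np.zeros(n)  # eligibility trace
    for t in range(len(features) - 1):
        x, x_next = features[t], features[t + 1]
        # TD error: one-step target minus current prediction
        delta = cumulants[t + 1] + gammas[t + 1] * (w @ x_next) - (w @ x)
        # Accumulating trace, decayed by continuation factor and lambda
        e = gammas[t] * lam * e + x
        w += alpha * delta * e
    return w

# Usage: two alternating states, one-hot features, constant cumulant.
T = 5000
states = np.arange(T) % 2
w = td_lambda_gvf(np.eye(2)[states], np.ones(T), np.full(T, 0.5))
print(w)
```

In this toy stream the learned weights answer the question "how much of the cumulant signal will I see, discounted, from this state onward?", which is the sense in which a value function can stand for a fact about the world.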
