Media Summary: Agent trained to fetch the yellow bananas and avoid the blue ones, using the DQN algorithm discussed in this paper. Udacity Deep Reinforcement Learning Nanodegree - Navigation Picking gold avoiding blue. Achieved score: 23.
Udacity Deep Reinforcement Learning Project 1 - Detailed Analysis & Overview
Agent trained to fetch the yellow bananas and avoid the blue ones, using the DQN algorithm discussed in this paper. Udacity Deep Reinforcement Learning Nanodegree - Navigation Picking gold avoiding blue. Achieved score: 23. DRL - Multi-Agent DDPG Algorithm - Tennis Collaboration Using the Unity agent/environment "Tennis", this In this environment, a double-jointed arm can move to target locations. A reward of +0.1 is provided for each step that the agent's ...