AlphaGo Zero, all RL self-play.
Getting a Keras LSTM layer to work on MaLPi
Training on batch sizes and/or sequence lengths longer than one, while still being able to run one image at a time on the robot.
Pre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning
Pre-train using supervised learning on human-provided demonstrations.
Experimenting with OpenAI's Baselines code
I forked OpenAI's Baselines code and made a few changes. This was my first full run before I started playing around with the model architecture.
Changes from OpenAI's:
- Turned on logging, including Tensorboard output
- Log rewards
- Add a command line option for setting the number of CPUs
Code: Commit used …
An Overview of Multi-Task Learning in Deep Neural Networks
A long review of the use of DL in robotics
Deep Learning in Robotics: A Review of Recent Research
A long review of the use of DL in robotics
Eligibility Traces
Notes on using Eligibility Traces with neural networks
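As a rough illustration of what an eligibility trace does, here is a minimal sketch of TD(λ) with accumulating traces. It assumes a linear value function over hand-made feature vectors rather than a neural network, and the function name `td_lambda_episode` and its parameters are made up for this example, not taken from the notes.

```python
import numpy as np


def td_lambda_episode(features, rewards, weights, alpha=0.1, gamma=0.9, lam=0.8):
    """Run one episode of TD(lambda) for a linear value function v(s) = w . x(s).

    features: feature vectors for the states visited, one per step.
    rewards:  reward received after each of those states (same length).
    The state after the last step is treated as terminal with value 0.
    """
    w = np.array(weights, dtype=np.float64)
    trace = np.zeros_like(w)  # eligibility trace, one entry per weight
    for t, reward in enumerate(rewards):
        x = np.asarray(features[t], dtype=np.float64)
        is_last = t == len(rewards) - 1
        v = w @ x
        v_next = 0.0 if is_last else w @ np.asarray(features[t + 1], dtype=np.float64)
        delta = reward + gamma * v_next - v  # TD error
        trace = gamma * lam * trace + x      # decay old credit, add current state
        w = w + alpha * delta * trace        # update every recently visited state
    return w


# Two one-hot states, reward only at the end of the episode.
states = [[1.0, 0.0], [0.0, 1.0]]
w_traced = td_lambda_episode(states, [0.0, 1.0], [0.0, 0.0], alpha=0.5, gamma=1.0, lam=1.0)
w_one_step = td_lambda_episode(states, [0.0, 1.0], [0.0, 0.0], alpha=0.5, gamma=1.0, lam=0.0)
```

With `lam=1.0` the final reward also updates the first state's weight, while with `lam=0.0` only the last state is updated, which is the credit-assignment effect the trace is there for.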
One Model To Learn Them All
A single ML model used for very different tasks.
A simple neural network module for relational reasoning
Relationships between objects.
Questions and Intuition for Tackling Deep Learning Problems
Five questions to ask about your deep learning project.
Reinforcement Learning with Unsupervised Auxiliary Tasks
Speed up a reinforcement learning system with auxiliary tasks.
Attention and Augmented Recurrent Neural Networks
Overview (with references) of attention and several types of augmentation for RNNs.
Reinforcement Learning: An Introduction (2nd Edition)
In-progress second edition of an RL textbook.
Deep Reinforcement Learning with Double Q-learning
Improved Q-value estimation by reducing overestimates of Deep Q-networks.
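The core idea can be sketched in a few lines of NumPy: the online network picks the next action and the target network scores it, instead of taking the max over the target network's own estimates. This is only an illustration under my own assumptions. The function name `double_q_targets` and the array layout (one row per transition, one column per action) are not from the paper's code.

```python
import numpy as np


def double_q_targets(rewards, dones, q_online_next, q_target_next, gamma=0.99):
    """Compute Double Q-learning targets for a batch of transitions.

    q_online_next: Q-values of the next states from the online network.
    q_target_next: Q-values of the next states from the target network.
    Both have shape (batch, num_actions).
    """
    rewards = np.asarray(rewards, dtype=np.float64)
    dones = np.asarray(dones, dtype=bool)
    q_online_next = np.asarray(q_online_next, dtype=np.float64)
    q_target_next = np.asarray(q_target_next, dtype=np.float64)

    # Action selection uses the online network...
    best_actions = np.argmax(q_online_next, axis=1)
    # ...but the value of that action comes from the target network.
    evaluated = q_target_next[np.arange(len(best_actions)), best_actions]
    # Terminal transitions do not bootstrap from the next state.
    return rewards + gamma * (~dones) * evaluated


targets = double_q_targets(
    rewards=[1.0, 0.0],
    dones=[False, True],
    q_online_next=[[1.0, 2.0], [3.0, 0.0]],
    q_target_next=[[10.0, 5.0], [7.0, 8.0]],
    gamma=0.5,
)
```

In the first transition a plain max over the target network would bootstrap from 10.0, whereas the decoupled version uses 5.0, the target network's value for the action the online network prefers.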
Human-level control through deep reinforcement learning
One of the first deep reinforcement learning papers.
Motors
Motors and controllers and a breadboard
Off-Policy Actor-Critic
Off-Policy AC with linear state features. Includes eligibility traces.
Lego Chassis
Some progress on the hardware front.
The chassis really needs to be wider but I have a limited selection of Legos. I still don't have any motors or any way to control them but it's far enough along that I can try manually taking some images with the webcam and …
Endurance Test
Test how long the PowerGen battery can run MaLPi on a single charge.
I ran an endurance test with MaLPi, running the Pi, the webcam, a shell script that logged uptime every ten seconds, and the motion program (/usr/bin/motion) in an attempt to detect changes in the …
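The uptime logging part of a test like this could look something like the following. Note that this is a Python stand-in, not the shell script actually used, and the function names, log format and file path are all made up for the example. It assumes a Linux system where `/proc/uptime` is available.

```python
import os
import time


def read_uptime_seconds(path="/proc/uptime"):
    """Return seconds since boot (the first field of /proc/uptime on Linux)."""
    with open(path) as f:
        return float(f.read().split()[0])


def log_uptime(log_path, interval=10.0, samples=None,
               read_uptime=read_uptime_seconds, sleep=time.sleep):
    """Append a timestamp and the uptime to log_path every `interval` seconds.

    Runs forever when samples is None, otherwise stops after `samples` lines.
    The file is flushed and synced after each line so that the last entry
    is more likely to survive when the battery finally dies.
    """
    count = 0
    while samples is None or count < samples:
        with open(log_path, "a") as log:
            stamp = time.strftime("%Y-%m-%d %H:%M:%S")
            log.write(f"{stamp} {read_uptime():.0f}\n")
            log.flush()
            os.fsync(log.fileno())
        count += 1
        sleep(interval)
    return count
```

Calling `log_uptime("uptime.log")` would run until power is lost, and the last line of the log then gives the approximate run time on one charge.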
MaLPi Intro
MaLPi (Machine Learning Pi)
First, the hardware. This is my current setup and although I've tested each piece separately, I haven't had them all working together, yet.
- Raspberry Pi, model B v7 (or 0xf, I'm not sure how to read /proc/cpuinfo)
- 3D printed case (sorry, I can't remember where …
