Tower of Bleyddyn ap Rhys

Mastering the game of Go without human knowledge

Published: Thu 26 October 2017
By Bleyddyn

Paper

tags: ML Papers RL

AlphaGo Zero, all RL self-play.
read more
Getting a Keras LSTM layer to work on MaLPi

Published: Thu 19 October 2017
By Bleyddyn

tags: keras lstm malpi ML

Training on batch sizes and/or sequence lengths longer than one, while still being able to run one image at a time on the robot.
read more
Pre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning

Published: Thu 14 September 2017
By Bleyddyn

Paper

tags: ML Papers RL

Pre-train using supervised learning on human-provided demonstrations.
read more
Experimenting with OpenAI's Baselines code

Published: Tue 05 September 2017
By Bleyddyn

tags: openai baselines model_search ML

I forked OpenAI's Baselines code and made a few changes. This was my first full run, before I started experimenting with the model architecture.

Changes from OpenAI's version:

* Turned on logging, including TensorBoard output
* Log rewards
* Added a command-line option for setting the number of CPUs
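As a rough illustration of the last change, a command-line option for the CPU count might look like the sketch below. The flag name `--num-cpu`, the `parse_args` helper, and the default value are assumptions made for this example, not taken from the fork.

```python
import argparse
import multiprocessing


def parse_args(argv=None):
    """Parse command-line options (hypothetical helper, not from the fork)."""
    parser = argparse.ArgumentParser(description="Training run options")
    # Assumed flag name and default; the real fork may differ.
    parser.add_argument(
        "--num-cpu",
        type=int,
        default=multiprocessing.cpu_count(),
        help="number of CPUs (parallel workers) to use",
    )
    args = parser.parse_args(argv)
    if args.num_cpu < 1:
        parser.error("--num-cpu must be at least 1")
    return args


if __name__ == "__main__":
    print(parse_args().num_cpu)
```

Defaulting to the machine's CPU count keeps the behavior reasonable when the flag is omitted.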

Code: Commit used …
read more
An Overview of Multi-Task Learning in Deep Neural Networks

Published: Thu 17 August 2017
By Bleyddyn

Paper

tags: ML Papers MultiTask

An overview of multi-task learning in deep neural networks.
read more
Deep Learning in Robotics: A Review of Recent Research

Published: Tue 15 August 2017
By Bleyddyn

Paper

tags: ML Papers

A long review of the use of DL in robotics
read more
Eligibility Traces

Published: Mon 17 July 2017
By Bleyddyn

Paper

tags: ML Papers RL

Notes on using Eligibility Traces with neural networks
read more
One Model To Learn Them All

Published: Fri 16 June 2017
By Bleyddyn

Paper

tags: ML Papers

A single ML model used for very different tasks.
read more
A simple neural network module for relational reasoning

Published: Mon 05 June 2017
By Bleyddyn

Paper

tags: ML Papers

Relationships between objects.
read more
Questions and Intuition for Tackling Deep Learning Problems

Published: Tue 09 May 2017
By Bleyddyn

Paper

tags: ML Papers

Five questions to ask about your deep learning project.
read more
Reinforcement Learning with Unsupervised Auxiliary Tasks

Published: Wed 16 November 2016
By Bleyddyn

Paper

tags: ML Papers RL

Increase the learning speed of a reinforcement learning system with auxiliary tasks.
read more
Attention and Augmented Recurrent Neural Networks

Published: Thu 08 September 2016
By Bleyddyn

Paper

tags: ML Papers RNN

Overview (with references) of attention and several types of augmentation for RNNs.
read more
Reinforcement Learning: An Introduction (2nd Edition)

Published: Thu 01 September 2016
By Bleyddyn

Paper

tags: ML RL

In-progress second edition of an RL textbook.
read more
Deep Reinforcement Learning with Double Q-learning

Published: Tue 08 December 2015
By Bleyddyn

Paper

tags: ML Papers RL

Improved Q-value estimation by reducing overestimates of Deep Q-networks.
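A minimal sketch of the idea, assuming plain Python lists of per-action Q-values for the next state: the online network picks the action and the target network scores it, whereas standard DQN takes the maximum of the target network's own values. The function and parameter names are illustrative only.

```python
def dqn_target(reward, gamma, q_target_next, done=False):
    """Standard DQN target: the target network both selects and evaluates."""
    if done:
        return reward
    return reward + gamma * max(q_target_next)


def double_q_target(reward, gamma, q_online_next, q_target_next, done=False):
    """Double Q-learning target: the online network selects the action,
    the target network evaluates it."""
    if done:
        return reward
    # Index of the greedy action according to the online network.
    best_action = max(range(len(q_online_next)), key=q_online_next.__getitem__)
    return reward + gamma * q_target_next[best_action]


if __name__ == "__main__":
    q_online = [1.0, 3.0, 2.0]   # made-up values for illustration
    q_target = [4.0, 2.0, 10.0]  # made-up values for illustration
    print(dqn_target(1.0, 0.5, q_target))
    print(double_q_target(1.0, 0.5, q_online, q_target))
```

Because the selecting and evaluating networks differ, a single overestimated value in the target network is less likely to be picked up by the max.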
read more
Human-level control through deep reinforcement learning

Published: Thu 26 February 2015
By Bleyddyn

Paper

tags: ML Papers RL

One of the first deep reinforcement learning papers.
read more
Motors

Published: Sat 24 August 2013
By Bleyddyn

tags: MaLPi Robot hardware

Motors and controllers and a breadboard
read more
Off-Policy Actor-Critic

Published: Thu 20 June 2013
By Bleyddyn

Paper

tags: ML Papers RL

Off-Policy AC with linear state features. Includes eligibility traces.
read more
Lego Chassis

Published: Wed 27 March 2013
By Bleyddyn

tags: MaLPi Robot hardware

Some progress on the hardware front.

The chassis really needs to be wider, but I have a limited selection of Legos. I still don't have any motors or any way to control them, but it's far enough along that I can try manually taking some images with the webcam and …
read more
Endurance Test

Published: Mon 18 March 2013
By Bleyddyn

tags: MaLPi Robot hardware

Test how long the PowerGen battery can run MaLPi on a single charge.

I ran an endurance test with MaLPi, running the Pi, the webcam and a shell script that logged uptime every ten seconds and the motion program (/usr/bin/motion) in an attempt to detect changes in the …
read more
MaLPi Intro
Published: Sun 17 March 2013
By Bleyddyn

tags: MaLPi Robot ML

MaLPi (Machine Learning Pi)

First, the hardware. This is my current setup, and although I've tested each piece separately, I haven't had them all working together yet.
- Raspberry Pi, model B v7 (or 0xf, I'm not sure how to read /proc/cpuinfo)
- 3D printed case (sorry, I can't remember where …
read more
