Tower of Bleyddyn ap Rhys

Migrating From GitHub to Codeberg

Published: Thu 03 April 2025
By Bleyddyn

tags: Codeberg Github Pelican

Brief description of migrating a static website from GitHub to Codeberg
read more
On the link between conscious function and general intelligence in humans and machines

Published: Fri 01 July 2022
By Bleyddyn

Paper

tags: ML Papers

Access consciousness and its relation to general intelligence
read more
Fastai Transforms for DonkeyCar

Published: Sun 13 March 2022
By Bleyddyn

tags: malpi fastai augmentation ML

Fastai transforms don't seem to help DonkeyCar training
read more
Effect of Dropout Layers in a VAE

Published: Tue 14 January 2020
By Bleyddyn

tags: malpi vae ML

Dropout layers make VAE output worse
read more
Endgame Speculation

Published: Tue 02 April 2019
By Bleyddyn

tags: marvel movies

My speculation on how Avengers: Endgame could go
read more
Stable Baselines Algorithms

Published: Sun 03 February 2019
By Bleyddyn

tags: baselines RL ML

Table of algorithms in the Stable Baselines repo
read more
Unicorn: Continual learning with a universal, off-policy agent

Published: Thu 19 July 2018
By Bleyddyn

Paper

tags: ML Papers RL

Continual learning with a universal, off-policy agent.
read more
Sample-Efficient Deep RL with Generative Adversarial Tree Search

Published: Wed 27 June 2018
By Bleyddyn

Paper

tags: ML Papers RL

Learned dynamics model with a GAN for image generation and MCTS for planning.
read more
Learning Real-World Robot Policies by Dreaming

Published: Mon 25 June 2018
By Bleyddyn

Paper

tags: ML Papers

Unsupervised learning of image encoding, dynamics and reward models.
read more
Unsupervised Predictive Memory in a Goal-Directed Agent

Published: Thu 12 April 2018
By Bleyddyn

Paper

tags: ML Papers

Unsupervised training of a memory that is used for prediction of state and reward.
read more
World Models

Published: Mon 09 April 2018
By Bleyddyn

Paper

tags: ML Papers RL

Unsupervised learning of image encoding and dynamics model.
read more
First Real DAgger Results

Published: Mon 12 March 2018
By Bleyddyn

tags: malpi dagger ML

Using DAgger to improve MaLPi's training, while MaLPi is driving
read more
Initial DAgger Results

Published: Tue 27 February 2018
By Bleyddyn

tags: malpi dagger ML

Using DAgger to improve MaLPi's training
read more
First (Second) Fully Autonomous Full Lap

Published: Sat 10 February 2018
By Bleyddyn

tags: MaLPi Robot

One of MaLPi's first fully autonomous laps
read more
First attempts at Hyperparameter Optimization

Published: Sat 03 February 2018
By Bleyddyn

tags: numpy malpi hyperparameters hyperopt ML

Hyperparameter optimization using hyperopt on racetrack data.
read more
Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations

Published: Thu 01 February 2018
By Bleyddyn

Paper

tags: ML Papers RL

Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations.
read more
Normalizing image data before training - GRU version

Published: Sun 24 December 2017
By Bleyddyn

tags: numpy malpi gru ML

Testing image normalization when using a GRU.
read more
Normalizing image data before training - LSTM version

Published: Thu 21 December 2017
By Bleyddyn

tags: numpy malpi lstm ML

Testing image normalization when using an LSTM.
read more
Tensorizing LSTMs

Published: Mon 18 December 2017
By Bleyddyn

Paper

tags: ML Papers RNN

Tensorizing LSTMs to make them wider and deeper without adding parameters and with minimal extra compute costs.
read more
Normalizing image data before training

Published: Wed 13 December 2017
By Bleyddyn

tags: numpy malpi ML

I had completely forgotten to normalize the images I'm feeding into MaLPi's network, so I thought I'd try to be a bit more formal about it than my usual.
read more
Mastering the game of Go without human knowledge

Published: Thu 26 October 2017
By Bleyddyn

Paper

tags: ML Papers RL

AlphaGo Zero, all RL self-play.
read more
Getting a Keras LSTM layer to work on MaLPi

Published: Thu 19 October 2017
By Bleyddyn

tags: keras lstm malpi ML

Training with batch sizes and/or sequence lengths greater than one, while still being able to run one image at a time on the robot.
read more
Pre-training Neural Networks with Human Demonstrations for Deep Reinforcement Learning

Published: Thu 14 September 2017
By Bleyddyn

Paper

tags: ML Papers RL

Pre-train using supervised learning on human-provided demonstrations.
read more
Experimenting with OpenAI's Baselines code

Published: Tue 05 September 2017
By Bleyddyn

tags: openai baselines model_search ML

I forked OpenAI's Baselines code and made a few changes. This was my first full run before I started playing around with the model architecture.

Changes from OpenAI's version:
- Turned on logging, including TensorBoard output
- Logged rewards
- Added a command-line option for setting the number of CPUs

Code: Commit used …
read more
An Overview of Multi-Task Learning in Deep Neural Networks

Published: Thu 17 August 2017
By Bleyddyn

Paper

tags: ML Papers MultiTask

An overview of multi-task learning in deep neural networks.
read more
Deep Learning in Robotics: A Review of Recent Research

Published: Tue 15 August 2017
By Bleyddyn

Paper

tags: ML Papers

A long review of the use of DL in robotics
read more
Eligibility Traces

Published: Mon 17 July 2017
By Bleyddyn

Paper

tags: ML Papers RL

Notes on using Eligibility Traces with neural networks
read more
One Model To Learn Them All

Published: Fri 16 June 2017
By Bleyddyn

Paper

tags: ML Papers

A single ML model used for very different tasks.
read more
A simple neural network module for relational reasoning

Published: Mon 05 June 2017
By Bleyddyn

Paper

tags: ML Papers

Relationships between objects.
read more
Questions and Intuition for Tackling Deep Learning Problems

Published: Tue 09 May 2017
By Bleyddyn

Paper

tags: ML Papers

Five questions to ask about your deep learning project.
read more
Reinforcement Learning with Unsupervised Auxiliary Tasks

Published: Wed 16 November 2016
By Bleyddyn

Paper

tags: ML Papers RL

Speed up a reinforcement learning system with auxiliary tasks.
read more
Attention and Augmented Recurrent Neural Networks

Published: Thu 08 September 2016
By Bleyddyn

Paper

tags: ML Papers RNN

Overview (with references) of attention and several types of augmentation for RNNs.
read more
Reinforcement Learning: An Introduction (2nd Edition)

Published: Thu 01 September 2016
By Bleyddyn

Paper

tags: ML RL

In-progress second edition of an RL textbook.
read more
Deep Reinforcement Learning with Double Q-learning

Published: Tue 08 December 2015
By Bleyddyn

Paper

tags: ML Papers RL

Improved Q-value estimation by reducing overestimates of Deep Q-networks.
read more
Human-level control through deep reinforcement learning

Published: Thu 26 February 2015
By Bleyddyn

Paper

tags: ML Papers RL

One of the first deep reinforcement learning papers.
read more
Motors

Published: Sat 24 August 2013
By Bleyddyn

tags: MaLPi Robot hardware

Motors and controllers and a breadboard
read more
Off-Policy Actor-Critic

Published: Thu 20 June 2013
By Bleyddyn

Paper

tags: ML Papers RL

Off-Policy AC with linear state features. Includes eligibility traces.
read more
Lego Chassis

Published: Wed 27 March 2013
By Bleyddyn

tags: MaLPi Robot hardware

Some progress on the hardware front.

The chassis really needs to be wider but I have a limited selection of Legos. I still don't have any motors or any way to control them but it's far enough along that I can try manually taking some images with the webcam and …
read more
Endurance Test

Published: Mon 18 March 2013
By Bleyddyn

tags: MaLPi Robot hardware

Test how long the PowerGen battery can run MaLPi on a single charge.

I ran an endurance test with MaLPi, running the Pi, the webcam, a shell script that logged uptime every ten seconds, and the motion program (/usr/bin/motion) in an attempt to detect changes in the …
read more
MaLPi Intro

Published: Sun 17 March 2013
By Bleyddyn

tags: MaLPi Robot ML

MaLPi (Machine Learning Pi)

First, the hardware. This is my current setup, and although I've tested each piece separately, I haven't had them all working together yet.
- Raspberry Pi, model B v7 (or 0xf, I'm not sure how to read /proc/cpuinfo)
- 3D printed case (sorry, I can't remember where …
read more
Horde: A Scalable Real-time Architecture for Learning Knowledge from Unsupervised Sensorimotor Interaction

Published: Mon 02 May 2011
By Bleyddyn

Paper

tags: ML Papers

Using RL value functions to encode semantic knowledge, specifically by a robot.
read more