Posts

Nov 3, 2021
Reversibility-Aware Reinforcement Learning via Self-supervision

Improving model-free RL performance on Sokoban using reversibility estimation.
Jan 22, 2018
Semi-supervised image classification via Temporal Ensembling

Getting over 98% accuracy on weakly-supervised MNIST.