Posts
-
Reversibility-Aware Reinforcement Learning via Self-supervision
Improving model-free RL performance on Sokoban using reversibility estimation.
-
Semi-supervised image classification via Temporal Ensembling
Getting over 98% accuracy on weakly-supervised MNIST.