Delayed Bandits

This repository contains work done for a class project exploring bandit problems, in particular a formulation in which rewards are stochastically delayed. A writeup of the work can be found here.
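As a rough illustration of what "stochastically delayed" rewards mean, here is a minimal sketch. It is not taken from the project's code. The function name `run_delayed_bandit`, the epsilon-greedy learner, the Bernoulli arms, and the uniformly distributed delay are all assumptions chosen for brevity. The point it shows is that a reward is drawn when an arm is pulled, but the learner can only use it once a random number of steps has passed.

```python
import heapq
import random


def run_delayed_bandit(means, horizon, max_delay, epsilon=0.1, seed=0):
    """Hypothetical sketch: epsilon-greedy on Bernoulli arms with delayed feedback.

    Assumption: each reward becomes visible 1 + d steps after the pull,
    where d is drawn uniformly from {0, ..., max_delay}.
    """
    rng = random.Random(seed)
    n_arms = len(means)
    counts = [0] * n_arms    # number of rewards observed so far, per arm
    totals = [0.0] * n_arms  # sum of observed rewards, per arm
    pending = []             # min-heap of (arrival_step, arm, reward)

    for t in range(horizon):
        # Deliver every reward whose arrival step has been reached.
        while pending and pending[0][0] <= t:
            _, arm, reward = heapq.heappop(pending)
            counts[arm] += 1
            totals[arm] += reward

        # Pick an arm using only the feedback that has already arrived.
        if rng.random() < epsilon or 0 in counts:
            arm = rng.randrange(n_arms)  # explore (also while any arm is unobserved)
        else:
            arm = max(range(n_arms), key=lambda a: totals[a] / counts[a])

        # The reward is generated now but revealed only after a random delay.
        reward = 1.0 if rng.random() < means[arm] else 0.0
        delay = rng.randint(0, max_delay)
        heapq.heappush(pending, (t + 1 + delay, arm, reward))

    # Rewards still in the heap were never observed within the horizon.
    return counts, totals, len(pending)


if __name__ == "__main__":
    counts, totals, unseen = run_delayed_bandit([0.2, 0.8], horizon=500, max_delay=10)
    print("observed pulls per arm:", counts)
    print("rewards never observed:", unseen)
```

Setting `max_delay=0` gives a learner that sees each reward on the very next step, which is a convenient baseline for comparing how larger delays slow learning.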