Delayed Bandits

This repository holds work that was done as part of a class project, exploring bandits in a formulation where the rewards are stochastically delayed (and therefore aggregated). A writeup of the work can be found here.