Describir: Exploring optimal control of epidemic spread using reinforcement learning