My reproduction of the paper "Deliberation in Latent Space via Differentiable Cache Augmentation"