Ongoing research training transformer models at scale