I DMed my agent living on my cluster from my phone, and it fixed my GPT-2 reimplementation and pre-trained it on a local 4090 to surpass the real GPT-2 small.
Super cool result it got while I was building other things. pic.x.com/yKeu0YKsom