https://platoaistream.net/plato-data/scaling-large-language-model-llm-training-with-amazon-ec2-trn1-ultraclusters/
Scaling Large Language Model (LLM) training with Amazon EC2 Trn1 UltraClusters