https://www.arxiv-summary.com/posts/2303.04245/
How Do Transformers Learn Topic Structure: Towards a Mechanistic Understanding