What is Transformer?
A deep learning architecture proposed by Google in 2017 that uses ‘attention mechanism’ to understand relationships between words. Nearly all modern LLMs (GPT, Claude, Gemini) are based on Transformers. It solved the old architectures’ inability to handle long texts.