Tag: transformer architecture
Tamara Weed, Jun, 17 2026
Explore how Query, Key, and Value projections work in LLM attention mechanisms. Understand what these matrices learn during training and how they enable context-aware processing in transformer models.
Categories:
Tags:
Tamara Weed, May, 25 2026
A technical walkthrough of Transformer architecture, explaining self-attention, multi-head mechanisms, and how LLMs process and generate text efficiently.
Categories:
Tags:
Tamara Weed, May, 21 2026
Explore the differences between sinusoidal and learned positional encoding in Transformers. Learn why modern LLMs favor RoPE and ALiBi for better long-context performance.
Categories:
Tags:
Tamara Weed, May, 20 2026
Explore the technical evolution of Generative AI, from early Markov chains and LSTMs to the transformer revolution. Understand the architectural shifts, key milestones, and future challenges shaping modern AI systems.
Categories:
Tags:
Tamara Weed, May, 15 2026
Explore how transformer architecture evolved since 2017. From RoPE embeddings to SwiGLU activation, discover the key innovations driving modern LLM efficiency and accuracy.
Categories:
Tags:
Tamara Weed, Apr, 23 2026
Learn how Large Language Models use token prediction and probability distributions to generate text, from the softmax function to decoding strategies like Top-P and Temperature.
Categories:
Tags:
Tamara Weed, Mar, 26 2026
Learn how positional encoding solves the word order problem in Transformers. We explore absolute, relative, and rotary methods, recent research findings, and future trends.
Categories:
Tags:






