I just published a blog: Causal Masking in Attention

- Intro: What is causal masking?
- Problem: Seeing future tokens
- Solution: Causal masking
- Implementation: Masked attention matrix
- Result: Zero attention to future tokens

Read here: https://lnkd.in/gZqKcCgk

#machinelearning #llm #deeplearning #ai #transformers
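For a quick feel of the "masked attention matrix" step before clicking through, here is a minimal NumPy sketch. It is an illustration under my own assumptions, not the code from the blog: the function name `causal_attention_weights`, the 4-token toy size, and the random scores are all hypothetical.

```python
import numpy as np

def causal_attention_weights(scores):
    """Mask future positions in a (T, T) score matrix, then softmax each row.

    Hypothetical helper for illustration; row i is the query position,
    column j is the key position.
    """
    T = scores.shape[0]
    # True strictly above the diagonal: token i must not attend to token j > i
    future = np.triu(np.ones((T, T), dtype=bool), k=1)
    # Future positions get -inf so they become exactly 0 after softmax
    masked = np.where(future, -np.inf, scores)
    # Numerically stable softmax; the diagonal is never masked, so each row max is finite
    masked = masked - masked.max(axis=-1, keepdims=True)
    exp = np.exp(masked)
    return exp / exp.sum(axis=-1, keepdims=True)

# Toy example: random raw attention scores for a 4-token sequence (assumed values)
rng = np.random.default_rng(0)
scores = rng.normal(size=(4, 4))
weights = causal_attention_weights(scores)
print(np.round(weights, 3))
```

Every entry above the diagonal of `weights` comes out as zero and each row still sums to 1, which is the "zero attention to future tokens" result from the outline.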
Nice