AMIT SHEKHAR’s Post

I just published a blog: Causal Masking in Attention - Intro: What is causal masking? - Problem: Seeing future tokens - Solution: Causal masking - Implementation: Masked attention matrix - Result: Zero attention to future tokens Read here: https://lnkd.in/gZqKcCgk #machinelearning #llm #deeplearning #ai #transformers

See more comments

To view or add a comment, sign in

Explore content categories