Skip to content

Pull requests: karpathy/llm.c

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Added packed layernorm_forward
#513 opened Jun 2, 2024 by ChrisDryden Loading… updated Jun 3, 2024
Removing B in encoder and replacing with calculated N in kernel
#520 opened Jun 2, 2024 by ChrisDryden Loading… updated Jun 2, 2024
Added kernel development file for permute_backwards
#477 opened May 27, 2024 by ChrisDryden Loading… updated Jun 1, 2024
Added additional layernorm forward kernel that does not recalculate mean and rstd
#506 opened May 31, 2024 by ChrisDryden Loading… updated May 31, 2024
Added constexpr for blocksizes to optimize compilation
#505 opened May 31, 2024 by ChrisDryden Loading… updated May 31, 2024
Added new cuda kernel for encoder forwards using three dimensional kernels
#459 opened May 25, 2024 by ChrisDryden Loading… updated May 27, 2024
Modified version of ademeure's fused gelu_forward kernel
#363 opened May 5, 2024 by ChrisDryden Loading… updated May 6, 2024
Experimenting with global instantiation for the layouts
#347 opened May 3, 2024 by ChrisDryden Draft updated May 3, 2024
Updated adamw to use packed data types
#303 opened Apr 30, 2024 by ChrisDryden Loading… updated May 2, 2024
ProTip! no:milestone will show everything without a milestone.