Pull requests: karpathy/llm.c
Replaced parameter B with the total parameters calculated outside of the kernel
#521 opened Jun 2, 2024 by ChrisDryden
Removing B in encoder and replacing with calculated N in kernel
#520 opened Jun 2, 2024 by ChrisDryden
Added additional layernorm forward kernel that does not recalculate mean and rstd
#506 opened May 31, 2024 by ChrisDryden
Added constexpr for blocksizes to optimize compilation
#505 opened May 31, 2024 by ChrisDryden
Added kernel development file for permute_backwards
#477 opened May 27, 2024 by ChrisDryden
Added new cuda kernel for encoder forwards using three dimensional kernels
#459 opened May 25, 2024 by ChrisDryden
Modified version of ademeure's fused gelu_forward kernel
#363 opened May 5, 2024 by ChrisDryden
Experimenting with global instantiation for the layouts
#347 opened May 3, 2024 by ChrisDryden • Draft
gelu_backwards cuda dev file and float4 dtype for parallel memory read
#241 opened Apr 24, 2024 by ChrisDryden • Draft