Pull requests: karpathy/llm.c

Pull requests list

Add SwiGLU support
#718 opened Jul 29, 2024 by gordicaleksa
Add SwiGLU support - llama3 feature branch
#755 opened Sep 13, 2024 by gordicaleksa
add llama 3 support to llm.c
#754 opened Sep 13, 2024 by karpathy Draft
Adamw thread coarsening kernel
#753 opened Sep 3, 2024 by saladpalad
Fix sizing typo in train_gpt2_fp32.cu
#748 opened Aug 25, 2024 by gajanan-choudhary
log with LINE and FILE for better addressing.
#746 opened Aug 22, 2024 by NEWPLAN
check libnccl instead of nccl to be more reliable
#742 opened Aug 14, 2024 by dengl11
[WIP] initial curand implementation for model init
#741 opened Aug 13, 2024 by ngc92
multi-threaded model initialization
#737 opened Aug 12, 2024 by ngc92
Add external KV to LLaMA 3
#734 opened Aug 10, 2024 by gordicaleksa
Add option to remove biases
#675 opened Jul 10, 2024 by gordicaleksa
Add RoPE positional encoding
#714 opened Jul 28, 2024 by gordicaleksa
Add high perf mode
#708 opened Jul 23, 2024 by gordicaleksa
Add KV cache for inference
#707 opened Jul 22, 2024 by gordicaleksa
Major FP32 llm.c improvements/refactoring/etc.
#696 opened Jul 18, 2024 by ademeure
Update README.md with prerequisite of libomp
#691 opened Jul 17, 2024 by nzhang
FP8 work in progress
#678 opened Jul 12, 2024 by ademeure Draft
Recompute mlp
#676 opened Jul 11, 2024 by ngc92