karpathy / llm.c Public

Fork 3.1k
Star 26.8k

Pull requests: karpathy/llm.c

121 Open 488 Closed

Pull requests list

Add SwiGLU support

#718 opened Jul 29, 2024 by gordicaleksa

Add RoPE positional encoding - llama3 feature branch

#756 opened Sep 13, 2024 by gordicaleksa

Add SwiGLU support - llama3 feature branch

#755 opened Sep 13, 2024 by gordicaleksa

add llama 3 support to llm.c

#754 opened Sep 13, 2024 by karpathy • Draft

Adamw thread coarsening kernel

#753 opened Sep 3, 2024 by saladpalad

Fix sizing typo in train_gpt2_fp32.cu

#748 opened Aug 25, 2024 by gajanan-choudhary

log with LINE and FILE for better addressing.

#746 opened Aug 22, 2024 by NEWPLAN

Re: Fixed modal script for updated cudnn version, and read errors

#743 opened Aug 14, 2024 by vyom1611

check libnccl instead of nccl to be more reliable

#742 opened Aug 14, 2024 by dengl11

[WIP] initial curand implementation for model init

#741 opened Aug 13, 2024 by ngc92

multi-threaded model initialization

#737 opened Aug 12, 2024 by ngc92

Add external KV to LLaMA 3

#734 opened Aug 10, 2024 by gordicaleksa

Faster GELU forward & backward using MUFU.TANH for SM7.5+

#721 opened Jul 31, 2024 by ademeure

Add option to remove biases

#675 opened Jul 10, 2024 by gordicaleksa

Add RoPE positional encoding

#714 opened Jul 28, 2024 by gordicaleksa

Outlier detection: catch more outliers by not updating moving average with skipped updates

#711 opened Jul 25, 2024 by ademeure • Draft

Add high perf mode

#708 opened Jul 23, 2024 by gordicaleksa

Add KV cache for inference

#707 opened Jul 22, 2024 by gordicaleksa

add batch limit to 124m script to prevent infinite loop

#704 opened Jul 20, 2024 by varun-a10ai

Simplified/faster "backward bias" kernel (column reduction)

#699 opened Jul 19, 2024 by ademeure

Major FP32 llm.c improvements/refactoring/etc.

#696 opened Jul 18, 2024 by ademeure

Update README.md with prerequisite of libomp

#691 opened Jul 17, 2024 by nzhang

demo how to track activations without too much boilerplate code

#679 opened Jul 12, 2024 by ngc92 • Draft

FP8 work in progress

#678 opened Jul 12, 2024 by ademeure • Draft

Recompute mlp

#676 opened Jul 11, 2024 by ngc92
