Pull requests: ggml-org/llama.cpp
Draft: #1776 making bos and eos available for user input
#1986 opened Jun 24, 2023 by HashemAlsaket (Draft)
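The PR above concerns letting users supply BOS/EOS tokens directly in their input. As a rough, hypothetical illustration of the general idea (not this PR's implementation), the sketch below replaces literal "<BOS>"/"<EOS>" markers in user text with special-token ids before passing the remaining text to a tokenizer; the marker strings, token ids, and tokenize_words helper are all invented for the example.

```cpp
// Hypothetical sketch: splice user-typed "<BOS>"/"<EOS>" markers into a token
// stream as special-token ids. The token ids and the word-level "tokenizer"
// are placeholders, not llama.cpp's real API.
#include <algorithm>
#include <iostream>
#include <sstream>
#include <string>
#include <vector>

static const int BOS_ID = 1;  // placeholder id for the beginning-of-sequence token
static const int EOS_ID = 2;  // placeholder id for the end-of-sequence token

// Stand-in for a real tokenizer: map each whitespace-separated word to a fake id.
static std::vector<int> tokenize_words(const std::string &text) {
    std::vector<int> ids;
    std::istringstream iss(text);
    std::string word;
    while (iss >> word) {
        ids.push_back(100 + (int) word.size());  // fake id derived from word length
    }
    return ids;
}

// Replace literal markers with special-token ids, tokenizing the text in between.
static std::vector<int> encode_with_markers(const std::string &input) {
    std::vector<int> out;
    size_t pos = 0;
    while (pos < input.size()) {
        size_t bos  = input.find("<BOS>", pos);
        size_t eos  = input.find("<EOS>", pos);
        size_t next = std::min(bos, eos);
        if (next == std::string::npos) {
            for (int id : tokenize_words(input.substr(pos))) out.push_back(id);
            break;
        }
        for (int id : tokenize_words(input.substr(pos, next - pos))) out.push_back(id);
        out.push_back(next == bos ? BOS_ID : EOS_ID);
        pos = next + 5;  // both markers are 5 characters long
    }
    return out;
}

int main() {
    for (int id : encode_with_markers("<BOS> hello world <EOS>")) {
        std::cout << id << ' ';
    }
    std::cout << '\n';  // prints: 1 105 105 2
    return 0;
}
```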
Added Arbitrary mixed quantization
Labels: Less than 4 bits (Efforts related to viable quantized models using <4 bits), research 🔬
#1834 opened Jun 13, 2023 by Milkdrop
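The "Less than 4 bits" label marks work toward usable sub-4-bit quantization. As a minimal sketch of the underlying idea only (not this PR's mixed-quantization scheme), the code below quantizes a small block of floats to signed 3-bit integers sharing one scale, then dequantizes them; the block contents, range, and rounding are chosen purely for illustration.

```cpp
// Minimal sketch of symmetric 3-bit block quantization: a block of floats is
// stored as signed 3-bit integers (-4..3) plus one shared float scale.
// The layout and rounding are illustrative, not llama.cpp's actual formats.
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <cstdio>
#include <vector>

struct Block3Bit {
    float scale;              // shared scale for the whole block
    std::vector<int8_t> q;    // quantized values in [-4, 3]
};

static Block3Bit quantize_block(const std::vector<float> &x) {
    float amax = 0.0f;
    for (float v : x) amax = std::max(amax, std::fabs(v));
    Block3Bit b;
    b.scale = amax / 4.0f;    // map the largest magnitude onto the 3-bit range
    for (float v : x) {
        int q = b.scale > 0.0f ? (int) std::lround(v / b.scale) : 0;
        q = std::max(-4, std::min(3, q));   // clamp to the signed 3-bit range
        b.q.push_back((int8_t) q);
    }
    return b;
}

static std::vector<float> dequantize_block(const Block3Bit &b) {
    std::vector<float> out;
    for (int8_t q : b.q) out.push_back(b.scale * q);
    return out;
}

int main() {
    std::vector<float> x = {0.10f, -0.42f, 0.31f, 0.05f, -0.17f, 0.26f, -0.33f, 0.40f};
    Block3Bit b = quantize_block(x);
    std::vector<float> y = dequantize_block(b);
    for (size_t i = 0; i < x.size(); ++i) {
        std::printf("%+.2f -> %+.2f\n", x[i], y[i]);
    }
    return 0;
}
```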
Create run.py
Labels: enhancement (New feature or request), obsolete? (Marker for potentially obsolete PR), python (python script changes), Review Complexity: Low (Trivial changes to code that most beginner devs, or those who want a break, can tackle, e.g. a UI fix), script (Script related)
#1204 opened Apr 27, 2023 by jdpsl
Add an option to force the end-of-text token to appear even in interactive mode, and also show loading percentage
#1058 opened Apr 19, 2023 by jeffersoncgo
Add command mode to interactive mode.
Labels: enhancement (New feature or request), Review Complexity: Medium (Generally requires more time to grok, but manageable at beginner to medium expertise level)
#1022 opened Apr 17, 2023 by wbpxre150
Run several single-threaded operators in parallel
Labels: threading (Parallel processing and thread management)
#850 opened Apr 8, 2023 by howard0su
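This entry is about scheduling independent single-threaded operators concurrently. Purely as a generic illustration of that idea (not ggml's actual graph scheduler), the sketch below launches two independent operators with std::async and joins them before a dependent step.

```cpp
// Generic sketch: run independent single-threaded operators concurrently and
// join before a dependent operator runs. This illustrates the idea only; it
// is not ggml's scheduler.
#include <functional>
#include <future>
#include <iostream>
#include <numeric>
#include <vector>

// A toy "operator": sums a vector on one thread.
static double op_sum(const std::vector<double> &v) {
    return std::accumulate(v.begin(), v.end(), 0.0);
}

int main() {
    std::vector<double> a(1'000'000, 1.0);
    std::vector<double> b(1'000'000, 2.0);

    // The two sums have no data dependency, so they can run in parallel.
    auto fa = std::async(std::launch::async, op_sum, std::cref(a));
    auto fb = std::async(std::launch::async, op_sum, std::cref(b));

    // The dependent operator waits for both results.
    double total = fa.get() + fb.get();
    std::cout << "total = " << total << '\n';  // 3,000,000
    return 0;
}
```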
Q4_0 scale selection using RMSE
Labels: enhancement (New feature or request), Less than 4 bits (Efforts related to viable quantized models using <4 bits), research 🔬, Review Complexity: High (Generally requires in-depth knowledge of LLMs or GPUs)
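RMSE-based scale selection means choosing a block's quantization scale by minimizing the root-mean-square error between the original and reconstructed values, rather than simply dividing the largest magnitude by the top of the quantized range. The sketch below illustrates that with a small grid search over candidate scales for a Q4_0-like signed 4-bit range; the candidate grid, guard values, and sample data are assumptions for illustration, not the PR's algorithm.

```cpp
// Sketch: pick a quantization scale by RMSE grid search instead of the naive
// amax/7 choice. The signed 4-bit range [-8, 7] mirrors a Q4_0-like format;
// the candidate grid is arbitrary and only for illustration.
#include <algorithm>
#include <cmath>
#include <cstdio>
#include <vector>

// RMSE of quantizing x with a given scale into the signed 4-bit range [-8, 7].
static double rmse_for_scale(const std::vector<float> &x, float scale) {
    double err2 = 0.0;
    for (float v : x) {
        int q = (int) std::lround(v / scale);
        q = std::max(-8, std::min(7, q));
        double d = v - scale * q;
        err2 += d * d;
    }
    return std::sqrt(err2 / x.size());
}

// Try scales around the naive amax-based choice and keep the one with lowest RMSE.
static float select_scale_rmse(const std::vector<float> &x) {
    float amax = 0.0f;
    for (float v : x) amax = std::max(amax, std::fabs(v));
    float naive = amax / 7.0f;
    if (naive <= 0.0f) return 1.0f;               // degenerate all-zero block
    float best = naive;
    double best_err = rmse_for_scale(x, naive);
    for (float f = 0.80f; f <= 1.20f; f += 0.01f) {   // +/-20% grid around naive
        float s = naive * f;
        double e = rmse_for_scale(x, s);
        if (e < best_err) { best_err = e; best = s; }
    }
    return best;
}

int main() {
    std::vector<float> x = {0.9f, -0.1f, 0.3f, -0.7f, 0.2f, 0.5f, -0.4f, 0.05f};
    float naive = 0.9f / 7.0f;                    // amax of x is 0.9
    float best  = select_scale_rmse(x);
    std::printf("naive scale %.4f -> rmse %.4f\n", naive, rmse_for_scale(x, naive));
    std::printf("best  scale %.4f -> rmse %.4f\n", best,  rmse_for_scale(x, best));
    return 0;
}
```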
Optimize locking behavior
Labels: threading (Parallel processing and thread management)
#813 opened Apr 6, 2023 by janekb04
Add "-e"/"--eval-threads" to distinguish thread counts for single-token eval and prompt eval
Labels: threading (Parallel processing and thread management)
#744 opened Apr 3, 2023 by MagisterLuddite (Draft)
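The proposed flag separates the thread count used for batched prompt evaluation from the one used for single-token generation, since the two phases parallelize very differently. As a generic sketch of that split (the struct, function names, and defaults are invented, and this is not the PR's code), the code below picks a thread count per step from the batch size.

```cpp
// Generic sketch: keep separate thread counts for prompt processing (many
// tokens per step) and single-token generation (one token per step), and pick
// one per evaluation call based on the batch size. Names are invented.
#include <cstdio>

struct eval_params {
    int n_threads_prompt = 8;  // threads for batched prompt evaluation
    int n_threads_gen    = 4;  // threads for single-token evaluation
};

// Choose the thread count for this evaluation step.
static int threads_for_batch(const eval_params &p, int n_tokens) {
    return n_tokens > 1 ? p.n_threads_prompt : p.n_threads_gen;
}

int main() {
    eval_params p;
    std::printf("prompt step (512 tokens): %d threads\n", threads_for_batch(p, 512));
    std::printf("generation step (1 token): %d threads\n", threads_for_batch(p, 1));
    return 0;
}
```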