-
Notifications
You must be signed in to change notification settings - Fork 13.5k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
common: Generalized XML-style tool-call parsing with streaming support (GLM 4.5/4.6 + MiniMax M2 + SeedOSS)
testing
Everything test related
#16932
opened Nov 2, 2025 by
hksdpc255
Loading…
hparams : add n_embd_inp() to support extended embed
examples
#16928
opened Nov 1, 2025 by
CISC
Loading…
vulkan: Fix GGML_VULKAN_CHECK_RESULTS to better handle fusion
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16919
opened Nov 1, 2025 by
jeffbolznv
Loading…
add TheRock HIP backend build instructions
documentation
Improvements or additions to documentation
#16915
opened Nov 1, 2025 by
lihaofd
Loading…
Vulkan: improve mul_mat_vec_iq1_m
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16907
opened Nov 1, 2025 by
lovedheart
Loading…
Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16900
opened Oct 31, 2025 by
0cc4m
Loading…
rpc: join small packets in changes relating to the ggml tensor library for machine learning
send_msg and recv_msg
ggml
ggml-cpu : optimize RVV q2_k and q3_k kernels
ggml
changes relating to the ggml tensor library for machine learning
#16887
opened Oct 31, 2025 by
xctan
Loading…
cann: update cross_entropy_loss op support
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#16886
opened Oct 31, 2025 by
TecJesh
Loading…
CUDA: fuse rope + set_rows
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16884
opened Oct 31, 2025 by
am17an
Loading…
Disable NUMA-specific chunking for high-core-count HPC systems
ggml
changes relating to the ggml tensor library for machine learning
#16882
opened Oct 31, 2025 by
rageshh-fj
Loading…
server: add support for local image path loading for server
examples
server
#16874
opened Oct 30, 2025 by
cchadowitz
Loading…
CANN: GGML_CANN_ACL_GRAPH works only if USE_ACL_GRAPH was enabled
Ascend NPU
issues specific to Ascend NPUs
documentation
Improvements or additions to documentation
#16861
opened Oct 30, 2025 by
rauletorresc
Loading…
cann: update L2_NORM op support
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#16856
opened Oct 30, 2025 by
TecJesh
Loading…
Enable CUDA graphs for embed gemma 300m
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16844
opened Oct 29, 2025 by
ArshM17-NV
Loading…
improve CUDA cpy memory bandwidth when copying transposed tensor
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#16841
opened Oct 29, 2025 by
bssrdf
Loading…
vulkan : refactor buffer handling in vk_op_f32
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#16840
opened Oct 29, 2025 by
Acly
Loading…
hip: add RDNA4 support for mmf and mma
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#16835
opened Oct 29, 2025 by
zhang-hui-yulo
•
Draft
docs: explain CUDA 11 compilation [no ci]
documentation
Improvements or additions to documentation
#16824
opened Oct 28, 2025 by
JohannesGaessler
Loading…
Implement SparseK Attention mechanism — new GGML operator with CPU backend (GPU planned next)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#16817
opened Oct 28, 2025 by
yael-works
Loading…
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.