Pull requests: ggml-org/llama.cpp

Pull requests list

memory: Hybrid context shift [labels: examples]
#17009 opened Nov 4, 2025 by gabe-l-hart
sync : ggml [labels: ggml, script]
#17008 opened Nov 4, 2025 by ggerganov
sampling : add support for GPU sampling (wip) [labels: testing]
#17004 opened Nov 4, 2025 by danbev Draft
4 tasks
Q4/Q8 Tiled Gemm Optimization. [labels: ggml]
#16999 opened Nov 4, 2025 by shalinib-ibm
kleidiai: add optimized per-channel kernels for Q8_0 [labels: ggml]
#16993 opened Nov 4, 2025 by chaxu01
CUDA: add stream-based concurrency [labels: ggml, Nvidia GPU]
#16991 opened Nov 4, 2025 by am17an Draft
2 tasks
CUDA: fix crash on uneven context [labels: ggml, Nvidia GPU, testing]
#16988 opened Nov 4, 2025 by JohannesGaessler
ggml-hexagon: graceful fallback for older socs where rpcmem_alloc2 and FASTRPC_GET_URI is unsupported [labels: ggml]
#16987 opened Nov 4, 2025 by l3utterfly Draft
Add circular tiling support to conv2d and pad, for Vulkan, CUDA, and CPU (used for making seamless textures) [labels: ggml, Nvidia GPU, testing, Vulkan]
#16985 opened Nov 4, 2025 by Phylliida
Mamba2 SSD [labels: Apple Metal, examples, ggml, Nvidia GPU, testing]
#16982 opened Nov 3, 2025 by gabe-l-hart Draft
vulkan: Use spec constants for conv2d s/d/p and kernel W/H [labels: ggml, Vulkan]
#16978 opened Nov 3, 2025 by jeffbolznv
vulkan: fuse rms_norm + mul + rope (+ view + set_rows) [labels: ggml, testing, Vulkan]
#16977 opened Nov 3, 2025 by jeffbolznv
sycl: flash-attention implementation [labels: ggml, SYCL]
#16969 opened Nov 3, 2025 by ye-NX
s390x: disable vxe for cross-compilation by default [labels: ggml]
#16966 opened Nov 3, 2025 by AlekseiNikiforovIBM
CUDA: add implicit conv3d [labels: ggml, Nvidia GPU, testing]
#16948 opened Nov 2, 2025 by bssrdf
Model: Minimax M2 - chat support [labels: testing]
#16946 opened Nov 2, 2025 by pwilkin
Model: add openPangu-Embedded [labels: model, python]
#16941 opened Nov 2, 2025 by Lpzhan931
Add e2e tests for embedding raw flag [labels: devops, examples, python, testing]
#16940 opened Nov 2, 2025 by SamMalayek Draft
doc: Windows + clang/ninja build guide format cleanup [labels: documentation]
#16939 opened Nov 2, 2025 by jsjtxietian