Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

benches : add folder with benchmarks
#16931 opened Nov 2, 2025 by ggerganov Loading…
Add initial devcontainer configuration
#16926 opened Nov 1, 2025 by FXJEFE Loading…
vulkan: Fix GGML_VULKAN_CHECK_RESULTS to better handle fusion ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#16919 opened Nov 1, 2025 by jeffbolznv Loading…
add TheRock HIP backend build instructions documentation Improvements or additions to documentation
#16915 opened Nov 1, 2025 by lihaofd Loading…
Vulkan: improve mul_mat_vec_iq1_m ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#16907 opened Nov 1, 2025 by lovedheart Loading…
Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#16900 opened Oct 31, 2025 by 0cc4m Loading…
rpc: join small packets in send_msg and recv_msg ggml changes relating to the ggml tensor library for machine learning
#16892 opened Oct 31, 2025 by jukofyork Draft
ggml-cpu : optimize RVV q2_k and q3_k kernels ggml changes relating to the ggml tensor library for machine learning
#16887 opened Oct 31, 2025 by xctan Loading…
cann: update cross_entropy_loss op support Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#16886 opened Oct 31, 2025 by TecJesh Loading…
CUDA: fuse rope + set_rows ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16884 opened Oct 31, 2025 by am17an Loading…
Disable NUMA-specific chunking for high-core-count HPC systems ggml changes relating to the ggml tensor library for machine learning
#16882 opened Oct 31, 2025 by rageshh-fj Loading…
CANN: GGML_CANN_ACL_GRAPH works only if USE_ACL_GRAPH was enabled Ascend NPU issues specific to Ascend NPUs documentation Improvements or additions to documentation
#16861 opened Oct 30, 2025 by rauletorresc Loading…
cann: update L2_NORM op support Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#16856 opened Oct 30, 2025 by TecJesh Loading…
Enable CUDA graphs for embed gemma 300m ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16844 opened Oct 29, 2025 by ArshM17-NV Loading…
improve CUDA cpy memory bandwidth when copying transposed tensor ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs testing Everything test related
#16841 opened Oct 29, 2025 by bssrdf Loading…
vulkan : refactor buffer handling in vk_op_f32 ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#16840 opened Oct 29, 2025 by Acly Loading…
hip: add RDNA4 support for mmf and mma ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#16835 opened Oct 29, 2025 by zhang-hui-yulo Draft
docs: explain CUDA 11 compilation [no ci] documentation Improvements or additions to documentation
#16824 opened Oct 28, 2025 by JohannesGaessler Loading…
Implement SparseK Attention mechanism — new GGML operator with CPU backend (GPU planned next) ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#16817 opened Oct 28, 2025 by yael-works Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.