
Pull requests: ggml-org/llama.cpp

Pull requests list

[SYCL] Support gpt-oss by OPs add-id, mul_mat for mxfp4, swiglu_oai
Labels: documentation, ggml, SYCL
#17826 opened Dec 6, 2025 by NeoZhangJianyu
cann : fix ops broken by circular padding guard
Labels: Ascend NPU, ggml
#17825 opened Dec 6, 2025 by CISC
cli: new CLI experience
Labels: devops, examples, script, server, testing
#17824 opened Dec 6, 2025 by ngxson Draft
1 of 5 tasks
llama : add token matching support to llama-grammar
Labels: testing
#17816 opened Dec 6, 2025 by aldehir Draft
3 tasks done
CANN: support gated linear attn
Labels: Ascend NPU, ggml
#17814 opened Dec 6, 2025 by YushengZhao
vulkan: faster q6_k matmul
Labels: ggml, Vulkan
#17813 opened Dec 6, 2025 by netrunnereve
model: support Rnj-1 model
Labels: model, python
#17811 opened Dec 6, 2025 by philip-essential
[DRAFT] CUDA: Improve performance via less synchronizations between token
Labels: ggml, Nvidia GPU
#17795 opened Dec 5, 2025 by aendk Draft
Make graph_max_nodes vary by ubatch size
#17794 opened Dec 5, 2025 by pwilkin
SOLVE_TRI extension to more dimensions
Labels: ggml, Nvidia GPU, testing
#17793 opened Dec 5, 2025 by pwilkin
ggml-cpu: add repack GEMM and GEMV for floating-point
Labels: ggml
#17791 opened Dec 5, 2025 by taimur-10x Draft
ggml-cpu: add ggml_thread_cpu_relax with Zihintpause support
Labels: ggml
#17784 opened Dec 5, 2025 by ixgbe
CANN : Optimize mul_mat_id quantization
Labels: Ascend NPU, ggml
#17782 opened Dec 5, 2025 by jjjxp03
sycl: add missing BF16 conversion support for Intel oneAPI
Labels: ggml, SYCL
#17780 opened Dec 5, 2025 by yingying0906
Add link to AshkanYarmoradi/go-llama.cpp
#17776 opened Dec 5, 2025 by AshkanYarmoradi