Pull requests: ggml-org/llama.cpp
Pull requests list
common : change --color to accept on/off/auto, default to auto
#17827
opened Dec 6, 2025 by
CISC
[SYCL] Support gpt-oss by OPs add-id, mul_mat for mxfp4, swiglu_oai
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17826
opened Dec 6, 2025 by
NeoZhangJianyu
cann : fix ops broken by circular padding guard
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#17825
opened Dec 6, 2025 by
CISC
CANN: support gated linear attn
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#17814
opened Dec 6, 2025 by
YushengZhao
vulkan: faster q6_k matmul
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#17813
opened Dec 6, 2025 by
netrunnereve
model: support Rnj-1
model
Model specific
python
python script changes
#17811
opened Dec 6, 2025 by
philip-essential
webui: Fix parsing non-LaTeX occurrences of \( or \)
examples
server
#17810
opened Dec 6, 2025 by
allozaur
[DRAFT] CUDA: Improve performance via less synchronizations between token
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
SOLVE_TRI extension to more dimensions
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#17793
opened Dec 5, 2025 by
pwilkin
ggml-cpu: add repack GEMM and GEMV for floating-point
ggml
changes relating to the ggml tensor library for machine learning
#17791
opened Dec 5, 2025 by
taimur-10x
Draft
ggml-cpu: add ggml_thread_cpu_relax with Zihintpause support
ggml
changes relating to the ggml tensor library for machine learning
#17784
opened Dec 5, 2025 by
ixgbe
CANN : Optimize mul_mat_id quantization
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#17782
opened Dec 5, 2025 by
jjjxp03
sycl: add missing BF16 conversion support for Intel oneAPI
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#17780
opened Dec 5, 2025 by
yingying0906
Move common_chat_parse and common_chat_peg_parse to chat-parser.h
examples
server
testing
Everything test related
#17772
opened Dec 4, 2025 by
sheldonrobinson
Add a search field on model selector / improve mobile display
examples
server
#17765
opened Dec 4, 2025 by
ServeurpersoCom