Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

convert_hf_to_gguf.py: refactor modify_tensors to call super python python script changes
#18866 opened Jan 15, 2026 by am17an Loading…
sampling : update outdated comment about has_sampled [no ci]
#18863 opened Jan 15, 2026 by danbev Loading…
wasm, tests: fix ctests with emscripten build Compilation issues ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#18861 opened Jan 15, 2026 by aviallon Draft
ggml-cpu: aarm64: q5_K repack gemm and gemv (and generic) implementations (i8mm) ggml changes relating to the ggml tensor library for machine learning
#18860 opened Jan 15, 2026 by Alcpz Loading…
ggml-cpu: add RVV vec dot kernels for quantization types ggml changes relating to the ggml tensor library for machine learning
#18859 opened Jan 15, 2026 by rehan-10xengineer Loading…
ggml-cpu: add q4_0 repack support for wasm ggml changes relating to the ggml tensor library for machine learning
#18858 opened Jan 15, 2026 by aviallon Draft
enforce response_format and json_schema for Kimi K2 testing Everything test related
#18851 opened Jan 15, 2026 by akoumjian Loading…
Deepseek v3.2 dense attention support from @fairydreaming python python script changes
#18849 opened Jan 14, 2026 by createthis Loading…
# [RFC] Integrate sparse-ternary-fma for TQ2_0 quantization ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#18836 opened Jan 14, 2026 by HyperFoldUK Loading…
vulkan: Revert forced full subgroup for FlashAttention ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#18831 opened Jan 14, 2026 by rillomas Draft
model: Add PaddleOCR-VL model support examples model Model specific python python script changes
#18825 opened Jan 14, 2026 by megemini Loading…
ggml-blas: hide warnings from included BLAS headers ggml changes relating to the ggml tensor library for machine learning
#18818 opened Jan 13, 2026 by DaAwesomeP Loading…
ggml-backend: Separate dynamic lib install and search paths, add relative search ggml changes relating to the ggml tensor library for machine learning
#18817 opened Jan 13, 2026 by DaAwesomeP Loading…
HIP: tune mmq/rocblas switching for RDNA4 ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#18816 opened Jan 13, 2026 by jiachengjason Loading…
sampling : remove sampling branching in output_reserve
#18811 opened Jan 13, 2026 by danbev Loading…
CANN: fix an issue where get_env was not fully renamed Ascend NPU issues specific to Ascend NPUs devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning
#18796 opened Jan 13, 2026 by noemotiovon Loading…
Unified delta net handling for Qwen3Next and Kimi Linear models model Model specific
#18792 opened Jan 12, 2026 by pwilkin Loading…
ggml-cpu: add RVV vec dot kernels for quantization types ggml changes relating to the ggml tensor library for machine learning
#18784 opened Jan 12, 2026 by taimur-10x Draft
vocab: add tokenizer support for jina-embeddings-v2-base-zh python python script changes
#18756 opened Jan 11, 2026 by o7si Draft
ProTip! Type g i on any issue or pull request to go back to the issue listing page.