Pull requests: ggml-org/llama.cpp

Pull requests list

feat: add --threads-all option to llama-bench examples
#19971 opened Feb 28, 2026 by hobostay
4 tasks done
vendor : update cpp-httplib to 0.35.0 (labels: python, script)
#19969 opened Feb 28, 2026 by angt
Fix logic for retrieving schema items in json_schema_to_grammar.py (labels: examples, python)
#19968 opened Feb 28, 2026 by RayXu14
cuda: fix ggml_cuda_cpy crash on partial GPU offload (labels: ggml, Nvidia GPU)
#19966 opened Feb 28, 2026 by yossiovadia
ggml webgpu: fix workgroup dispatch limit for large batch sizes (labels: ggml)
#19965 opened Feb 28, 2026 by abhijitramesh
Use fp32 in cuBLAS V100 to avoid overflows, env variables to override cuBLAS compute type (labels: ggml, Nvidia GPU)
#19959 opened Feb 27, 2026 by wallentri88
llama : add native param2moe architecture support (labels: model, python)
#19958 opened Feb 27, 2026 by iambhuvan
scripts : improve get-wikitext-2.sh (labels: script)
#19952 opened Feb 27, 2026 by angt
[New quant] Q3_PT (labels: examples, ggml, python)
#19941 opened Feb 26, 2026 by pwilkin Draft
scripts: ini_to_opencode.py (labels: python, script)
#19938 opened Feb 26, 2026 by am17an
fix dots.ocr: correct RoPE sections and FFN tensor mapping (labels: examples, python)
#19936 opened Feb 26, 2026 by anthony-maio
1 of 2 tasks
common : update completion executables list [no ci]
#19934 opened Feb 26, 2026 by danbev
tool parser: add GigaChatV3/3.1 models support in PEG format (labels: testing)
#19931 opened Feb 26, 2026 by Mishusha
metal: add CONV_3D (labels: Apple Metal, ggml)
#19927 opened Feb 26, 2026 by Ra5hidIslam
llama/ggml: multi-GPU pipeline parallelism (xdev host staging) + faster model loading (labels: ggml, Nvidia GPU)
#19922 opened Feb 26, 2026 by mxxm-t
ggml-cuda: add mem check for fusion (labels: ggml, Nvidia GPU)
#19916 opened Feb 26, 2026 by am17an
vendors: update miniaudio library to 0.11.24 (labels: python, script)
#19914 opened Feb 26, 2026 by data-man
[ggml-quants] Add memsets and other fixes for IQ quants (labels: ggml)
#19861 opened Feb 24, 2026 by bartowski1182
ggml-webgpu: Support non-contiguous src0 and overlapping src0/src1 in binary ops (labels: ggml, testing)
#19850 opened Feb 24, 2026 by yomaytk
server : add chat truncation to keep chat going (labels: examples, python, server, testing)
#19841 opened Feb 23, 2026 by ltoniazzi
5 of 6 tasks