Pull requests: ggml-org/llama.cpp
Pull requests list
feat: add --threads-all option to llama-bench
examples
#19971
opened Feb 28, 2026 by
hobostay
4 tasks done
server: batch checkpoints to support kvcache context truncation
examples
server
#19970
opened Feb 28, 2026 by
aagit
vendor : update cpp-httplib to 0.35.0
python
python script changes
script
Script related
#19969
opened Feb 28, 2026 by
angt
Fix logic for retrieving schema items in json_schema_to_grammar.py
examples
python
python script changes
#19968
opened Feb 28, 2026 by
RayXu14
cuda: fix ggml_cuda_cpy crash on partial GPU offload
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#19966
opened Feb 28, 2026 by
yossiovadia
ggml webgpu: fix workgroup dispatch limit for large batch sizes
ggml
changes relating to the ggml tensor library for machine learning
#19965
opened Feb 28, 2026 by
abhijitramesh
Use fp32 in cuBLAS V100 to avoid overflows, env variables to override cuBLAS compute type
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#19959
opened Feb 27, 2026 by
wallentri88
llama : add native param2moe architecture support
model
Model specific
python
python script changes
#19958
opened Feb 27, 2026 by
iambhuvan
tools : enable kvu in perplexity for hellaswag, winogrande, multiple-choice
examples
#19954
opened Feb 27, 2026 by
angt
scripts : improve get-wikitext-2.sh
script
Script related
#19952
opened Feb 27, 2026 by
angt
webui: use date in more human readable exported filename
examples
server
#19939
opened Feb 26, 2026 by
woof-dog
scripts: ini_to_opencode.py
python
python script changes
script
Script related
#19938
opened Feb 26, 2026 by
am17an
fix dots.ocr: correct RoPE sections and FFN tensor mapping
examples
python
python script changes
#19936
opened Feb 26, 2026 by
anthony-maio
1 of 2 tasks
tool parser: add GigaChatV3/3.1 models support in PEG format
testing
Everything test related
#19931
opened Feb 26, 2026 by
Mishusha
metal: add CONV_3D
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#19927
opened Feb 26, 2026 by
Ra5hidIslam
llama/ggml: multi-GPU pipeline parallelism (xdev host staging) + faster model loading
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#19922
opened Feb 26, 2026 by
mxxm-t
ggml-cuda: add mem check for fusion
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#19916
opened Feb 26, 2026 by
am17an
vendors: update miniaudio library to 0.11.24
python
python script changes
script
Script related
#19914
opened Feb 26, 2026 by
data-man
test-backend-ops: allow loading tests from JSON and parsing model operators into JSON
examples
testing
Everything test related
#19896
opened Feb 25, 2026 by
0cc4m
[ggml-quants] Add memsets and other fixes for IQ quants
ggml
changes relating to the ggml tensor library for machine learning
#19861
opened Feb 24, 2026 by
bartowski1182
server : add default-model preset and fallback logic
examples
server
#19855
opened Feb 24, 2026 by
mikhail-shevtsov-wiregate
ggml-webgpu: Support non-contiguous src0 and overlapping src0/src1 in binary ops
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#19850
opened Feb 24, 2026 by
yomaytk