Pull requests: auroralabs-loci/llama.cpp
Pull requests list
UPSTREAM PR #21870: common: skip reasoning budget sampler when no budget is requested
#1349
opened Apr 14, 2026 by
loci-dev
UPSTREAM PR #21821: llama : add --hugepages for HugeTLB-backed weight loading (Linux)
#1347
opened Apr 13, 2026 by
loci-dev
UPSTREAM PR #21554: hexagon: optimization for HMX mat_mul
#1346
opened Apr 12, 2026 by
loci-dev
UPSTREAM PR #21787: vulkan: fix output corruption on GCN 2.0/3.0 (Vulkan 1.2)
#1345
opened Apr 12, 2026 by
loci-dev
UPSTREAM PR #21753: vulkan: Support asymmetric FA in coopmat2 path
#1344
opened Apr 11, 2026 by
loci-dev
UPSTREAM PR #21344: gfx1151 nwarps, tile sizing to curb VGPR pressure
#1342
opened Apr 10, 2026 by
loci-dev
UPSTREAM PR #21431: vulkan: Tweak Xe2 warptile configuration
#1341
opened Apr 10, 2026 by
loci-dev
UPSTREAM PR #21597: SYCL: fix multi-GPU system RAM exhaustion by using Level Zero allocations
#1340
opened Apr 8, 2026 by
loci-dev
7 tasks done
UPSTREAM PR #21421: mtmd: add Gemma 4 audio conformer encoder support
#1336
opened Apr 6, 2026 by
loci-dev
9 tasks done
UPSTREAM PR #21216: common : simplify autoparser tagged parser rules
#1335
opened Apr 6, 2026 by
loci-dev
UPSTREAM PR #21187: llama-server: translating structured generation request parameters from responses API format to completions API format
#1333
opened Apr 5, 2026 by
loci-dev
UPSTREAM PR #21391: [SYCL] Add BF16 support to GET_ROWS operation
#1332
opened Apr 4, 2026 by
loci-dev
UPSTREAM PR #21405: vendor : update cpp-httplib to 0.40.1
#1331
opened Apr 4, 2026 by
loci-dev
UPSTREAM PR #21331: docs: build.md / HSA_OVERRIDE_GFX_VERSION does not exist on Windows
#1330
opened Apr 3, 2026 by
loci-dev
UPSTREAM PR #21315: ggml-zendnn : add MUL_MAT_ID op support for MoE models
#1329
opened Apr 3, 2026 by
loci-dev
UPSTREAM PR #21245: model : refactor QKV into common build_qkv and create_tensor_qkv helpers
#1328
opened Apr 2, 2026 by
loci-dev
UPSTREAM PR #20831: cuda : dynamic MMVQ nwarps for narrow matrices
#1327
opened Apr 2, 2026 by
loci-dev
UPSTREAM PR #21283: [SYCL] fix llama_kv_cache hang when kv_cache is huge: 5GB
#1326
opened Apr 2, 2026 by
loci-dev
UPSTREAM PR #21242: fix: tool call parsing for LFM2 and LFM2.5 models
#1325
opened Apr 1, 2026 by
loci-dev