Releases: ggml-org/llama.cpp
Releases · ggml-org/llama.cpp
b6210
musa: fix build warnings (#15258) * musa: fix build warnings Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> * fix warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare] Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com> --------- Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
b6209
opencl: mark `argsort` unsupported if cols exceed workgroup limit (#1…
b6208
model : add gpt-oss type strings (#15424)
b6207
common : Add top-nsigma sampler to help globally (#15428) Fixes #15423.
b6205
CANN: optimize rope operator (#15335) * optimize rope ops * amendment * delete trailing whitespace * change the variable name
b6204
musa: handle __hgt2_mask, available starting from MUSA SDK rc4.3.0 (#…
b6202
chat : clarify the meaning of reasoning_format (#15408) * chat : clarify the meaning of reasoning_format * add link to this PR
b6201
server : remove swa_full warning (#15399)
b6199
mtmd : clean up clip_n_output_tokens (#15391)
b6195
llama : merge conts and reshapes and remove unnecessary cont (#15380) * remove unnecessary conts and merge reshapes * restore necessary conts * merge more conts and reshapes * merge even more conts and reshapes