Skip to content

Releases: ggml-org/llama.cpp

b6210

20 Aug 03:00
a094f38
Compare
Choose a tag to compare
musa: fix build warnings (#15258)

* musa: fix build warnings

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

* fix warning: comparison of integers of different signs: 'const int' and 'unsigned int' [-Wsign-compare]

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

---------

Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>

b6209

19 Aug 18:38
fb22dd0
Compare
Choose a tag to compare
opencl: mark `argsort` unsupported if cols exceed workgroup limit (#1…

b6208

19 Aug 17:28
9ef6b0b
Compare
Choose a tag to compare
model : add gpt-oss type strings (#15424)

b6207

19 Aug 17:18
1e19f5d
Compare
Choose a tag to compare
common : Add top-nsigma sampler to help globally (#15428)

Fixes #15423.

b6205

19 Aug 14:18
a6d3cfe
Compare
Choose a tag to compare
CANN: optimize rope operator (#15335)

* optimize rope ops

* amendment

* delete trailing whitespace

* change the variable name

b6204

19 Aug 11:23
67f09a3
Compare
Choose a tag to compare
musa: handle __hgt2_mask, available starting from MUSA SDK rc4.3.0 (#…

b6202

19 Aug 08:55
e9288e8
Compare
Choose a tag to compare
chat : clarify the meaning of reasoning_format (#15408)

* chat : clarify the meaning of reasoning_format

* add link to this PR

b6201

19 Aug 06:29
9d262f4
Compare
Choose a tag to compare
server : remove swa_full warning (#15399)

b6199

18 Aug 21:24
f08c4c0
Compare
Choose a tag to compare
mtmd : clean up clip_n_output_tokens (#15391)

b6195

18 Aug 17:58
baa9255
Compare
Choose a tag to compare
llama : merge conts and reshapes and remove unnecessary cont (#15380)

* remove unnecessary conts and merge reshapes

* restore necessary conts

* merge more conts and reshapes

* merge even more conts and reshapes