llama-graph : fix text position for mrope #13159

ngxson · 2025-04-28T22:04:54Z

I misunderstood the original code:

        // TODO: add mrope pos ids somewhere else
        pos.resize(batch.n_tokens * 4);
        std::fill(pos.begin(), pos.end(), 0);
        for (int j = 0; j < batch.n_tokens * 3; j ++) {
            pos[j] = *st_pos_id + (j % batch.n_tokens);
        }
        batch.pos = pos.data();

This means the first 3 dims hold the same positions and the 4th dim is all zeros, something like this:

1234...1234...1234...0000...
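As a rough illustration of that layout, here is a minimal standalone sketch. It assumes the buffer is stored as 4 consecutive blocks of `n_tokens` entries each; `build_text_mrope_pos` is a hypothetical helper written for this example and is not a function in llama.cpp.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not part of llama.cpp): build the position buffer for a
// run of text tokens, laid out as 4 consecutive blocks of n_tokens entries.
// Blocks 0..2 carry the same sequential positions; block 3 is set to 0.
std::vector<int32_t> build_text_mrope_pos(int32_t start, int n_tokens) {
    assert(n_tokens >= 0);
    std::vector<int32_t> pos(static_cast<std::size_t>(n_tokens) * 4, 0);
    for (int dim = 0; dim < 4; ++dim) {
        for (int i = 0; i < n_tokens; ++i) {
            // Explicitly write every entry, including the zeros of the 4th block.
            pos[static_cast<std::size_t>(dim) * n_tokens + i] = dim < 3 ? start + i : 0;
        }
    }
    return pos;
}
```

Under these assumptions, `build_text_mrope_pos(1, 4)` yields `1 2 3 4 | 1 2 3 4 | 1 2 3 4 | 0 0 0 0`, which matches the pattern sketched above.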

Thanks @mattjcly for spotting this!

mattjcly · 2025-04-28T22:10:31Z

Can confirm this fixes bugs we observed! Thanks @ngxson :)

* llama-graph : fix text position for mrope
* fix typo
* explicitly set 4th dim in the loop

Change-Id: Iecca261107270aff9d26dd893c1039e355a05eb5

llama-graph : fix text position for mrope

418a31f

ngxson requested a review from ggerganov April 28, 2025 22:04

ngxson mentioned this pull request Apr 28, 2025

mtmd : add qwen2vl and qwen2.5vl #13141

Merged

ngxson added 2 commits April 29, 2025 00:07

fix typo

492345d

explicitly set 4th dim in the loop

bea76aa

ggerganov approved these changes Apr 29, 2025

ggerganov merged commit b6ce743 into ggml-org:master Apr 29, 2025
48 checks passed
