llama : fix KV shift for qwen2vl #13870

ngxson · 2025-05-28T18:14:41Z

Given a model with 3 mrope sections, a rotated vector looks like this: [time, time+x, time+y]

This relies on the assumption that mrope with x == y == 0 is identical to a neox rope with theta = +time

For example, when we want to shift the time to time-1, that means shifting from [time, time+x, time+y] to [time-1, time-1+x, time-1+y]. We can simply apply a neox rope with theta = -1 to achieve the same effect.
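
The argument above can be checked numerically with a small toy model. This is only a sketch, not the ggml implementation: the helper names (`rotate_pairs`, `rope_neox`, `mrope`), the frequency base of 10000, the head size of 12 and the section sizes `[2, 2, 2]` are all assumptions made for illustration. It relies on the fact that rotations of the same pair compose additively, so applying one extra position delta to every pair moves all three sections by the same amount.

```python
import math

def rotate_pairs(vec, pair_positions, base=10000.0):
    """Rotate pair (i, i + n/2) by pair_positions[i] * base**(-2i/n) (NeoX-style pairing)."""
    n = len(vec)
    half = n // 2
    out = list(vec)
    for i in range(half):
        angle = pair_positions[i] * base ** (-2.0 * i / n)
        c, s = math.cos(angle), math.sin(angle)
        a, b = vec[i], vec[i + half]
        out[i] = a * c - b * s
        out[i + half] = a * s + b * c
    return out

def rope_neox(vec, pos):
    """Toy NeoX rope: every pair uses the same position."""
    return rotate_pairs(vec, [pos] * (len(vec) // 2))

def mrope(vec, section_positions, sections):
    """Toy mrope: each section of pairs uses its own position."""
    pair_positions = []
    for pos, count in zip(section_positions, sections):
        pair_positions.extend([pos] * count)
    assert len(pair_positions) == len(vec) // 2
    return rotate_pairs(vec, pair_positions)

def max_abs_diff(a, b):
    return max(abs(p - q) for p, q in zip(a, b))

vec = [0.1 * (k + 1) for k in range(12)]   # hypothetical un-rotated K vector
sections = [2, 2, 2]                       # hypothetical section sizes (in pairs)
time, x, y = 7, 3, 5

# What the cache holds: K rotated with [time, time+x, time+y].
cached = mrope(vec, [time, time + x, time + y], sections)

# KV shift by -1, applied as a plain NeoX rope on the cached vector.
shifted = rope_neox(cached, -1)

# What we want: K rotated with [time-1, time-1+x, time-1+y].
expected = mrope(vec, [time - 1, time - 1 + x, time - 1 + y], sections)

print(max_abs_diff(shifted, expected) < 1e-9)
```

The same helpers also show the base assumption: with x == y == 0, `mrope(vec, [time, time, time], sections)` matches `rope_neox(vec, time)` up to floating-point error.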

ggerganov

🪄

llama : fix KV shift for qwen2vl

9822f2c

ngxson requested a review from ggerganov May 28, 2025 18:14

add ref to the PR

3db4cb0

ggerganov approved these changes May 28, 2025

ngxson merged commit 763d06e into ggml-org:master May 28, 2025
41 of 46 checks passed
