Is there an existing issue / discussion for this?
- I have searched the existing issues / discussions
Is there an existing answer for this in the FAQ?
- I have searched the FAQ
Current Behavior
Over the last 3 months, I've been working with Qwen2.5-VL from Hugging Face's transformers.
There is a persistent performance drop caused by changes introduced in transformers: the 3D RoPE position embeddings are computed incorrectly. I have had to apply a lot of monkey patches to keep them correct.
I wonder, are you aware of this? Can you work with Hugging Face to solve the problem?
I can't even imagine how much impact this will have, given how many people are using a broken Qwen2.5-VL implementation.
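For readers unfamiliar with the workaround mentioned above, the monkey-patch pattern looks like the sketch below. The class, method, and the off-by-one bug here are hypothetical stand-ins for illustration only; they are not the actual transformers classes or the actual defect behind the Qwen2.5-VL regression.

```python
# Minimal sketch of class-level monkey patching. `RotaryEmbedding`
# and its bug are placeholders, not the real transformers code.

class RotaryEmbedding:
    """Stand-in for a library module computing inverse RoPE frequencies."""
    base = 10000.0

    def inv_freq(self, dim):
        # Hypothetical bug: exponents start at 1 instead of 0,
        # shifting every frequency.
        return [self.base ** (-i / dim) for i in range(1, dim + 1)]

def patched_inv_freq(self, dim):
    # Corrected version: exponents run 0, 1, ..., dim - 1,
    # so the first frequency is exactly 1.0.
    return [self.base ** (-i / dim) for i in range(dim)]

# Patch the class attribute before any model instantiates the module;
# all existing and future instances pick up the fixed method.
RotaryEmbedding.inv_freq = patched_inv_freq
```

The patch must run before model construction, otherwise any module that cached a bound reference to the old method will keep the buggy behavior.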
Issues raised by users:
- Qwen2.5VL is broken! huggingface/transformers#40154
- Qwen2.5-VL-7B-Instruct: Significant accuracy regression on MMMU benchmark with transformers >=4.54.0 huggingface/transformers#40136
- Qwen2.5-VL-7B-Instruct Accuracy Regression Still Persists in v4.56.2 huggingface/transformers#41180
PR attempt to fix: huggingface/transformers#40490
But according to the latest issue, huggingface/transformers#41180, the performance drop still exists.
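Until upstream fixes land, one stopgap (assuming the regression did enter in 4.54.0, as the issue titles above suggest) is to pin transformers below that release:

```shell
# Pin transformers to the last release before the reported regression.
pip install "transformers<4.54.0"
```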
Expected Behavior
Qwen2.5-VL should produce correct results at inference time.
Steps To Reproduce
No response
Environment
- OS:
- Python:
- Transformers:
- PyTorch:
- CUDA (`python -c 'import torch; print(torch.version.cuda)'`):
Anything else?
No response
YuRuiii