Skip to content

Conversation

hehesangsj
Copy link

Motivation

Support InternVL2.5

Modifications

Based on PR #3351.

Support both InternLM2ForCausalLM & Qwen2ForCausalLM as language model.

Fix bugs in multi-nodes deployment

Checklist

@zhaochenyang20
Copy link
Collaborator

@hehesangsj thansk, please fix the conflicts and we can merge this ASAP.

@hehesangsj
Copy link
Author

@mickqian @zhaochenyang20 Could you review the code~

@zhaochenyang20
Copy link
Collaborator

@mickqian @zhaochenyang20 Could you review the code~

hey, is this #4456 (comment) provided by your team? Otherwise, please provide the benchmark score at #4456

Also, @yizhang2077 could you help to review? thanks so much!

Copy link
Collaborator

@yizhang2077 yizhang2077 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi, I leave some simple comments here, I will keep reviewing~ and plz format your code since lint failed.

has_flash_attn = False


class FlashAttention(nn.Module):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We provide some attention code in vision.py, could you try to use it instead of writing attention again?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it's a little bit hard because the num_heads of InternViT is 25. Using VisionAttention can easily leads to bugs when deploying using --tp 8

@mickqian
Copy link
Collaborator

mickqian commented Mar 19, 2025

@zhaochenyang20 @yizhang2077 #3351 These are duplicated

@zhaochenyang20
Copy link
Collaborator

@zhaochenyang20 @yizhang2077 #3351 These are duplicated

I let them decouple and I will credit you! thanks ! @mickqian

@zhaochenyang20
Copy link
Collaborator

@hehesangsj fix the lint and solve the conflicts?

@zhaochenyang20
Copy link
Collaborator

@hehesangsj do you sovle all the reviews above? if so, please ping us and we can continue to review. thanks a lot

@hehesangsj
Copy link
Author

@hehesangsj do you sovle all the reviews above? if so, please ping us and we can continue to review. thanks a lot

okay, I will solve the reviews ASAP.

@hehesangsj
Copy link
Author

@hehesangsj do you sovle all the reviews above? if so, please ping us and we can continue to review. thanks a lot

reviews are fixed and latest main branch is merged. check the code plz~

@zhaochenyang20
Copy link
Collaborator

@yizhang2077 yi and @mickqian could you help to review?

@Fridge003 Fridge003 mentioned this pull request Apr 7, 2025
2 tasks
@xiaomin-D xiaomin-D mentioned this pull request Apr 13, 2025
5 tasks
@github-actions github-actions bot closed this May 30, 2025
Copy link
Contributor

This pull request has been automatically closed due to inactivity. Please feel free to reopen it if needed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants