[Falcon H1] Fix slow path forward pass #38320

dhiaEddineRhaiem · 2025-05-23T11:55:39Z

This PR:

Adds the MuP multipliers to the forward pass of the slow path (torch_forward).
Fixes the repeat pattern in the Mamba heads (repeat -> repeat_interleave); see "Fix Mamba2 Grouped SSD Support in the torch_forward Path" #37533.
@ArthurZucker
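To illustrate the repeat -> repeat_interleave change described above, here is a minimal sketch in plain Python (no torch dependency). The helpers `tile` and `interleave` are hypothetical stand-ins for the behaviour of `Tensor.repeat` and `torch.repeat_interleave` along a single dimension, and the group and head counts are made-up values. It assumes that consecutive heads are meant to share a group, which is what the switch to repeat_interleave suggests.

```python
def tile(groups, times):
    # Stand-in for Tensor.repeat along one dim: the whole sequence is copied
    # `times` times, so group indices cycle (0, 1, 0, 1, ...).
    return list(groups) * times


def interleave(groups, times):
    # Stand-in for torch.repeat_interleave: each element is repeated `times`
    # times in place, so each group stays contiguous (0, 0, 1, 1, ...).
    return [g for g in groups for _ in range(times)]


# Hypothetical sizes chosen only for illustration.
n_groups, n_heads = 2, 4
heads_per_group = n_heads // n_groups
groups = list(range(n_groups))

tiled = tile(groups, heads_per_group)
interleaved = interleave(groups, heads_per_group)

# Index i of each list is the group that head i would read from.
print("repeat            ->", tiled)
print("repeat_interleave ->", interleaved)
```

With the tiled layout, head 1 reads from group 1; with the interleaved layout it reads from group 0. The two expansions have the same shape, so a mix-up like this changes the outputs without raising an error.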


vasqu

LGTM! Checking in with the slow tests :)

vasqu · 2025-05-26T09:06:32Z

run-slow: falcon_h1

cc @ydshieh anything I should do differently?

ydshieh · 2025-05-26T09:31:00Z

@vasqu It's triggered, but failed at a step (reply to comment) and the tests are not run.

https://github.com/huggingface/transformers/actions/runs/15250152202/job/42884998108

I'm not sure why, but I will re-run it and see how it goes.

github-actions · 2025-05-26T09:31:39Z

This comment contains run-slow, running the specified jobs:

models: ['models/falcon_h1']
quantizations: [] ...

github-actions · 2025-05-26T09:33:18Z

This comment contains run-slow, running the specified jobs:

models: ['models/falcon_h1']
quantizations: [] ...

ydshieh · 2025-05-26T09:34:07Z

triggered

https://github.com/huggingface/transformers/actions/runs/15250152202

younesbelkada · 2025-05-26T09:38:21Z

The slow tests failed, but I'm not sure this PR caused the failure: we only changed the slow path, which is not executed on GPUs. Perhaps they have been failing since the beginning?

younesbelkada · 2025-05-26T09:42:08Z

We're not able to extract the expected text from the logs, and we don't have access to T4 GPUs. Is there a way to extract the output from the logs?

ydshieh · 2025-05-26T10:01:50Z

I can update the expected value this afternoon

dhiaEddineRhaiem · 2025-05-26T10:14:55Z

I pushed a fix to EXPECTED_TEXT in the integration test.
Could you please rerun the slow-path test?

vasqu · 2025-05-26T10:18:36Z

run-slow: falcon_h1

github-actions · 2025-05-26T10:19:59Z

This comment contains run-slow, running the specified jobs:

models: ['models/falcon_h1']
quantizations: [] ...

ydshieh · 2025-05-26T12:50:55Z

remote: Permission to younesbelkada/transformers.git denied to ydshieh.

@younesbelkada you don't love @ydshieh anymore ? You owe me instructblip ....

younesbelkada · 2025-05-26T12:55:13Z

ahaha sorry let me give you access now ! yes I owe you that one ... which took so long to fix together

ydshieh · 2025-05-26T12:55:32Z

or younesbelkada#7

update

ydshieh · 2025-05-26T12:58:49Z

The updated value works for T4. No more need to run-slow

Thank you for the effort 🙏

ArthurZucker

🤗

younesbelkada · 2025-05-26T13:00:08Z

Thank you very much @ydshieh and HF team for your continuous support ! 🚀

ydshieh · 2025-05-26T13:00:35Z

wait don't merge yet.

I will check again and merge once ok

dhiaEddineRhaiem · 2025-05-26T13:01:14Z

Thanks, everyone, for your help

ydshieh · 2025-05-26T13:11:26Z

younesbelkada#8

😰

* update
* update

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

* Create push-important-models.yml
* feat: add falcon-h1
* fixup
* address comment
* fix
* fix copies
* fix copies
* fix
* fix
* fix
* fix
* fix copies
* fix
* fix copies
* fix test import to at least trigget the cis
* yups
* update
* fix make fix copies
* fix inits?
* fix style
* skip annoying test
* add integration test for Falcon H1
* fix copies
* fix
* fix typo
* make style
* fix slow path generations
* clean debug traces
* debug
* remove debug traces final confirmation
* clean debug traces final
* fix format and lineup
* make style
* debug
* Update src/transformers/models/falcon_h1/modular_falcon_h1.py Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>
* adress comments
* fix fix-copies
* fix integration test
* Merge pull request huggingface#7 from ydshieh/fix-slow-path update
* another update (huggingface#8)
  * update
  * update
  ---------
  Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

---------

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Younes Belkada <younesbelkada@gmail.com>
Co-authored-by: younesbelkada <younes.belkada@tii.ae>
Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

younesbelkada and others added 30 commits February 23, 2024 07:33

Create push-important-models.yml

91c72e7

merge

580c7f8

Merge remote-tracking branch 'upstream/main'

7e7810a

feat: add falcon-h1

ed2f8f3

fixup

303a7f8

address comment

6f292cf

fix

6688c9e

fix copies

e044445

fix copies

b167ede

fix

df485a3

fix

332b143

fix

f3c21a8

fix

387e4af

fix copies

efe9108

fix

1c6a4c5

fix copies

250ca80

fix test import to at least trigget the cis

a62e45b

yups

2178c00

update

7c2c331

fix make fix copies

c1162ae

fix inits?

817f146

fix style

f1257e3

skip annoying test

184491d

add integration test for Falcon H1

e2493d8

fix copies

a3dbbe4

Merge branch 'add-falcon-h1' of https://github.com/younesbelkada/transformers into add-falcon-h1

e4dcb70

fix

a4d5141

fix typo

0a30bee

Merge branch 'main' into add-falcon-h1

4392b31

make style

e542fc1

fix fix-copies

30efac4

dhiaEddineRhaiem requested a review from vasqu May 24, 2025 14:37

vasqu approved these changes May 26, 2025


fix integration test

588da11

Merge pull request #7 from ydshieh/fix-slow-path

c2b59bd

update

ArthurZucker approved these changes May 26, 2025


another update (#8)

35ee36a

* update
* update

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ydshieh enabled auto-merge (squash) May 26, 2025 13:24

ydshieh disabled auto-merge May 26, 2025 13:30

ydshieh merged commit 7a9b071 into huggingface:main May 26, 2025
12 checks passed
