Skip to content

Updates for causal mask #40

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
Mar 21, 2025
Merged

Updates for causal mask #40

merged 2 commits into from
Mar 21, 2025

Conversation

awni
Copy link
Member

@awni awni commented Mar 20, 2025

Pre/Post for 4-bit Mistral 7B on M2 Utra

Prompt: 33898 tokens, 676.212 tokens-per-sec
Peak memory: 12.257 GB

Prompt: 33898 tokens, 869.279 tokens-per-sec
Peak memory: 9.111 GB

@awni awni requested review from angeloskath and jagrit06 March 20, 2025 23:47
Copy link
Member

@angeloskath angeloskath left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

@awni awni merged commit fd175f1 into main Mar 21, 2025
2 checks passed
@awni awni deleted the causal_mask branch March 21, 2025 15:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants