Muennighoff
Contributor

No description provided.

@@ -13,12 +13,7 @@

GRIT_LM_TRAINING_DATA = {
    **E5_TRAINING_DATA,  # source https://arxiv.org/pdf/2402.09906
    # also uses medi2 which contains fever and hotpotqa:
    "FEVER": ["train"],
Contributor

Doesn't it appear in Table 29?

Contributor Author

Sorry, fixed it. There was a bug where E5_TRAINING_DATA was being used for e5-mistral, so I reused that one here, but it should have been E5_MISTRAL_..
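To illustrate why the choice of base dict matters here, below is a minimal sketch of how `**` unpacking propagates every key of the base into the derived dict. All names and entries (`BASE_TRAINING_DATA`, `VARIANT_TRAINING_DATA`, `DatasetA`, and so on) are hypothetical placeholders, not the actual annotations in the repository.

```python
# Hypothetical placeholder dicts; names and entries are illustrative only
# and do not reflect the real training-data annotations.
BASE_TRAINING_DATA = {"DatasetA": ["train"], "DatasetB": ["train"]}
VARIANT_TRAINING_DATA = {"DatasetA": ["train"], "DatasetC": ["train"]}

# Unpacking copies every key of the chosen base into the derived dict,
# so picking the wrong base silently changes the derived annotations.
derived_from_base = {**BASE_TRAINING_DATA, "DatasetD": ["train"]}
derived_from_variant = {**VARIANT_TRAINING_DATA, "DatasetD": ["train"]}

# Keys that are present only because of which base dict was unpacked.
only_via_base = sorted(set(derived_from_base) - set(derived_from_variant))
only_via_variant = sorted(set(derived_from_variant) - set(derived_from_base))

print(only_via_base)     # ['DatasetB']
print(only_via_variant)  # ['DatasetC']
```

Comparing the key sets this way is one quick check that a derived dict inherits from the intended base.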

Contributor

Ah, great!

@KennethEnevoldsen changed the title from "Fix training data" to "Fix training data for gritlm" on Feb 4, 2025
@Muennighoff
Contributor Author

Good to merge this one?

@KennethEnevoldsen enabled auto-merge (squash) on February 7, 2025 09:47
@KennethEnevoldsen merged commit d810e4e into main on Feb 7, 2025 (9 checks passed)
@KennethEnevoldsen deleted the Muennighoff-patch-1 branch on February 7, 2025 09:59