-
Notifications
You must be signed in to change notification settings - Fork 466
Adding Turkish Movie Sentiment Dataset #389
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding Turkish Movie Sentiment Dataset #389
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add points too for the dataset. 2+4(6) points for yourself and 1 point for me for the review.
Thanks a lot for the changes!
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A few more small things around metadata. I have found mteb/abstasks/TaskMetadata.py
to be a helpful reference.
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
…ion.py Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>
…ion.py Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>
…ion.py Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>
…ion.py Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>
…ion.py Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for iterating. We are very close. Sorry that I missed a few things in the first round. Please add a tur/__init__.py
file as well.
Looks good otherwise!
Oh, and once you have merged updates from the main
branch, please add 2+4=6 points for yourself, and 1 point each for myself and @Sakshamrzt for the review.
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
…ion.py Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>
…ion.py Co-authored-by: Isaac Chung <chungisaac1217@gmail.com>
Updated everything according to reviews @isaac-chung |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
So close!
Could you please also resolve the merge conflicts by syncing your fork and merging in the updates from main?
mteb/tasks/Classification/tur/TurkishMovieSentimentClassification.py
Outdated
Show resolved
Hide resolved
Should be ok now |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@asparius Thanks for your contribution!
@isaac-chung, when merging datasets into main, please prefix it with "fix:" to ensure that the version is bumped. You can find more on this in the contributing guidelines |
Yep will do. Apologies! |
No worries, with the number of PR atm. it is unlikely to cause any issues |
Checklist for adding MMTEB dataset
Reason for dataset addition:
It is a collected Turkish movie sentiment dataset and it is the first classification dataset for Turkish. Publication of the dataset.
mteb
package.mteb run -m {model_name} -t {task_name}
command.sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
intfloat/multilingual-e5-small
make test
.make lint
.