Skip to content

Conversation

aittalam
Copy link
Member

What's changing

Dataset saving is triggered inside the JobService only on inference jobs which explicitly provide a True store_to_dataset config variable.

Within an inference + evaluation workflow, inference jobs are called with an explicit store_to_dataset=False and dataset saving is immediately enqueued afterwards to guarantee the dataset is present before evaluation.

If this PR is related to an issue or closes one, please link it here.

Closes #877

How to test it

Steps to test the changes:

  1. run a full workflow on a dataset with GT
  2. verify that there's exactly one dataset generated by the inference job
  3. run an annotation job on a dataset without GT
  4. verify that there's exactly one dataset generated by the inference job

I already...

  • Tested the changes in a working environment to ensure they work as expected
  • Added some tests for any new functionality: there are no new functionalities, it's just a bug fix.
  • Updated the documentation: no need for updated prod documentation, but I updated comments in code
  • Checked if a (backend) DB migration step was required and included it if required: none required

Dataset saving is triggered inside the JobService only on inference jobs which
explicitly provide a True `store_to_dataset` config variable.

Within an inference + evaluation workflow, inference jobs are called with an
explicit `store_to_dataset`=False and dataset saving is immediately enqueued
afterwards to guarantee the dataset is present before evaluation.
@aittalam aittalam linked an issue Feb 17, 2025 that may be closed by this pull request
1 task
@HareeshBahuleyan HareeshBahuleyan self-requested a review February 17, 2025 12:06
Copy link
Contributor

@HareeshBahuleyan HareeshBahuleyan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @aittalam for the fix! I tested the dataset creation and it works as per the steps 👍

@aittalam aittalam enabled auto-merge (squash) February 17, 2025 15:54
@aittalam aittalam merged commit b784985 into main Feb 17, 2025
15 checks passed
@aittalam aittalam deleted the 877-bug-double-datasets branch February 17, 2025 16:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG]: a new workflow always generates two new datasets
3 participants