Optional blacklist VCF to for labeling artifacts in Permutect weak-labeling mode #9136

davidbenjamin · 2025-04-03T17:00:09Z

@LeeTL1220 Small change for Permutect data here, usually irrelevant. In most use cases, it will be possible to train with a GIAB truth VCF to label data. When there is not, this PR allows users to optionally specify a PoN as a hint that unlabeled data are probably actually artifacts.

This should not in any way be interpreted as Permutect using a PoN for variant calling. It has zero effect on our DREAM and Linseq evaluations, which use an NA12878 sample for training.

LeeTL1220

If you address these comments, then feel free to count this as an approval. No need for me to re-review.

src/main/java/org/broadinstitute/hellbender/tools/walkers/mutect/PermutectDatasetEngine.java

using the PoN to label things as artifacts in weak-labeling mode

804e98c

davidbenjamin requested a review from LeeTL1220 April 3, 2025 17:00

davidbenjamin assigned LeeTL1220 Apr 3, 2025

LeeTL1220 reviewed Apr 3, 2025

View reviewed changes

src/main/java/org/broadinstitute/hellbender/tools/walkers/mutect/PermutectDatasetEngine.java Outdated Show resolved Hide resolved

src/main/java/org/broadinstitute/hellbender/tools/walkers/mutect/PermutectDatasetEngine.java Outdated Show resolved Hide resolved

review edits

10ae5f0

LeeTL1220 approved these changes Apr 4, 2025

View reviewed changes

davidbenjamin merged commit fdd4333 into master Apr 4, 2025
20 checks passed

davidbenjamin deleted the db_permutect_training_blacklist branch April 4, 2025 13:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optional blacklist VCF to for labeling artifacts in Permutect weak-labeling mode #9136

Optional blacklist VCF to for labeling artifacts in Permutect weak-labeling mode #9136

Uh oh!

davidbenjamin commented Apr 3, 2025

Uh oh!

LeeTL1220 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Optional blacklist VCF to for labeling artifacts in Permutect weak-labeling mode #9136

Optional blacklist VCF to for labeling artifacts in Permutect weak-labeling mode #9136

Uh oh!

Conversation

davidbenjamin commented Apr 3, 2025

Uh oh!

LeeTL1220 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!