Commit Graph

7366 Commits (cb4a50b1478bb54c19ae57703733e4f330bd7a2f)

Author SHA1 Message Date
Ryan Poplin cb4a50b147 Adding ability to try both small and large kmer lengths. Highest likelihood wins. 2011-09-17 16:42:49 -04:00
Ryan Poplin 981b78ea50 Changing the VQSR command line syntax back to the parsed tags approach. This cleans up the code and makes sure we won't be parsing the same rod file multiple times. I've tried to update the appropriate qscripts. 2011-09-12 12:17:43 -04:00
Ryan Poplin 60ebe68aff Fixing issue in VariantEval in which insertion and deletion events weren't treated symmetrically. Added new option to require strict allele matching. 2011-09-12 09:43:23 -04:00
Ryan Poplin 07d365ce39 Fixing units in queue job report Gantt plots 2011-09-12 09:01:34 -04:00
Ryan Poplin 09050a01db Adding qscript to run the HaplotypeCaller in parallel 2011-09-11 22:53:40 -04:00
Ryan Poplin 30be6d8bd6 Removing the copy of the original assembler since it has been successfully assimilated 2011-09-11 21:00:27 -04:00
Guillermo del Angel 9344938360 Uncomment code to add deleted bases covering an indel to per-sample genotype reporting, update integration tests accordingly 2011-09-10 19:41:01 -04:00
Guillermo del Angel b399424a9c Fix integration test affected by non-calling all-zero PL samples, and add a more complicated multi-sample integration test from a phase 1 case, GBR with mixed technologies and complex input alleles 2011-09-09 20:44:47 -04:00
Guillermo del Angel e95d484757 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-09-09 18:31:14 -04:00
Guillermo del Angel a807205fc3 a) Minor optimization to softMax() computation to avoid redundant operations, results in about 5-10% increase in speed in indel calling.
b) Added (but left commented out since it may affect integration tests and to isolate commits) fix to per-sample DP reporting, so that deletions are included in count.
c) Bug fix to avoid having non-reference genotypes assigned to samples with PL=0,0,0. Correct behavior should be to no-call these samples, and to ignore these samples when computing AC distribution since their likelihoods are not informative.
2011-09-09 18:00:23 -04:00
Mauricio Carneiro 9e650dfc17 Fixing SelectVariants documentation
getting rid of messages telling users to go for the YAML file. The idea is to not support these anymore.
2011-09-09 16:25:31 -04:00
Mauricio Carneiro 7f9000382e Making indel calls default in the MDCP
You can turn off indel calling by using -noIndels.
2011-09-09 14:09:26 -04:00
Ryan Poplin 1953edcd2d updating Validate Variants deletion integration test 2011-09-09 13:39:08 -04:00
Ryan Poplin 9ada9b3ed4 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-09-09 13:15:36 -04:00
Ryan Poplin 354529bff3 adding Validate Variants integration test with a deletion 2011-09-09 13:15:24 -04:00
Ryan Poplin 91c949db74 Fixing ValidateVariants so that it validates deletion records. Fixing GATKdocs. 2011-09-09 12:57:14 -04:00
David Roazen 6bd8a53efd Fix nasty bug involving the build report generation when multiple test targets are specified on the same command line.
Ant immutable properties: sometimes your friend, often your enemy.
2011-09-09 12:13:10 -04:00
David Roazen 60a3618845 Added "alltests" build targets.
To run the same set of tests as the bamboo "All Tests" plan (unit tests,
integration tests, and pipeline tests):

ant alltests

To do the same as above on only the public portion of the codebase:

ant alltests.public
2011-09-09 12:13:10 -04:00
Eric Banks 51eb95d638 Missed these tests before 2011-09-09 11:46:37 -04:00
Eric Banks b318fcba35 Added ability to profile the VCF output system too. Also, don't make the index test required. 2011-09-09 11:13:11 -04:00
Eric Banks 6ad8943ca0 CompOverlap no longer keeps track of the number of comp sites since it wasn't (and cannot) keeping track of them correctly. 2011-09-09 09:45:24 -04:00
Khalid Shakir 510d5e7730 Merged bug fix from Stable into Unstable 2011-09-09 01:34:55 -04:00
Khalid Shakir 367bbee25a Fixed typo when printing the contents or last N lines of a file. Thanks to larryns. 2011-09-09 01:33:25 -04:00
David Roazen 388c9a9c55 Enable public-only tests.
Public-only tests will allow us to check for runtime public -> private
dependencies when bamboo updates the github repository (currently, we
only check for *compile-time* public -> private dependencies).

To compile/run only public tests, append ".public" to the name of an existing
test target:

ant test.public
ant integrationtest.public
ant performancetest.public
ant pipelinetest.public
ant pipelinetestrun.public
2011-09-08 15:58:42 -04:00
Eric Banks eaaba6eb51 Confirming that when stratifying by sample in VE the monomorphic sites for a given sample are not counted for the relevant metrics. Adding integration test to cover it. 2011-09-08 13:17:34 -04:00
Mark DePristo c204a08319 Merge branch 'cancer' 2011-09-08 13:10:37 -04:00
Mark DePristo beabb67669 Working version of AssignSomaticStatus
-- Called 2400 somatic mutations (~ right number) when calling NA12878 vs. her parents.
-- Now ready for testing in the 100 T/N pairs
2011-09-08 13:09:26 -04:00
Mark DePristo 5edc8f8578 Moved to private package (intended home) 2011-09-08 11:54:55 -04:00
Mark DePristo 7557f4a03a AssignSomaticStatus, now with the correct mathematical model 2011-09-08 11:54:14 -04:00
Ryan Poplin 2636d216de Adding indel vqsr integration test 2011-09-08 10:38:13 -04:00
Ryan Poplin 9cba1019c8 Another fix for genotype given alleles for indels. Expanding the indel integration tests to include multiallelics and indel records that overlap 2011-09-08 09:25:13 -04:00
Ryan Poplin e0020b2b29 Fixing PrintRODs. Now has input and only prints out one copy of each record 2011-09-08 08:58:37 -04:00
Mark DePristo 6e6bf796d5 first version of somatic detector 2011-09-08 08:48:02 -04:00
Ryan Poplin 29c968ab60 clean up 2011-09-08 08:42:43 -04:00
Ryan Poplin 59841f8232 Fixing genotype given alleles for indels. Only take the records that start at this locus. 2011-09-08 08:41:16 -04:00
Ryan Poplin 7f9047aec1 updating parameters in assembler to decrease runtime 2011-09-07 20:35:08 -04:00
Ryan Poplin 7ba6c29d32 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-09-07 19:18:24 -04:00
Ryan Poplin 0e8536f5f3 Adding the ability to add multiple kmer lengths in the debruijn graph. 2011-09-07 19:17:45 -04:00
Guillermo del Angel 45d54f6258 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-09-07 16:49:49 -04:00
Guillermo del Angel 9604fb2ba3 Necessary but not sufficient step to fix GenotypeGivenAlleles mode in UG which is now busted 2011-09-07 16:49:16 -04:00
Mark DePristo 2ded027762 Removed dysfunctional tranches support from VariantEval 2011-09-07 16:09:24 -04:00
Eric Banks aa9e32f2f1 Reverting Mark's previous commit as per the open discussion. Now the eval modules check isPolymorphic() before accruing stats when appropriate. Fixed the IndelLengthHistogram module not to error out if the indel isn't simple (that would have been bad). Only integration test that needed to be updated was the tranches one based on a separate commit from Mark. 2011-09-07 15:48:06 -04:00
Mark DePristo d7e355b4b6 Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-09-07 14:54:16 -04:00
Mark DePristo 9127849f5d BugFix for unit test 2011-09-07 14:54:10 -04:00
Mark DePristo 0037b61e5d Class of scala file should be close to filename, not MDP 2011-09-07 14:48:54 -04:00
Eric Banks 3a04955a30 We already had isPolymorphic and isMonomorphic in the VariantContext, but the implementation was incorrect for many edge cases (e.g. sites-only files, sites with samples who were no-called). Fixing. Moving on to VE now. 2011-09-07 14:01:42 -04:00
Mauricio Carneiro ee9d599558 Just cleaning up
clean up old commented code from tha data processing pipeline.
2011-09-07 13:32:40 -04:00
Guillermo del Angel 743bf7784c Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-09-07 13:21:26 -04:00
Guillermo del Angel 5f22ef9a8c Added missing javadoc info to Beagle arguments 2011-09-07 13:21:11 -04:00
Mark DePristo 3bcbfa6e06 Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-09-07 13:13:17 -04:00