gatk-3.8/protected/java/test/org/broadinstitute/sting/gatk/walkers
Mark DePristo b115e5c582 Critical bugfix for CommonSuffixSplitter to avoid infinite loops
-- The previous version would enter into an infinite loop in the case where we have a graph that looks like:

X -> A -> B
Y -> A -> B

So that the incoming vertices of B all have the same sequence.  This would cause us to remodel the graph endlessly, extracting the common sequence A and rebuilding exactly the same graph each time.  Fixed and unit tested.

-- Additionally, added a maximum number of simplification cycles (100).  Exceeding it throws an error and writes out the graph for future debugging, so the GATK will always error out rather than run forever.
-- After 5 rounds of simplification we start keeping a copy of the previous graph, and then check if the current graph is actually different from the previous graph.  Equals here means that all vertices have equivalents in both graphs, as do all edges.  If the two graphs are equal we stop simplifying.  It can be a bit expensive but it only happens when we end up cycling due to the structure of the graph.
-- Added a unit test for a graph that previously went into an infinite loop (found empirically when running the CEU trio) and confirmed that the new approach aborts correctly
-- #resolves GSA-924
-- See https://jira.broadinstitute.org/browse/GSA-924 for more details
-- Update MD5s due to change in assembly graph construction
2013-04-09 16:19:26 -04:00
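
The two safeguards described in the commit message (a hard cap of 100 simplification cycles, and a previous-versus-current graph comparison that starts after 5 rounds) can be illustrated with a minimal Java sketch. This is not the GATK implementation: the class `SimplifyLoopSketch`, the toy `Graph` type, the constants, and the `simplify` method are all hypothetical names chosen for illustration, and the toy graph identifies vertices by their sequence string only.

```java
import java.util.HashSet;
import java.util.Set;
import java.util.function.UnaryOperator;

/** Hypothetical sketch; none of these names come from the GATK source. */
final class SimplifyLoopSketch {
    static final int MAX_CYCLES = 100;          // cap stated in the commit message
    static final int START_COMPARING_AFTER = 5; // rounds before snapshots are kept

    /** Toy graph: vertices are sequence strings, edges are "from->to" strings. */
    static final class Graph {
        final Set<String> vertices = new HashSet<>();
        final Set<String> edges = new HashSet<>();

        Graph addEdge(String from, String to) {
            vertices.add(from);
            vertices.add(to);
            edges.add(from + "->" + to);
            return this;
        }

        Graph copy() {
            Graph g = new Graph();
            g.vertices.addAll(vertices);
            g.edges.addAll(edges);
            return g;
        }

        /** Equal when every vertex and every edge has an equivalent in the other graph. */
        @Override
        public boolean equals(Object o) {
            if (!(o instanceof Graph)) return false; // also covers null
            Graph other = (Graph) o;
            return vertices.equals(other.vertices) && edges.equals(other.edges);
        }

        @Override
        public int hashCode() {
            return 31 * vertices.hashCode() + edges.hashCode();
        }
    }

    /**
     * Applies one simplification step per cycle. Returns the cycle at which the
     * step stopped changing the graph, or throws once MAX_CYCLES is exhausted.
     */
    static int simplify(Graph start, UnaryOperator<Graph> step) {
        Graph current = start;
        Graph previous = null;
        for (int cycle = 1; cycle <= MAX_CYCLES; cycle++) {
            // Snapshots cost a full copy, so they are only taken after the first few rounds.
            if (cycle > START_COMPARING_AFTER) {
                previous = current.copy();
            }
            current = step.apply(current);
            if (current.equals(previous)) {
                return cycle; // the step rebuilt exactly the same graph: stop simplifying
            }
        }
        // The commit message says the graph is written out for debugging;
        // this sketch only puts the edges into the exception message.
        throw new IllegalStateException(
            "No stable graph after " + MAX_CYCLES + " cycles; edges: " + current.edges);
    }
}
```

In this sketch, a step that rebuilds an identical graph (for example `g -> g.copy()`) is detected on the first compared round, while a step that keeps changing the graph runs into the cycle cap and fails with an exception instead of looping forever.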
..
annotator Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
beagle Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
bqsr Replace uses of NestedHashMap with NestedIntegerArray. 2013-02-27 14:03:39 -05:00
compression/reducereads Updated AssessReducedQuals and applied it systematically to all ReduceReads integration tests. 2013-03-31 00:27:14 -04:00
diagnostics walker to calculate per base coverage distribution 2013-02-07 16:33:05 -05:00
diffengine Fixed issues raised by Appistry QA (mostly small fixes, corrections & clarifications to GATKDocs) 2013-03-12 10:57:14 -04:00
fasta Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
filters Don't allow users to specify keys and IDs that contain angle brackets or equals signs (not allowed in VCF spec). 2013-04-05 00:52:32 -04:00
genotyper Fix caching indices in the PairHMM 2013-04-08 11:05:12 -04:00
haplotypecaller Critical bugfix for CommonSuffixSplitter to avoid infinite loops 2013-04-09 16:19:26 -04:00
indels Fixed IndelRealigner reference length bug (GSA-774) 2013-02-19 16:00:36 -05:00
phasing Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
validation MathUtils.randomSubset() now uses Collections.shuffle() (indirectly, through the other methods 2013-03-29 14:52:10 -04:00
varianteval Move some VCF/VariantContext methods back to the GATK based on feedback 2013-01-29 16:56:55 -05:00
variantrecalibration Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
variantutils Using --keepOriginalAC in SelectVariants was causing it to emit bad VCFs 2013-04-05 00:53:28 -04:00