gatk-3.8/public/java/test/org/broadinstitute/sting/gatk
David Roazen 3645ea9bb6 Sequence dictionary validation: detect problematic contig indexing differences
The GATK engine does not behave correctly when contigs are indexed
differently in the reads sequence dictionaries vs. the reference
sequence dictionary, and the inconsistently-indexed contigs are included
in the user's intervals. For example, given the dictionaries:

Reference dictionary = { chrM, chr1, chr2, ... }
BAM dictionary       = { chr1, chr2, ... }

and the interval "-L chr1", the engine would fail to correctly retrieve
the reads from chr1, since chr1 has a different index in the two dictionaries.

With this patch, we throw an exception if there are contig index differences
between the dictionaries for reads and reference, AND the user's intervals
include at least one of the mismatching contigs.

The user can disable this exception via -U ALLOW_SEQ_DICT_INCOMPATIBILITY

In all other cases, dictionary validation behaves as before.

I also added comprehensive unit tests for the (previously-untested)
SequenceDictionaryUtils class.

GSA-768 #resolve
2013-02-25 11:14:22 -05:00
..
datasources Re-enabled fastBAMindexing by replacing the FileChannel with a SeekableBufferedStream 2013-02-14 17:51:15 -05:00
downsampling Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
executive Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
filters Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
iterators Refactor LIBS into utils.locusiterator before refactoring 2013-01-11 15:17:16 -05:00
refdata Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
report Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
samples Updating TestNG to the latest version 2013-02-22 09:40:23 -05:00
traversals Collapse the downsampling fork in the GATK engine 2013-01-28 01:50:30 -05:00
walkers Sequence dictionary validation: detect problematic contig indexing differences 2013-02-25 11:14:22 -05:00
CommandLineGATKUnitTest.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
EngineFeaturesIntegrationTest.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
GenomeAnalysisEngineUnitTest.java ActivityProfile and ActiveRegions respects engine interval boundaries 2013-02-18 10:40:25 -05:00
MaxRuntimeIntegrationTest.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
WalkerManagerUnitTest.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00