gatk-3.8/protected/java/test/org/broadinstitute/sting/utils
Mauricio Carneiro 285ab2ac62 Better caching for the HaplotypeCaller
Problem
-------
Caching strategy is incompatible with the current sorting of the haplotypes, and is rendering the cache nearly useless.

Before the PairHMM updates, we realized that a lexicographically sorted list of haplotypes would optimize the use of the cache. This was only true until we've added the initial condition to the first row of the deletion matrix, which depends on the length of the haplotype. Because of that, every time the haplotypes differ in length, the cache has to be wiped. A lexicographic sorting of the haplotypes will put different lengths haplotypes clustered together therefore wasting *tons* of re-compute.

Solution
-------
Very simple. Sort the haplotypes by LENGTH and then in lexicographic order.
2013-08-02 01:27:29 -04:00
..
genotyper Relaxing the constraints on the readIsPoorlyModelled function. 2013-06-13 11:06:23 -04:00
gvcf Working version of HaplotypeCaller ReferenceConfidenceModel that accounts for indels as well as SNP confidences 2013-07-02 15:46:38 -04:00
haplotype Better caching for the HaplotypeCaller 2013-08-02 01:27:29 -04:00
haplotypeBAMWriter Working version of HaplotypeCaller ReferenceConfidenceModel that accounts for indels as well as SNP confidences 2013-07-02 15:46:38 -04:00
nanoScheduler A whole slew of improvements to the Haplotype Caller and related code. 2013-07-12 10:09:10 -04:00
pairhmm Removed native method call 2013-07-30 13:59:58 -04:00
recalibration Added hidden mode for BQSR to force all read groups to be the same one. 2013-05-06 19:09:10 -04:00
smithwaterman New faster Smith-Waterman implementation that is edge greedy and assumes that ref and haplotype have same global start/end points. 2013-05-13 09:36:39 -04:00
ContigComparatorUnitTest.java Generalize and fixup ContigComparator 2013-02-09 09:52:13 -05:00