gatk-3.8/public/java/test/org/broadinstitute/sting
Mark DePristo 39e4396de0 New ActiveRegionShardBalancer allows efficient NanoScheduling
-- Previously we used the LocusShardBalancer for the haplotype caller, which meant that TraverseActiveRegions saw its shards grouped in chunks of 16kb bits on the genome.  These locus shards are useful when you want to use the HierarchicalMicroScheduler, as they provide fine-grained accessed to the underlying BAM, but they have two major drawbacks (1) we have to fairly frequently reset our state in TAR to handle moving between shard boundaries and (2) with the nano scheduled TAR we end up blocking at the end of each shard while our threads all finish processing.
-- This commit changes the system over to using an ActiveRegionShardBalancers, that combines all of the shard data for a single contig into a single combined shard.  This ensures that TAR, and by extensions the HaplotypeCaller, gets all of the data on a single contig together so the the NanoSchedule runs efficiently instead of blocking over and over at shard boundaries.  This simple change allows us to scale efficiently to around 8 threads in the nano scheduler:
  -- See https://www.dropbox.com/s/k7f280pd2zt0lyh/hc_nano_linear_scale.pdf
  -- See https://www.dropbox.com/s/fflpnan802m2906/hc_nano_log_scale.pdf
-- Misc. changes throughout the codebase so we Use the ActiveRegionShardBalancer where appropriate.
-- Added unit tests for ActiveRegionShardBalancer to confirm it does the merging as expected.
-- Fix bad toString in FilePointer
2013-05-13 11:09:02 -04:00
..
commandline Fixed the logic of the @Output annotation and its interaction with 'required'. 2013-03-14 11:58:51 -04:00
gatk New ActiveRegionShardBalancer allows efficient NanoScheduling 2013-05-13 11:09:02 -04:00
jna Fix more tests that fail when run in parallel on the farm 2013-03-12 16:53:30 -04:00
utils New faster Smith-Waterman implementation that is edge greedy and assumes that ref and haplotype have same global start/end points. 2013-05-13 09:36:39 -04:00
BaseTest.java Replace org.broadinstitute.variant with jar built from the Picard repo 2013-02-05 17:24:25 -05:00
ExampleToCopyUnitTest.java Replace org.broadinstitute.variant with jar built from the Picard repo 2013-02-05 17:24:25 -05:00
MD5DB.java Enable convenient display of diff engine output in Bamboo, plus misc. minor test-related improvements 2013-05-10 19:00:33 -04:00
MD5Mismatch.java Enable convenient display of diff engine output in Bamboo, plus misc. minor test-related improvements 2013-05-10 19:00:33 -04:00
StingTextReporter.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
TestNGTestTransformer.java Fix tests that were consistently or intermittently failing when run in parallel on the farm 2013-03-06 13:56:54 -05:00
WalkerTest.java Enable convenient display of diff engine output in Bamboo, plus misc. minor test-related improvements 2013-05-10 19:00:33 -04:00