gatk-3.8

Commit Graph

Author	SHA1	Message	Date
Mark DePristo	371f3752c1	Subshard timeouts in the GATK -- The previous implementation of the maxRuntime would require us to wait until all of the work was completed within a shard, which can be a substantial amount of work in the case of a locus walker with 16kb shards. -- This implementation ensures that we exit from the traversal very soon after the max runtime is exceeded, without completely all of our work within the shard. This is done by updating all of the traversal engines to return false for hasNext() in the nano scheduled input provider. So as soon as the timeout is exceeeded, we stop generating additional data to process, and we only have to wait until the currently executing data processing unit (locus, read, active region) completes. -- In order to implement this timeout efficiently at this fine scale, the progress meter now lives in the genome analysis engine, and the exceedsTimeout() call in the engine looks at a periodically updated runtime variable in the meter. This variable contains the elapsed runtime of the engine, but is updated by the progress meter daemon thread so that the engine doesn't call System.nanotime() in each cycle of the engine, which would be very expense. Instead we basically wait for the daemon to update this variable, and so our precision of timing out is limited by the update frequency of the daemon, which is on the order of every few hundred milliseconds, totally fine for a timeout. -- Added integration tests to ensure that subshard timeouts are working properly	2013-05-15 07:00:39 -04:00
David Roazen	a67d8c8dd6	Bump timeout for MaxRuntimeIntegrationTest Looks like returning this timeout to its original value was a bit too aggressive -- adding 40 seconds to the tolerance limit.	2013-03-17 16:17:29 -04:00
David Roazen	742a7651e9	Further tweaking of test timeouts Increase one timeout, restore others that were only timing out due to the Java crypto lib bug to their original values. -DOUBLE timeout for NanoSchedulerUnitTest.testNanoSchedulerInLoop() -REDUCE timeout for EngineFeaturesIntegrationTest to its original value -REDUCE timeout for MaxRuntimeIntegrationTest to its original value -REDUCE timeout for GATKRunReportUnitTest to its original value	2013-03-15 14:49:21 -04:00
David Roazen	cdb1fa1105	Fix more tests that fail when run in parallel on the farm -Allow the default S3 put timeout of 30 seconds for GATKRunReports to be overridden via a constructor argument, and use a timeout of 300 seconds for tests. The timeout remains 30 seconds in all other cases. -Change integration tests that themselves dispatch farm jobs into pipeline tests. Necessary because some farm nodes are not set up as submit hosts. Pipeline tests are still run directly on gsa4. -Bump up the timeout for the MaxRuntimeIntegrationTest even more (was still occasionally failing on the farm!)	2013-03-12 16:53:30 -04:00
David Roazen	3ab78543a7	Fix tests that were consistently or intermittently failing when run in parallel on the farm -Make MaxRuntimeIntegrationTest more lenient by assuming that startup overhead might be as long as 120 seconds on a very slow node, rather than the original assumption of 20 seconds -In TraverseActiveRegionsUnitTest, write temp bam file to the temp directory, not to the current working directory -SimpleTimerUnitTest: This test was internally inconsistent. It asserted that a particular operation should take no more than 10 milliseconds, and then asserted again that this same operation should take no more than 100 microseconds (= 0.1 millisecond). On a slow node it could take slightly longer than 100 microseconds, however. Changed the test to assert that the operation should require no more than 10000 microseconds (= 10 milliseconds) -change global default test timeout from 20 to 40 minutes (things just take longer on the farm!) -build.xml: allow runtestonly target to work with scala test classes	2013-03-06 13:56:54 -05:00
David Roazen	65d31ba4ad	Fix runtime public -> protected dependencies in the test suite -replace unnecessary uses of the UnifiedGenotyper by public integration tests with PrintReads -move NanoSchedulerIntegrationTest to protected, since it's completely dependent on the UnifiedGenotyper	2013-02-26 21:19:12 -05:00
Mauricio Carneiro	2a4ccfe6fd	Updated all JAVA file licenses accordingly GSATDG-5	2013-01-10 17:06:41 -05:00
Mark DePristo	251983b8fb	Add GATK-wide command line argument to control the maximum runtime allowed for the GATK -- Providing this optional argument -maxRuntime (in -maxRuntimeUnits units) causes the GATK to exit gracefully when the max. runtime has been exceeded. By cleanly I mean that the engine simply stops at the next available cycle in the walker as through the end of processing had been reached. This means that all output files are closed properly, etc. -- Emits an info message that looks like "INFO 10:36:52,723 MicroScheduler - Aborting execution (cleanly) because the runtime has exceeded the requested maximum 10.0000 s". Otherwise there's currently no way to differentiate a truly completed run from a timelimit exceeded run, which may be a useful thing for a future update -- Resolves GSA-630 / GATK max runtime to deal with bad LSA calling? -- Added new JIRA entry for Ami to restart chr1 macarthur with this argument set to -maxRuntime 1 -maxRuntimeUnits DAYS to see if we can do all of chr1 in one weekend.	2012-10-26 13:18:34 -04:00

8 Commits (701d70401f406c1a9c63a0b2e91c837ce095c871)