gatk-3.8

Commit Graph

Author	SHA1	Message	Date
Eric Banks	6df6c1abd5	Fix for PBT to stop NPE when there are no likelihoods present	2012-09-06 13:14:18 -04:00
Christopher Hartl	a7396d29c9	Final bugfixes to LDCorrectedDosage. In addition: rigorous contractification and a unit test for core functionalities. Special thanks to David for help with building and running contracts.	2012-09-06 00:52:31 -04:00
Mark DePristo	5ab5d8dee8	Give EfficiencyMonitoringThreadFactoryUnitTest longer to complete its tests	2012-09-05 22:08:34 -04:00
Mark DePristo	1b064805ed	Renaming -cnt to -nct for consistency	2012-09-05 21:13:19 -04:00
Mark DePristo	e77abfa82d	Merge branch 'nanoScheduler'	2012-09-05 21:10:08 -04:00
Mark DePristo	0bd2a872fa	Done GSA-282: Unindexed traversals crash if a read goes off the end of a contig -- Already fixed in the codebase. Added unindexed bam and integration tests to ensure this is fine going forward.	2012-09-05 21:10:03 -04:00
Mark DePristo	228bac75e4	By default do only NT tests in integration tests	2012-09-05 20:57:49 -04:00
Mark DePristo	574a8f710b	Add static boolean controlled output of individual map call timing to nanoSecond resolution	2012-09-05 17:40:02 -04:00
Mark DePristo	e11915aa0a	GSA-515 Nanoscheduler GSA-550 ThreadSafeMapReduce shouldn't be super interface of TreeReducible	2012-09-05 17:37:56 -04:00
Mark DePristo	c5f1ceaa95	All read and loci traversals go through NanoScheduler now -- The NanoScheduler is doing a good job at tracking important information like time spent in map/reduce/input etc. -- Can be disabled with static boolean in MicroScheduler if we have problems -- See GSA-515 Nanoscheduler GSA-549 Retire TraverseReads and TraverseLoci after testing confirms nano scheduler version in single threaded version is fine	2012-09-05 16:38:21 -04:00
Mark DePristo	dddf148a59	Fixed bug in ThreadAllocation getTotalNumberOfThreads -- It isn't data + cpu, it's data * cpu threads.	2012-09-05 16:35:32 -04:00
Mark DePristo	225f3a0ebe	Update integration test system to allow us to differentiate between testing data and cpu parallelism	2012-09-05 16:35:00 -04:00
Mark DePristo	9bf1d138d9	New GATK argument interface for data and cpu threads -- Closes GSA-515 Nanoscheduler GSA-542 Good interface to nanoScheduler -- Old -nt means dataThreads -- New -cnt (--num_cpu_threads_per_data_thread) gives you n cpu threads for each data thread in the system -- Cleanup logic for handling data and cpu threading in HMS, LMS, and MS -- GATKRunReport reports the total number of threads in use by the GATK, not just the nt value -- Removed the io,cpu tags for nt. Stupid system if you ask me. Cleaned up the GenomeAnalysisEngine and ThreadAllocation handling to be totally straightforward now	2012-09-05 15:45:24 -04:00
Mark DePristo	1e55475adc	NanoScheduler uses ExecutorService to run input reader thread	2012-09-05 15:45:24 -04:00
Mark DePristo	71d9ebcb0d	Fix bug (introduced by me) that didn't include contig in progress meter	2012-09-05 15:45:24 -04:00
Mark DePristo	c822b7c760	Fix long-standing NPE in LMS due to inappropriate timing of initialization	2012-09-05 15:45:24 -04:00
Mark DePristo	a997c99806	Initial NanoScheduler with input producer thread	2012-09-05 15:45:24 -04:00
Mark DePristo	03dd470ec1	Test for progressFunction in NanoScheduler; bugfix for single threaded fast path	2012-09-05 15:45:23 -04:00
Mark DePristo	8cdeb51b78	Cleanup printProgress in TraversalEngine -- Separate updating cumulative traversal metrics from printing progress. There's now an updateCumulativeMetrics function and a printProgress() that only takes a current position -- printProgress now relies solely on the time since the last progress to decide if it will print or not. No longer uses the number of cycles, since this isn't reliable in the case of nano scheduling -- GenomeAnalysisEngine now maintains a pointer to the master cumulative metrics. getCumulativeMetrics never returns null, which was handled in some parts of the code but not others. -- Update all of the traversals to use the new updateCumulativeMetrics, printProgress model -- Added progress callback to nano scheduler. Every bufferSize elements this callback is invoked, allowing us to smoothly update the progress meter in the NanoScheduler -- Rename MapFunction to NanoSchedulerMap and the same for reduce.	2012-09-05 15:45:23 -04:00
Mark DePristo	d503ed97ab	Mark I NanoScheduling TraverseLoci -- Refactored TraverseLoci into old linear version and nano scheduling version -- Temp. GATK argument to say how many nano threads to use -- Can efficiently scale to 3 threads before blocking on input	2012-09-05 15:45:23 -04:00
Mark DePristo	757e6a0160	Making Pileup thread-safe -- Old version relied on out printstream magically sorting output, new version puts the print in reduce	2012-09-05 15:45:23 -04:00
Mark DePristo	d7105223fe	More debugging output for NanoScheduler when debugging is enabled	2012-09-05 15:45:23 -04:00
Mark DePristo	9823102c0c	TraverseReadsNano supports walker.filter and walker.done -- Instead of returning directly the result of map(), returns a MapResult object with the value and a reduceMe flag. -- Reduce function respects the reduceMe flag -- Code cleanup and more documentation	2012-09-05 15:45:23 -04:00
Mark DePristo	1a8f5fc374	Trivial cleanup of NanoScheduler	2012-09-05 15:45:23 -04:00
Mark DePristo	6a5a70cdf1	Done GSA-539: SimpleTimer should use System.nanoTime for nanoSecond resolution	2012-09-05 15:45:23 -04:00
Mark DePristo	59109d5eeb	NanoScheduler tracks time outside of its execute call	2012-09-05 15:45:23 -04:00
Mark DePristo	800a27c3a7	NanoScheduler tracks time within input, map, and reduce -- Helpful for understanding where the time goes to each bit of the code. -- Controlled by a local static boolean, to avoid the potential overhead in general	2012-09-05 15:45:23 -04:00
Mark DePristo	7087b22ea3	No debugging output (even conditional) for ReadTransformers in PrintReads	2012-09-05 15:45:23 -04:00
Mark DePristo	e01258b261	NanoScheduler now supports printProgress. Bugfixes to printProgress -- TraverseReadsNano prints progress at the end of each traversal unit -- Fix bugs in TraversalEngine printProgress -- Synchronize the method so we don't get multiple logged outputs when two or more HMSs call printProgress before initialization at the start! -- Fix the logic for mustPrint, which actually had the logic of mustNotPrint. Now we see the done log line that was always supposed to be there -- Fix output formatting, as the done() line was incorrectly shifting over the % complete by 1 char as 100.0% didn't fit in %4.1f -- Add clearer doc on -PF argument so that people know that the performance log can be generated to standard out if one wants	2012-09-05 15:45:23 -04:00
Mark DePristo	6055101df8	NanoScheduler no longer groups inputs, each map() call is interlaced now -- Maximizes the efficiency of the threads -- Simplifies interface (yea!) -- Reduces number of combinatorial tests that need to be performed	2012-09-05 15:45:22 -04:00
Mark DePristo	397a5551ef	More memory for gatkdocs and extracthelp targets	2012-09-05 15:45:22 -04:00
Mark DePristo	e3b4cc02aa	Done GSA-282: Unindexed traversals crash if a read goes off the end of a contig -- Already fixed in the codebase. Added unindexed bam and integration tests to ensure this is fine going forward.	2012-09-05 15:45:22 -04:00
Yossi Farjoun	d6884e705a	Revert "fixed a typo in StringText.properties" This reverts commit b74c1c17e748f75e59d23545084b983e2a8d2fa6.	2012-09-05 15:21:00 -04:00
Yossi Farjoun	ad5fa449e7	fixed a typo in the string comment	2012-09-05 14:46:10 -04:00
Yossi Farjoun	f4b39a7545	Merge branch 'master' of ssh://gsa4/humgen/gsa-scr1/gsa-engineering/git/unstable merging trivially after a commit	2012-09-05 14:33:39 -04:00
Yossi Farjoun	6e517df5d9	fixed a typo in StringText.properties	2012-09-05 14:33:08 -04:00
Ryan Poplin	df749b27c1	Bug fix in delocalized BQSR in the calculateIsIndel() function	2012-09-05 14:21:20 -04:00
Ryan Poplin	84a83fd3f3	fixing typo	2012-09-05 10:41:03 -04:00
Eric Banks	fc06f39411	Fixed docs for Pileup walker	2012-09-05 09:55:34 -04:00
Christopher Hartl	d795437202	- New UserExceptions added for when ReadFilters or Walkers specified on the command line are not found. When -rf xxxx cannot find the class corresponding to xxxx, all read filters are printed in a better formatted way, with links to their gatk docs. - VariantAnnotatorEngine changed to call genotype annotations even if pileups and allele -> likelihood mappings are not present. Current genotype annotations altered to check for null pileups and null mappings.	2012-09-04 16:41:44 -04:00
Ryan Poplin	9cc1a9931b	Resolving merge conflicts.	2012-09-04 10:47:38 -04:00
Ryan Poplin	c9944d81ef	Skip array needs to also be used in the updateDataForRead function of the delocalized BQSR.	2012-09-04 10:33:37 -04:00
Mark DePristo	0892f2b8b2	Closing GSA-287:LocusReferenceView doesn't do very well in the case where contigs land off the end of the reference -- Confirmed that reads spanning off the end of the chromosome don't cause an exception by adding integration test for a single read that starts 7 bases from the end of chromosome 1 and spans 90 bases or so off. Added pileup integration test to ensure this behavior continues to work	2012-09-03 20:18:56 -04:00
Mark DePristo	52d6bea804	a few more useful git ignores	2012-09-01 11:08:36 -04:00
Mark DePristo	1b0ce511a6	Updating BQSR tests due to my change to reset BQSR calibration data	2012-08-31 19:51:09 -04:00
Eric Banks	277ba94c7b	Update from dbsnp135 to dbsnp137.	2012-08-31 14:06:29 -04:00
Eric Banks	5ea7cd6dcc	Updating resource bundle: no reason to include both genotype and sites files for Omni and HM3, sites are enough. Also, don't include duplicate entry for the Mills indels.	2012-08-31 14:01:54 -04:00
Mark DePristo	f066a02f3e	Merge branch 'applyRecalibration'	2012-08-31 13:43:52 -04:00
Mark DePristo	c9ea213c9b	Make BaseRecalibration thread-safe -- In the process uncovered two strange things 1 -- qualityScoreByFullCovariateKey was created but never used. Seems like a cache? 2 -- Discovered nasty bug in BaseRecalibrator: https://jira.broadinstitute.org/browse/GSA-534	2012-08-31 13:42:42 -04:00
Mark DePristo	27ddebee53	Protect PrintReads from strange state from TraverseReadsUnitTests	2012-08-31 13:42:41 -04:00
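
The thread-count fix in dddf148a59, together with the -nt and -nct arguments described in 9bf1d138d9 and 1b064805ed, comes down to one line of arithmetic: each data thread gets its own set of cpu threads, so the total is a product, not a sum. The sketch below only illustrates that arithmetic. The class and method names are hypothetical and are not taken from GATK's ThreadAllocation code.

```java
// Hypothetical illustration only; not the GATK ThreadAllocation implementation.
public final class ThreadTotalSketch {

    // dataThreads corresponds to -nt; cpuThreadsPerDataThread corresponds to -nct.
    // Every data thread runs its own group of cpu threads, so the counts multiply.
    static int totalThreads(int dataThreads, int cpuThreadsPerDataThread) {
        if (dataThreads < 1 || cpuThreadsPerDataThread < 1) {
            throw new IllegalArgumentException("thread counts must be >= 1");
        }
        return dataThreads * cpuThreadsPerDataThread;
    }

    public static void main(String[] args) {
        // Two data threads with four cpu threads each.
        System.out.println(totalThreads(2, 4));
    }
}
```

With 2 data threads and 4 cpu threads per data thread, the product is 8 threads, whereas the sum described as the bug would report 6.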

10548 Commits (bfbf1686cd0f71c94dea59c84b6c74c71f0ae1af)
