11091 Commits (c8be7c3102d7764b0141665bc64955062da8c00a)

Author SHA1 Message Date
Menachem Fromer c8be7c3102 Keep SNPs and indels separately for batch merging; Add options to DepthOfCoverage to count fragments (to not double-count overlapping reads of same fragment); DepthOfCoverage should now support ReducedReads; Replace recursion with loop in DoC/package.scala (for lists longer than 5000 elements) 2012-11-21 15:56:53 -05:00
Menachem Fromer 8376c28728 Merge branch 'master' of github.com:broadinstitute/gsa-unstable 2012-11-20 12:35:39 -05:00
Menachem Fromer d2a6e2526d Fixed Input vs. Argument 2012-11-20 12:34:47 -05:00
Menachem Fromer 9111966261 Merge branch 'master' of github.com:broadinstitute/gsa-unstable 2012-11-20 12:19:58 -05:00
David Roazen 553e635415 Bring miscellaneous additional infrastructure scripts under version control 2012-11-19 16:12:05 -05:00
David Roazen 28b8401867 Checking in new git setup script 2012-11-19 10:02:18 -05:00
David Roazen e68dc24aae Git infrastructure updates
-Webhook handler
-Infer From: address from last commit in push rather than from userid
running the git process
2012-11-19 09:45:54 -05:00
David Roazen e1a5c3ce7a Revert "Second test of new repos. Please ignore."
This reverts commit 077532d870ddf53ec514b98f14534ca7dbf55331.
2012-11-19 09:34:32 -05:00
David Roazen 9fc63efc30 Second test of new repos. Please ignore. 2012-11-19 09:34:15 -05:00
David Roazen 2a16d6fa55 Revert "Testing out the new github-hosted repos. Please ignore."
This reverts commit b6bf66cd088754e7fd3d5f105ca8b2551237f183.
2012-11-19 08:07:19 -05:00
David Roazen 855a68ae39 Testing out the new github-hosted repos. Please ignore. 2012-11-19 08:06:58 -05:00
Joel Thibault b70fd4a242 Initial testing of the Active Region Traversal contract
- TODO: many more tests and test cases
2012-11-15 10:08:00 -05:00
Guillermo del Angel a68e6810c9 Back off experimental code that escaped last commit, not for general use yet 2012-11-14 14:45:15 -05:00
Guillermo del Angel 89bbe73a43 Commenting out CMI pipeline test that wasn't meant to be in GATK repository (why was this merged??) 2012-11-14 14:39:04 -05:00
Guillermo del Angel 3771d074dc Merge branch 'master' of ssh://gsa3/humgen/gsa-scr1/gsa-engineering/git/unstable 2012-11-14 14:37:43 -05:00
Eric Banks 843384e435 Rename hg19 files in bundle to b37 since that's what they are 2012-11-14 11:47:09 -05:00
Mauricio Carneiro e35fd1c717 Merging CMI-0.5.0 and GATK-2.2 together. 2012-11-14 10:42:03 -05:00
Mauricio Carneiro a079d8d0d1 Breaking the utility to write @PG tags for SAMFileWriters and StingSAMFileWriters 2012-11-14 10:33:22 -05:00
Mauricio Carneiro dba31018f4 Implementation of BySampleSAMFileWriter
ReduceReads now works with the n-way-out capability, splitting by sample.
DEV-27 #resolve #time 3m
2012-11-14 10:33:22 -05:00
Kristian Cibulskis 8503db6e87 moving cancer pipeline qscript over to cmi-cancer 2012-11-14 10:33:21 -05:00
Kristian Cibulskis de53c32361 moving to cmi-cancer 2012-11-14 10:33:21 -05:00
Guillermo del Angel ff63c53961 Increase bwa mem limit by 1 Gb - see if we can avoid random job failures that way 2012-11-14 10:33:21 -05:00
Mauricio Carneiro a17cd54b68 Co-Reduction implementation in ReduceReads
ReduceReads now co-reduces bams if they're passed in together with multiple -I. Co-reduction forces every variant region in one sample to be a variant region in all samples.
Also:
  * Added integration test for co-reduction
  * Fixed bug with new no-recalculation implementation of the marksites object where the last object wasn't being removed after finalizing a variant region (updated MD5's accordingly)

DEV-200 #resolve #time 8m
2012-11-14 10:33:21 -05:00
Douglas Voet f4511c0d6f made parameters not required, fixed some null file names 2012-11-14 10:33:21 -05:00
Douglas Voet 1bfe73397f made parameters not required, switched date sequenced format, bug fixes in pulling remote files 2012-11-14 10:33:21 -05:00
Scott Frazer f9cc68ce17 pull metadata from metadata service 2012-11-14 10:33:20 -05:00
kshakir 6d59dd3455 Scala classes were only returning direct subclasses (confirmed when inspected in debugger) so changed PluginManager to allow specifying the explicit subclass.
Removed some generics from PluginManager for now until able to figure out syntax for requesting explicit subclass.
QStatusMessenger uses a slightly more primitive Map[String, Seq[RemoteFile]] instead of Map[ArgumentSource, Seq[RemoteFile]].
Added a QCommandPlugin.initScript utility method for handling specialized script types.
2012-11-14 10:33:20 -05:00
Eric Banks 42ddf51156 Merged bug fix from Stable into Unstable 2012-11-14 10:29:09 -05:00
Eric Banks ba41f65759 Protect against NPEs in SelectVariants by checking for missing Genotypes 2012-11-13 11:53:39 -05:00
Eric Banks c7335c9902 Having a malformed GATK report is a User Error 2012-11-13 11:53:12 -05:00
Eric Banks 525cf331f4 Don't catch a User Error and re-throw as a Reviewed Exception. That makes Eric unhappy. 2012-11-13 11:52:47 -05:00
Eric Banks e93d461910 Adding integration test to BQSR for the csv file 2012-11-09 09:11:04 -05:00
Eric Banks ee776e996a Merged bug fix from Stable into Unstable 2012-11-09 08:35:51 -05:00
Eric Banks 66cbaaee31 Fixed nasty bug in BQSR csv file creation:
numbers larger than 999 in the Errors column were printed out with commas (which looks like a separate column).

This wasn't caught earlier because there are no integration tests covering the csv.  I'll add one into unstable in a sec.
2012-11-09 08:33:55 -05:00
Eric Banks e9183d9fe0 Fix bugs as reported on the forum: BED needs to be explicitly set as the default output format and the output didn't actually adhere to the BED spec. 2012-11-08 15:07:47 -05:00
Eric Banks 17ab3a39d5 Make the --intermediate_csv_file argument un-hidden. 2012-11-08 14:35:23 -05:00
Eric Banks 2da76db945 Updating integration tests 2012-11-06 22:23:05 -08:00
Eric Banks e6f33a2e34 Merge branch 'master' of ssh://gsa2/humgen/gsa-scr1/gsa-engineering/git/unstable 2012-11-06 20:55:34 -08:00
Eric Banks f4d4846435 Merged bug fix from Stable into Unstable 2012-11-06 20:53:54 -08:00
Eric Banks 15b8c08132 Apparently CIGAR elements can have 0 length according to the spec, but 0Ms were causing left alignment of indels to fail. Fixed. 2012-11-06 20:53:33 -08:00
Eric Banks 0a2dded093 Fixes for bugs uncovered by unit tests 2012-11-06 16:07:40 -08:00
Eric Banks b07106b3a7 Reimplement the allele biased downsampling to be smarter. Now we don't blindly pull n% of reads off of each allele. Instead, we try all possible genotype conformations for the contaminating sample and choose the one that provides the best genotype for the target sample (based heuristically on allele balance). This method allows us to save some of the reads that belong to the target sample, which should make Daniel M happy. Added unit tests to test the biased downsampling functionality. 2012-11-06 14:39:58 -08:00
Ami Levy Moonshine bff7803f3a update the Mills_and_1000G_gold_standard file 2012-11-05 13:00:40 -05:00
Ami Levy Moonshine 659ec4b6b7 Merge branch 'master' of ssh://gsa2.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2012-11-05 11:43:57 -05:00
Ami Levy Moonshine fca6321be2 update the dbSNP file 2012-11-05 11:43:36 -05:00
David Roazen bf7c832d25 Merged bug fix from Stable into Unstable 2012-11-02 19:23:11 -04:00
David Roazen eae2d019cf Refuse to package the GATK from a non-clean working directory
Packaging from a non-clean working directory can result in an incorrect
jar. Now that we have external collaborators packaging and distributing
the GATK, not enforcing the clean requirement has become far too dangerous.
At the same time, invoking "clean" automatically through a direct
dependency would also be dangerous -- instead, it's better to error out
if a packaging target is invoked from a non-clean working dir.
2012-11-02 19:11:24 -04:00
Mark DePristo 0ab4022f23 Final r119 tribble jar 2012-11-02 14:30:33 -04:00
David Roazen 73157ae3d3 Allow each pipeline test the max of 10 hours to run
The runtime of these tests is extremely variable -- sometimes they will complete almost instantly,
other times they will wait in an LSF queue for 5-10+ hours. Minimize timeout errors by setting the
timeout for these tests to the maximum of 10 hours.
2012-11-02 12:40:56 -04:00
Mark DePristo f8a0a947e3 Critical bugfix for GSA-652 / Multi-threaded VCF -> BCF writing produces invalid intermediate file that fails on merging
-- New tribble library now uses 64-bit sizes.  The 26K VCF has so much data that low-level tribble block indices were overflowing their int size values.  This includes a to-be-committed tribble jar that fixes this problem
-- See https://jira.broadinstitute.org/browse/GSA-652
-- Minor cleanup of error messages that were useful on the way to solving this monster problem
2012-11-02 09:09:59 -04:00