Commit Graph

7213 Commits (5a6ae954bfd28c8a2ae31fcbe0ccead7760b4645)

Author SHA1 Message Date
Mark DePristo 5a6ae954bf Added VCF streaming to tribbleVsGATK 2011-08-26 14:04:48 -04:00
Mark DePristo bd92a1b220 Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-26 13:24:26 -04:00
Mark DePristo e37a638e09 Fix for disallowed characters in GATKReportTable
-- Illegal characters are automatically replaced with _
2011-08-26 13:24:06 -04:00
Ryan Poplin 6e66f1c243 Removing code that cleans up the assembly graph for purposes of display. There seem to be bugs 2011-08-26 12:36:12 -04:00
Ryan Poplin 8f150c6764 Assembly debug mode now uses smith-waterman to locally align all haplotypes and outputs to a bam file instead of first writing a fasta file and using bwa-sw outside of the GATK. 2011-08-26 10:43:40 -04:00
Ryan Poplin 8f90a22555 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-26 10:16:56 -04:00
Mark DePristo 0cb1605df0 Clean documentation for JobRunInfo 2011-08-26 09:22:58 -04:00
Mark DePristo 415d5d5301 LSF long times are in seconds, convert to milliseconds to meet standard 2011-08-26 09:18:28 -04:00
Mark DePristo c11ddf3da3 Enhancements to queueJobReport
-- Runtimes are in seconds, not milliseconds
-- Job names are shown in the gantt chart
-- axis names are clearer
-- documentation for all of the functions
2011-08-26 09:17:09 -04:00
Mark DePristo c0503283df Spelling fix requires md5 updates 2011-08-26 07:40:44 -04:00
Mark DePristo eef1ac415a Merge branch 'master' into rodTesting
Conflicts:
	public/java/src/org/broadinstitute/sting/gatk/walkers/variantutils/VariantsToTable.java
2011-08-26 00:35:41 -04:00
Mark DePristo 6bd3959ff2 Now shows job scheduling over time 2011-08-26 00:19:50 -04:00
Mark DePristo 34fb6ef798 SitesVsGenotypes should only be on chr1 2011-08-26 00:18:09 -04:00
Eric Banks c0e55c5ca4 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-25 22:42:27 -04:00
Eric Banks 9b7512fd94 Just because there's a ref base doesn't mean the VC needs to be padded 2011-08-25 22:42:14 -04:00
Menachem Fromer 6d8851eb63 Added option to control memory usage of DoC merging step 2011-08-25 22:12:54 -04:00
Mark DePristo 03a655023c Must explicitly enable multithreaded tests
-- Avoids sending nt 8 jobs to the unsuspecting farm
2011-08-25 19:41:08 -04:00
Mark DePristo 1a50a251d8 Draft version of report 2011-08-25 17:03:50 -04:00
Mark DePristo e03dfdb0ab Automatic iteration field addition works properly. 2011-08-25 16:59:02 -04:00
Mark DePristo e01273ca7c Queue now writes out queueJobReport.pdf
-- General purpose RScript executor in java (please use when invoking RScripts)
-- Removed groupName.  This is now analysisName
-- Explicitly added capability to enable/disable individual QFunction
2011-08-25 16:57:11 -04:00
Eric Banks 09a729da3a Removing incorrect comment 2011-08-25 15:42:52 -04:00
Eric Banks 3eed5af2f7 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-25 15:37:35 -04:00
Eric Banks 8bbef79fc2 Create clipped alleles during allele parsing instead of creating a full VC, clipping alleles, and regenerating the VC from scratch. 2011-08-25 15:37:26 -04:00
Ryan Poplin c34b4b20aa Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-25 14:58:15 -04:00
Guillermo del Angel 6347317ba8 Too fast to commit last time 2011-08-25 14:58:10 -04:00
Mark DePristo 0f4be2c4a4 Argument to disable queueJobReport entirely
-- Minor improvements to RodPerformanceGoals
2011-08-25 13:32:03 -04:00
Guillermo del Angel 999dff1252 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-25 13:28:23 -04:00
Guillermo del Angel 1bf0fcd7a4 Obvious bug fix in HSP script: not enough to fix test yet, though 2011-08-25 13:27:55 -04:00
Eric Banks d81122cf5f Fixed up so that all modes can go through multiple iterations 2011-08-25 13:22:39 -04:00
Mark DePristo d65faf509c Default output name for Queue JobReport is queue_jobreport.gatkreport.txt 2011-08-25 13:15:20 -04:00
Mark DePristo a7d6946b22 Refactored QJobReport and QFunction, which is now automatically tracked
-- All QFunctions, including sg ones, are tracked
-- Removed memory information
2011-08-25 13:13:55 -04:00
Mark DePristo 08f6c3eea9 companion gantt charts for Q runtimes 2011-08-25 12:59:59 -04:00
Ryan Poplin e5ca61e371 Bug fix for multiallelic events in haplotype caller 2011-08-25 11:09:39 -04:00
Ryan Poplin 5e0660e27d Adding debug option to output all possible haplotypes in fasta format for use with bwa 2011-08-25 10:31:10 -04:00
Mauricio Carneiro 16caca0822 BLASR BAMs and new BWA parameters
*Added the functions to turn a BLASR generated BAM file into a usable BAM file.
*Modified the bwa parameters according to test results from NA12878 pb2k dataset.
2011-08-24 17:04:07 -04:00
Mauricio Carneiro e3f5d7067a Added ReorderSam queue binding 2011-08-24 17:03:11 -04:00
Mark DePristo 08fb21f127 Removing hostname 2011-08-24 16:45:50 -04:00
Mauricio Carneiro d50474f14c Merged bug fix from Stable into Unstable
resolved conflicts by maintaining all the changes in UNSTABLE where this bug had already been fixed.

Conflicts:
	public/scala/qscript/org/broadinstitute/sting/queue/qscripts/DataProcessingPipeline.scala
2011-08-24 16:06:36 -04:00
Mauricio Carneiro dc8398e165 fixing bai output for indel cleaning. 2011-08-24 15:58:34 -04:00
Ryan Poplin da5e6b52e7 Refactoring the smith-waterman and genotyping-related pieces into their own engine 2011-08-24 15:54:17 -04:00
Mark DePristo 06e30a81d1 Fixes throughout for getting job information
-- no more hostname -- it's just not going to be important
2011-08-24 15:30:09 -04:00
Ryan Poplin 29c7b10f7b Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-24 15:18:58 -04:00
Ryan Poplin e5008aba00 Output the top two haplotypes as a variant call by running smith-waterman alignment against the reference and calling any difference as variation. This is the first version that runs end-to-end by taking in reads as a bam file and writing out variant calls in VCF. 2011-08-24 15:18:44 -04:00
Mark DePristo 4918519a58 No more NPE in getRuntime() when you ctrl-c out of Queue 2011-08-24 14:14:01 -04:00
Mark DePristo 16d8360592 QJobReport is now the official capability name 2011-08-24 13:59:14 -04:00
Mark DePristo d047c19ad1 Writes output to file 2011-08-24 13:52:05 -04:00
Guillermo del Angel 61f5968807 Change indel/snp selection to new arguments 2011-08-24 13:42:23 -04:00
Mark DePristo 3ae68e2397 JobLogging trait now writes out GATKReport log of jobs 2011-08-24 13:36:39 -04:00
Guillermo del Angel e618cb1e79 a) Renamed/expanded SelectVariants arguments that choose particular kinds of variants and particular allelic types, now instead of -Indels or -SNPs we can specify for example -selectType [MIXED|INDEL|SNP|MNP|SYMBOLIC]. To select biallelic, multiallelic variants, use -restrictAllelesTo [BIALLELIC|MULTIALLELIC]. Corresponding gatkdocs changes.
b) More useful AC,AF logging in VariantsToTable with multiallelic sites: instead of logging comma-separated values, log max value by default. Hidden, experimental argument -logACSum to log sum of ACs instead. This is due to extreme slowness of R in parsing strings to tokens and computing max/sum itself (~100x slower than gatk).
c) Added integrationtest for new SelectVariants commands
2011-08-24 12:25:50 -04:00
Mauricio Carneiro cd12f7f286 Fixed list dependency
Instead of creating a bam list file, I dynamically create a scala list and pass as parameters. This way the intermediate bam files don't get deleted before they should.
2011-08-24 11:12:46 -04:00