Commit Graph

7214 Commits (beb947d3cc979a7bf46e41b33456bce9b9d786ef)

Author SHA1 Message Date
David Roazen beb947d3cc Standalone program to create an XML index of the GSA pipeline directory suitable for loading in IGV.
This is a replacement for an ancient Perl script that will soon be retired.
2011-08-26 14:48:38 -04:00
Mark DePristo 5a6ae954bf Added VCF streaming to tribbleVsGATK 2011-08-26 14:04:48 -04:00
Mark DePristo bd92a1b220 Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-26 13:24:26 -04:00
Mark DePristo e37a638e09 Fix for disallowed characters in GATKReportTable
-- Illegal characters are automatically replaced with _
2011-08-26 13:24:06 -04:00
Ryan Poplin 6e66f1c243 Removing code that cleans up the assembly graph for purposes of display. There seems to be bugs 2011-08-26 12:36:12 -04:00
Ryan Poplin 8f150c6764 Assembly debug mode now uses smith-waterman to locally align all haplotypes and outputs to a bam file instead of first writing a fasta file and using bwa-sw outside of the GATK. 2011-08-26 10:43:40 -04:00
Ryan Poplin 8f90a22555 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-26 10:16:56 -04:00
Mark DePristo 0cb1605df0 Clean documentation for JobRunInfo 2011-08-26 09:22:58 -04:00
Mark DePristo 415d5d5301 LSF long times are in seconds, convert to milliseconds to meet standard 2011-08-26 09:18:28 -04:00
Mark DePristo c11ddf3da3 Enchancements to queueJobReport
-- Runtimes are in seconds, not milliseconds
-- Job names are shown in the gantt chart
-- axis names are clearer
-- documentation for all of the function
2011-08-26 09:17:09 -04:00
Mark DePristo c0503283df Spelling fix requires md5 updates 2011-08-26 07:40:44 -04:00
Mark DePristo eef1ac415a Merge branch 'master' into rodTesting
Conflicts:
	public/java/src/org/broadinstitute/sting/gatk/walkers/variantutils/VariantsToTable.java
2011-08-26 00:35:41 -04:00
Mark DePristo 6bd3959ff2 Now shows job scheduling over time 2011-08-26 00:19:50 -04:00
Mark DePristo 34fb6ef798 SitesVsGenotypes should only be on chr1 2011-08-26 00:18:09 -04:00
Eric Banks c0e55c5ca4 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-25 22:42:27 -04:00
Eric Banks 9b7512fd94 Just because there's a ref base doesn't mean the VC needs to be padded 2011-08-25 22:42:14 -04:00
Menachem Fromer 6d8851eb63 Added option to control memory usage of DoC merging step 2011-08-25 22:12:54 -04:00
Mark DePristo 03a655023c Must explicitly enable multithreaded tests
-- Avoids sending nt 8 jobs to the unsuspecting farm
2011-08-25 19:41:08 -04:00
Mark DePristo 1a50a251d8 Draft version of report 2011-08-25 17:03:50 -04:00
Mark DePristo e03dfdb0ab Automatic iteration field addition works properly. 2011-08-25 16:59:02 -04:00
Mark DePristo e01273ca7c Queue now writes out queueJobReport.pdf
-- General purpose RScript executor in java (please use when invoking RScripts)
-- Removed groupName.  This is now analysisName
-- Explicitly added capability to enable/disable individual QFunction
2011-08-25 16:57:11 -04:00
Eric Banks 09a729da3a Removing incorrect comment 2011-08-25 15:42:52 -04:00
Eric Banks 3eed5af2f7 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-25 15:37:35 -04:00
Eric Banks 8bbef79fc2 Create clipped alleles during allele parsing instead of creating a full VC, clipping alleles, and regenerating the VC from scratch. 2011-08-25 15:37:26 -04:00
Ryan Poplin c34b4b20aa Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-25 14:58:15 -04:00
Guillermo del Angel 6347317ba8 Too fast to commit last time 2011-08-25 14:58:10 -04:00
Mark DePristo 0f4be2c4a4 Argument to disable queueJobReport entirely
-- Minor improvements to RodPerformanceGoals
2011-08-25 13:32:03 -04:00
Guillermo del Angel 999dff1252 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-25 13:28:23 -04:00
Guillermo del Angel 1bf0fcd7a4 Obvious bug fix in HSP script: not enough to fix test yet, though 2011-08-25 13:27:55 -04:00
Eric Banks d81122cf5f Fixed up so that all modes can go through multiple iterations 2011-08-25 13:22:39 -04:00
Mark DePristo d65faf509c Default output name for Queue JobReport is queue_jobreport.gatkreport.txt 2011-08-25 13:15:20 -04:00
Mark DePristo a7d6946b22 Refactored QJobReport and QFunction, which is now automatically tracked
-- All QFunctions, including sg ones, are tracked
-- Removed memory information
2011-08-25 13:13:55 -04:00
Mark DePristo 08f6c3eea9 companion gantt charts for Q runtimes 2011-08-25 12:59:59 -04:00
Ryan Poplin e5ca61e371 Bug fix for multiallelic events in haplotype caller 2011-08-25 11:09:39 -04:00
Ryan Poplin 5e0660e27d Adding debug option to output all possible haplotypes in fasta format for use with bwa 2011-08-25 10:31:10 -04:00
Mauricio Carneiro 16caca0822 BLASR BAMs and new BWA parameters
*Added the functions to turn a BLASR generated BAM file into a usable BAM file.
*Modified the bwa parameters according to test results from NA12878 pb2k dataset.
2011-08-24 17:04:07 -04:00
Mauricio Carneiro e3f5d7067a Added ReorderSam queue binding 2011-08-24 17:03:11 -04:00
Mark DePristo 08fb21f127 Removing hostname 2011-08-24 16:45:50 -04:00
Mauricio Carneiro d50474f14c Merged bug fix from Stable into Unstable
resolved conflicts by maintaining all the changes in UNSTABLE where this bug had already been fixed.

Conflicts:
	public/scala/qscript/org/broadinstitute/sting/queue/qscripts/DataProcessingPipeline.scala
2011-08-24 16:06:36 -04:00
Mauricio Carneiro dc8398e165 fixing bai output for indel cleaning. 2011-08-24 15:58:34 -04:00
Ryan Poplin da5e6b52e7 Refactoring the smith-waterman and genotyping-related pieces into their own engine 2011-08-24 15:54:17 -04:00
Mark DePristo 06e30a81d1 Fixes throughout for getting job information
-- no more hostname -- it's just not going to be important
2011-08-24 15:30:09 -04:00
Ryan Poplin 29c7b10f7b Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-24 15:18:58 -04:00
Ryan Poplin e5008aba00 Output the top two haplotypes as a variant call by running smith-waterman alignment against the reference and calling any difference as variation. This is the first verion that runs end-to-end by taking in reads as bam file and writing out variant calls in VCF. 2011-08-24 15:18:44 -04:00
Mark DePristo 4918519a58 No more NPE in getRuntime() when you cntr-c out of Queue 2011-08-24 14:14:01 -04:00
Mark DePristo 16d8360592 QJobReport is now the official capability name 2011-08-24 13:59:14 -04:00
Mark DePristo d047c19ad1 Writes output to file 2011-08-24 13:52:05 -04:00
Guillermo del Angel 61f5968807 Change indel/snp selection to new arguments 2011-08-24 13:42:23 -04:00
Mark DePristo 3ae68e2397 JobLogging trait now writes out GATKReport log of jobs 2011-08-24 13:36:39 -04:00
Guillermo del Angel e618cb1e79 a) Renamed/expanded SelectVariants arguments that choose particular kinds of variants and particular allelic types, now instead of -Indels or -SNPs we can specify for example -selectType [MIXED|INDEL|SNP|MNP|SYMBOLIC]. To select biallelic, multiallelic variants, use -restrictAllelesTo [BIALLELIC|MULTIALLELIC]. Corresponding gatkdocs changes.
b) More useful AC,AF logging in VariantsToTable with multiallelic sites: instead of logging comma-separated values, log max value by default. Hidden, experimental argument -logACSum to log sum of ACs instead. This is due to extreme slowness of R in parsing strings to tokens and computing max/sum itself (~100x slower than gatk).
c) Added integrationtest for new SelectVariants commands
2011-08-24 12:25:50 -04:00