gatk-3.8

Commit Graph

Author	SHA1	Message	Date
Mauricio Carneiro	7f9000382e	Making indel calls default in the MDCP You can turn off indel calling by using -noIndels.	2011-09-09 14:09:26 -04:00
Khalid Shakir	510d5e7730	Merged bug fix from Stable into Unstable	2011-09-09 01:34:55 -04:00
Khalid Shakir	367bbee25a	Fixed typo when printing the contents or last N lines of a file. Thanks to larryns.	2011-09-09 01:33:25 -04:00
Mauricio Carneiro	ee9d599558	Just cleaning up clean up old commented code from tha data processing pipeline.	2011-09-07 13:32:40 -04:00
Mauricio Carneiro	28d782b4c7	Allowing multiple dnsnp and indel files in the DPP	2011-09-02 13:38:47 -04:00
Mauricio Carneiro	ad4ea0b80b	Merged bug fix from Stable into Unstable	2011-09-01 18:14:45 -04:00
Mauricio Carneiro	e253f6f05d	Fixing typo in DPP platform and library were exchanged when rebuilding the read group information	2011-09-01 18:13:52 -04:00
Mauricio Carneiro	d2a33beff7	Added WGS/WEX b37-decoy CEU trio datasets	2011-09-01 13:14:40 -04:00
Mark DePristo	61633c95a8	Default jobreport is now jobPrefix, so you see logs like Q-2508.jobreport.txt	2011-08-28 19:19:45 -04:00
Mark DePristo	b38de1fa35	Now captures the exechost in the job report -- Works for in process, shell, and LSF runners -- Cleanup of debugging output	2011-08-28 12:05:56 -04:00
Mark DePristo	e37a638e09	Fix for disallowed characters in GATKReportTable -- Illegal characters are automatically replaced with _	2011-08-26 13:24:06 -04:00
Mark DePristo	0cb1605df0	Clean documentation for JobRunInfo	2011-08-26 09:22:58 -04:00
Mark DePristo	415d5d5301	LSF long times are in seconds, convert to milliseconds to meet standard	2011-08-26 09:18:28 -04:00
Mark DePristo	eef1ac415a	Merge branch 'master' into rodTesting Conflicts: public/java/src/org/broadinstitute/sting/gatk/walkers/variantutils/VariantsToTable.java	2011-08-26 00:35:41 -04:00
Mark DePristo	e03dfdb0ab	Automatic iteration field addition works properly.	2011-08-25 16:59:02 -04:00
Mark DePristo	e01273ca7c	Queue now writes out queueJobReport.pdf -- General purpose RScript executor in java (please use when invoking RScripts) -- Removed groupName. This is now analysisName -- Explicitly added capability to enable/disable individual QFunction	2011-08-25 16:57:11 -04:00
Mark DePristo	0f4be2c4a4	Argument to disable queueJobReport entirely -- Minor improvements to RodPerformanceGoals	2011-08-25 13:32:03 -04:00
Mark DePristo	d65faf509c	Default output name for Queue JobReport is queue_jobreport.gatkreport.txt	2011-08-25 13:15:20 -04:00
Mark DePristo	a7d6946b22	Refactored QJobReport and QFunction, which is now automatically tracked -- All QFunctions, including sg ones, are tracked -- Removed memory information	2011-08-25 13:13:55 -04:00
Mauricio Carneiro	16caca0822	BLASR BAMs and new BWA parameters Added the functions to turn a BLASR generated BAM file into a usable BAM file. Modified the bwa parameters according to test results from NA12878 pb2k dataset.	2011-08-24 17:04:07 -04:00
Mauricio Carneiro	e3f5d7067a	Added ReorderSam queue binding	2011-08-24 17:03:11 -04:00
Mark DePristo	08fb21f127	Removing hostname	2011-08-24 16:45:50 -04:00
Mauricio Carneiro	dc8398e165	fixing bai output for indel cleaning.	2011-08-24 15:58:34 -04:00
Mark DePristo	06e30a81d1	Fixes throughout for getting job information -- no more hostname -- it's just not going to be important	2011-08-24 15:30:09 -04:00
Mark DePristo	4918519a58	No more NPE in getRuntime() when you cntr-c out of Queue	2011-08-24 14:14:01 -04:00
Mark DePristo	16d8360592	QJobReport is now the official capability name	2011-08-24 13:59:14 -04:00
Mark DePristo	d047c19ad1	Writes output to file	2011-08-24 13:52:05 -04:00
Mark DePristo	3ae68e2397	JobLogging trait now writes out GATKReport log of jobs	2011-08-24 13:36:39 -04:00
Mauricio Carneiro	cd12f7f286	Fixed list dependency Instead of creating a bam list file, I dynamically create a scala list and pass as parameters. This way the intermediate bam files don't get deleted before they should.	2011-08-24 11:12:46 -04:00
Mauricio Carneiro	219252a566	Adapting to the new RodBinding framework	2011-08-24 11:12:46 -04:00
Mark DePristo	b8bc03bb42	JobRunInfo improvements -- dry-run now adds some info, for testing -- InProcessRunner adds some, but not all, of the information we want	2011-08-23 17:11:22 -04:00
Mark DePristo	31ec6e316c	First implementation of JobRunInfo -- onExecutionDone(Map(QFunction, JobRunInfo)) is the new signature, so that you can walk over your jobs and inspect their success/failure and runtime characteristics	2011-08-23 16:51:54 -04:00
Mark DePristo	a9ba945595	onExecutionDone(jobs, successFlag) added to QScript. -- This function is called when the Qscript ends, so scripts can overload this function if they want to run some code after all of the jobs have completed	2011-08-23 10:09:51 -04:00
Mauricio Carneiro	136f0eb685	Creating sample-bam list instead of joining This should save us at least one day in the trio decoy processing.	2011-08-22 18:03:39 -04:00
Mauricio Carneiro	04d8bcaf19	Fixed bai removal on picard tools BAM index files were not being deleted because picard replaces the name of the file with bai instead of appending to it.	2011-08-22 18:03:39 -04:00
Mauricio Carneiro	8aed151a71	Created RevertSam queue class Class for the picard tool RevertSam with all the options for queue scripts.	2011-08-22 18:03:39 -04:00
Mauricio Carneiro	caebc88e9a	Consensus mode and new RodBinding framework. The DPP was not using the parameter correctly. It didn't matter for the default option (which is the only one we have been testing) but it would not work for knowns only or smith waterman. It is fixed now. It now complies with the new rod binding framework.	2011-08-22 18:03:39 -04:00
Khalid Shakir	c4c90c8826	Updates to JobRunners from the Queue developer community and from running the WholeGenomePipeline: - Ability to pass a different resident memory reservation and limits. Useful for large pileups of low pass genome data that sometimes need high -Xmx6g but usually don't exceed 2-3g in actual heap size. - Fixed jobPriority to work for all job runners. Now must be a integer between 0 and 100- even for GridEngine- and will be mapped to the correct values. - Passing parallel environment and job resource requests to LSF and GridEngine. Useful for passing tokens like iodine_io=1 and -pe pe_slots 8 - Refactored GridEngine JobRunner to also provide basic support for other job dispatchers with DRMAA implementations such as Torque/PBS. Should work for basic running but advanced users must pass their own jobNativeArgs from the command line or in customized QScripts until someone maps properties like jobQueue, jobPriority, residentRequest, etc. into a Torque/PBS/etc. dispatcher.	2011-08-22 15:13:27 -04:00
Ryan Poplin	f93a554b01	updating exome specific parameters in MDCP	2011-08-21 10:25:36 -04:00
Ryan Poplin	b008676878	fixing the previous fix	2011-08-20 21:21:55 -04:00
Ryan Poplin	539e157ecd	Fixing misc parameters in MDCP. The pipeline now does VariantEval of output by default. Fix for NaN vqslod values in VQSR	2011-08-20 11:28:48 -04:00
Ryan Poplin	ddb5045e14	Updating the methods development calling pipeline for the new rod binding syntax and the new best practices.	2011-08-19 19:29:51 -04:00
Mauricio Carneiro	46051c36c6	Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-08-10 16:57:34 -04:00
Mauricio Carneiro	b0ff5b1ff7	a better name for the pacbio processing pipeline	2011-08-10 16:16:53 -04:00
Mark DePristo	9e53fd6880	Fixed VCFGatherFunction to not provide incorrect rod_priority_list -- simply don't provide one, since you are just 'cating' the files together and genotypes never overlap	2011-08-10 07:28:35 -04:00
Mauricio Carneiro	481630da00	BWA parameters added	2011-08-09 17:05:24 -04:00
Mauricio Carneiro	22d2563823	added BWA SW alignment The pipeline now accepts fasta/fastq files and aligns them using BWA SW, adds default basequalities, creates read groups and performs BQSR.	2011-08-09 17:05:24 -04:00
Mauricio Carneiro	bd1cf4c7bc	Pacbio Pipeline Added the base quality "filling" step to allow the pipeline to handle raw pacbio BAM files. This is the first step towards a generic pacbio data processing pipeline.	2011-08-09 17:05:24 -04:00
Eric Banks	5a3c99b7b9	Fixing 'variants' change in qscript	2011-08-09 12:30:46 -04:00
Khalid Shakir	cb28875c2a	Updated rod binding syntax usage on CombineVariants from .rodBind to .variants.	2011-08-09 00:46:39 -04:00

1 2 3

121 Commits (b399424a9cd4c842e8dac0e2e0f9c17ba4002ff4)