gatk-3.8

Commit Graph

Author	SHA1	Message	Date
Eric Banks	d981fd01b8	Now that we don't generate dict and fai files, the resource script needs to copy them to the bundle.	2013-05-02 15:18:13 -04:00
Eric Banks	6d0e383a60	Fixing the bundle script 1. someone out there busted it when adding high confidence 1000G calls 2. new path to NA12878 bam 3. updated clashing version argument	2013-05-02 09:40:36 -04:00
Ryan Poplin	80131ac996	Adding the 1000G_phase1.snps.high_confidence callset to the GATK resource bundle for use in the April 2013 updated best practices.	2013-04-24 11:41:32 -04:00
Geraldine Van der Auwera	f972963918	Fixed issues raised by Appistry QA (mostly small fixes, corrections & clarifications to GATKDocs) GATK-73 updated docs for bqsr args GATK-9 differentiate CountRODs from CountRODsByRef GATK-76 generate GATKDoc for CatVariants GATK-4 made resource arg required GATK-10 added -o, some docs to CountMales; some docs to CountLoci GATK-11 fixed by MC's -o change; straightened out the docs. GATK-77 fixed references to wiki GATK-76 Added Ami's doc block GATK-14 Added note that these annotations can only be used with VariantAnnotator GATK-15 specified required=false for two arguments GATK-23 Added documentation block GATK-33 Added documentation GATK-34 Added documentation GATK-32 Corrected arg name and docstring in DiffObjects GATK-32 Added note to DO doc about reference (required but unused) GATK-29 Added doc block to CountIntervals GATK-31 Added @Output PrintStream to enable -o GATK-35 Touched up docs GATK-36 Touched up docs, specified verbosity is optional GATK-60 Corrected GContent annot module location in gatkdocs GATK-68 touched up docs and arg docstrings GATK-16 Added note of caution about calling RODRequiringAnnotations as a group GATK-61 Added run requirements (num samples, min genotype quality) Tweaked template and generic doc block formatting (h2 to h3 titles) GATK-62 Added a caveat to HR annot Made experimental annotation hidden GATK-75 Added setup info regarding BWA GATK-22 Clarified some argument requirements GATK-48 Clarified -G doc comments GATK-67 Added arg requirement GATK-58 Added annotation and usage docs GSATDG-96 Corrected doc Updated MD5 for DiffObjectsIntegrationTests (only change is link in table title)	2013-03-12 10:57:14 -04:00
David Roazen	2a7af43164	Fix improper dependencies in QScripts used by pipeline tests, and attempt to fix the flawed MisencodedBaseQualityUnitTest -Some QScripts used by public pipeline tests unnecessarily used the (now protected) UnifiedGenotyper. Changed them to use PrintReads instead. -Moved ExampleUnifiedGenotyperPipelineTest to protected -Attempt to fix the flawed and sporadically failing MisencodedBaseQualityUnitTest: After looking at this class a bit, I think the problem was the use of global arrays for the quals shared across all reads in all tests (BAMRecord class definitely does not make a separate copy for each read!). One test (testFixBadQuals) modifies the bad quals array, and if this happens to run before the testBadQualsThrowsError test the bad quals array will have been "fixed" and no exception will be thrown.	2013-02-27 04:45:53 -05:00
Mauricio Carneiro	bc64d4240f	Licensing update -- batch #2 - caught all scala files that didn't have proper package information / class names - included all source files in archive as well GSATDG-5	2013-01-11 13:38:11 -05:00
Mauricio Carneiro	28235f57f2	Adding package information to scala scripts that were missing it. Including archived ones. GSATDG-5	2013-01-11 13:38:05 -05:00
Mauricio Carneiro	e5913e50b2	Updating licenses for all scala files GSATDG-5	2013-01-10 17:46:10 -05:00
Mauricio Carneiro	d3e2352072	Moved processing pipelines to private These pipelines were supposed to serve as an example for the community, they were written a long-long-long time ago and are being used today by users as the 'best practice pipeline'. Unless we decide we want to support and maintain an example best-practices pipeline, I'm moving these to private.	2013-01-07 14:49:57 -05:00
Eric Banks	18728ec5bd	Updates to the bundle script: 1. Add the symbolic 'current' link for the new bundle dir 2. Don't gzip and copy .out files 3. Don't call chr20 SNPs on the example BAM because it's now just a few reads on chr1	2012-12-18 11:16:42 -05:00
Menachem Fromer	a8c7edca05	Fixed fragment handling in DepthOfCoverage	2012-11-21 16:01:10 -05:00
Menachem Fromer	c8be7c3102	Keep SNPs and indels separately for batch merging; Add options to DepthOfCoverage to count fragments (to not double-count overlapping reads of same fragment); DepthOfCoverage should now support ReducedReads; Replace recusrion with loop in DoC/package.scala (for lists longer than 5000 elements)	2012-11-21 15:56:53 -05:00
Menachem Fromer	9111966261	Merge branch 'master' of github.com:broadinstitute/gsa-unstable	2012-11-20 12:19:58 -05:00
Eric Banks	843384e435	Rename hg19 files in bundle to b37 since that's what they are	2012-11-14 11:47:09 -05:00
Eric Banks	eccb76c304	Only run UG in the bundle for chr20	2012-10-30 15:09:46 -04:00
Eric Banks	8a402024c2	Updating bundle script to handle new naming convention of CEU trio best practices callset	2012-10-30 09:11:56 -04:00
Menachem Fromer	9af4b34fd8	Changed @Input to @Argument for non-File types	2012-10-26 01:21:05 -04:00
Menachem Fromer	0fe36b1c72	Merge branch 'master' of ssh://gsa3.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2012-10-25 16:18:57 -04:00
Menachem Fromer	cde4f037d3	Begin moving XHMM scripts to public	2012-10-25 16:18:25 -04:00
Ami Levy Moonshine	dde3060bb8	add the CEUtrio best practices results (UG + PBT) to the bundle	2012-10-25 15:36:17 -04:00
Khalid Shakir	2ef456d51a	Added explicit @ClassType annotations to @Argument for Option[Int] or Option[Double] since scala seems to change the reflected type to Option[Object] on some systems. Changed ReflectionUtils.getGenericTypes' order of looking for @ClassType since the primitive generic wasn't completely erased, only changed to Object which is incorrect. More fixes to @Arguments labeled as java.io.File via incorrect @Input annotation. Put in a default undocumented implementation of @Argument doc() to match the one added to @Input.	2012-10-19 13:20:29 -04:00
Khalid Shakir	403654d40a	Fixed null checkes in ArgumentTypeDescriptor due to ArgumentMatchValue updates. Fixed @Arguments such as scatter count that were labeled as java.io.File via incorrect @Input annotation.	2012-10-18 16:57:15 -04:00
Khalid Shakir	f66284658d	RetryMemoryLimit now works with Scatter/Gather.	2012-10-09 21:51:03 -04:00
Eric Banks	277ba94c7b	Update from dbsnp135 to dbsnp137.	2012-08-31 14:06:29 -04:00
Eric Banks	5ea7cd6dcc	Updating resource bundle: no reason to include both genotype and sites files for Omni and HM3, sites are enough. Also, don't include duplicate entry for the Mills indels.	2012-08-31 14:01:54 -04:00
Khalid Shakir	22b4466cf5	Added setupRetry() to modify jobs when Queue is run with '-retry' and jobs are about to restart after an error. Implemented a mixin called "RetryMemoryLimit" which will by default double the memory. GridEngine memory request parameter can be selected on the command line via '-resMemReqParam mem_free' or '-resMemReqParam virtual_free'. Java optimizations now enabled by default: - Only 4 GC threads instead of each job using java's default O(number of cores) GC threads. Previously on a machine with N cores if you have N jobs running and java allocates N GC threads by default, then the machines are using up to N^2 threads if all jobs are in heavy GC (thanks elauzier). - Exit if GC spends more than 50% of time in GC (thanks ktibbett). - Exit if GC reclaims lest than 10% of max heap (thanks ktibbett). Added a -noGCOpt command line option to disable new java optimizations.	2012-08-13 15:43:05 -04:00
Eric Banks	7cf4b63d76	Disabling indel quals in BaseRecalibrator as it should be, not PrintReads.	2012-08-01 09:23:04 -04:00
Eric Banks	675ccab2fa	Renaming BQSR to BaseRecalibrator	2012-07-23 10:17:17 -04:00
Eric Banks	863eb5b5c0	Use Context not Dinuc covariate	2012-07-17 15:18:11 -04:00
Eric Banks	17d627b86d	Update the DPP and PBPP to use the BQSRv2 walkers	2012-07-17 13:15:32 -04:00
Mauricio Carneiro	9346c5b37a	Merged bug fix from Stable into Unstable	2012-06-26 14:55:41 -04:00
Mauricio Carneiro	334d66f2b1	Updating validation parameter in the DPP users were very confused with the failing validation of their 'unpicarded' bam files. Changed the default to OFF and added an option to turn it on.	2012-06-26 14:54:37 -04:00
Ryan Poplin	c3fb321014	Minor updates to pacbio data processing script to make it work with the latest bwa version/settings.	2012-05-22 10:24:45 -04:00
Khalid Shakir	91cb654791	AggregateMetrics: - By porting from jython to java now accessible to Queue via automatic extension generation. - Better handling for problematic sample names by using PicardAggregationUtils. GATKReportTable looks up keys using arrays instead of dot-separated strings, which is useful when a sample has a period in the name. CombineVariants has option to suppress the header with the command line, which is now invoked during VCF gathering. Added SelectHeaders walker for filtering headers for dbGAP submission. Generated command line for read filters now correctly prefixes the argument name as --read_filter instead of -read_filter. Latest WholeGenomePipeline. Other minor cleanup to utility methods.	2012-04-17 11:45:32 -04:00
Eric Banks	ed69f4ff7c	Merge branch 'master' of ssh://gsa1.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2012-03-13 09:28:16 -04:00
Eric Banks	9b9856ead5	quick todo for next time we make a bundle	2012-03-13 09:28:11 -04:00
Eric Banks	6e9b8559d8	Unfortunately need to bump up memory needed for liftover to get Omni file sorted	2012-03-12 23:20:00 -04:00
Eric Banks	359090c4b7	Updating dbsnp to v135	2012-03-12 13:17:58 -04:00
Eric Banks	7e9a535c4d	Updated the bundle to use the official filtered (final) indel calls	2012-03-12 12:12:24 -04:00
Christopher Hartl	2c1b14d35e	Mostly small changes to my own scala scripts: .vcf.gz compatibility for output files, smarter beagle generation, simple script to scatter-gather combine variants. Whole genome indel calling now uses the gold standard indel set.	2012-02-22 17:20:04 -05:00
Christopher Hartl	974c2499cc	Bugfixed to script.	2012-02-02 12:55:54 -05:00
Christopher Hartl	27ea6426a4	Small script to chunk up a VCF into equal-sized chunks	2012-02-02 12:29:03 -05:00
Christopher Hartl	0c562756eb	Add a memory limit so this thing doesn't get killed on the farm	2012-02-02 10:30:09 -05:00
Christopher Hartl	45bf2562cc	.	2012-02-02 09:11:17 -05:00
Christopher Hartl	f8c5406084	Add the ability to extract samples	2012-02-02 09:06:39 -05:00
Christopher Hartl	b567ed8793	Use the right reference path :(	2012-02-01 12:35:18 -05:00
Christopher Hartl	87a63d54d6	fix the script!	2012-02-01 12:05:29 -05:00
Christopher Hartl	810996cfca	Introducing: VariantsToPed, the world's most annoying walker! And also a busted QScript to run it that I need Khalid's help debugging ( frownie face ). Note that VariantsToPed and PlinkSeq generate the same binary file (up to strand flips...thanks PlinkSeq), so I know it's working properly. Hooray!	2012-02-01 10:39:03 -05:00
Mauricio Carneiro	052a4bdb9c	Turning off PHONE HOME option in the MDCP * MDCP is for internal use and there is no need to report to the Amazon cloud. * Reporting to ASW_S3 is not allowing jobs to finish, this is probably a bug.	2012-01-27 11:13:30 -05:00
Mauricio Carneiro	97499529c7	another small bug with the file extension.	2012-01-24 16:14:35 -05:00

1 2 3

144 Commits (fdfe4e41d5d8c92fad74f56e654992f3a97ab602)