gatk-3.8

Commit Graph

Author	SHA1	Message	Date
Mark DePristo	90cc17ee2a	Merged bug fix from Stable into Unstable Conflicts: private/shell/runGATKReport.csh	2012-01-06 18:14:51 -05:00
Mark DePristo	63b7a70c44	Removing very costly analyses of all GATK versions. Will be replaced by Tableau website	2012-01-06 18:13:19 -05:00
Mauricio Carneiro	1f88a1bfe2	Small fix to RRead script * fixing the downsample strategy variable	2012-01-06 17:25:04 -05:00
Mauricio Carneiro	f6a18aea63	Updated MDCP with INDEL best practices * chose 90.0 indel cut target for most datasets (this is arbitrary).	2012-01-06 17:21:59 -05:00
Mark DePristo	65c614fb4b	Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable	2012-01-06 16:38:26 -05:00
Mark DePristo	d9da37f9b4	Added SQL table creation and log loading to analyzeRunReports -- You can create (and drop the old) GATK_LOG table with the setupDB command -- You can load data into the database with the loadToDB command Currently I'm pushing up all of the GATK logs into the new MySQL server setup for the gsa group. Details of the server are in the code, for those interested. All of this is part of my experimentation with Tableau for visualizing GATK run logs.	2012-01-06 16:35:53 -05:00
Guillermo del Angel	d4e7655d14	Added ability to call multiallelic indels, if -multiallelic is included in UG arguments. Simple idea: we genotype all alleles with count >= minIndelCnt. To support this, refactored code that computes consensus alleles. To ease merging of mulitple alt alleles, we create a single vc for each alt alleles and then use VariantContextUtils.simpleMerge to carry out merging, which takes care of handling all corner conditions already. In order to use this, interface to GenotypeLikelihoodsCalculationModel changed to pass in a GenomeLocParser object (why are these objects to hard to handle??). More testing is required and feature turned off my default.	2012-01-06 11:24:38 -05:00
Mauricio Carneiro	43224ef364	Turning the Adaptive Downsampler on with 100 by default	2012-01-05 23:47:27 -05:00
Mark DePristo	dd80ffbbbe	Merged bug fix from Stable into Unstable	2012-01-05 21:51:48 -05:00
Mark DePristo	c96fee477c	Bug fix for VariantSummary -- Call sets with indels > 50 bp in length are tagged as CNVs in the tag (following the 1000 Genomes convention) and were unconditionally checking whether the CNV is already known, by looking at the known cnvs file, which is optional. Fixed. Has the annoying side effect that indels > 50bp in size are not counted as indels, and so are substrated from both the novel and known counts for indels. C'est la vie -- Added integration test to check for this case, using Mauricio's most recent VCF file for NA12878 which has many large indels. Using this more recent and representative file probably a good idea for more future tests in VE and other tools. File is NA12878.HiSeq.WGS.b37_decoy.indel.recalibrated.vcf in Validation_Data	2012-01-05 21:51:06 -05:00
Eric Banks	f5e10e9879	Merged bug fix from Stable into Unstable	2012-01-05 15:35:09 -05:00
Eric Banks	18ed954741	Compute Ti/Tv only if bi-allelic	2012-01-05 15:33:26 -05:00
Ryan Poplin	a6886a4cc0	Initial commit of the Active Region Traversal. Not ready to be used by anyone yet.	2012-01-04 17:03:21 -05:00
Guillermo del Angel	58d4539304	Enabled banded indel computation by default. Reversed logic in input UG argument so that we can still disable it if required. Minor changes to integration tests due to minor differences in GL's and in annotations	2012-01-04 15:28:26 -05:00
Christopher Hartl	5cdde168af	Switch from using BWA to direct edit distance inspection. Seems to work quite well.	2012-01-04 14:25:43 -05:00
Christopher Hartl	310c05bd09	Merge branch 'master' of ssh://chartl@tin.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2012-01-04 01:12:23 -05:00
Christopher Hartl	31ecc38db8	Initial implementation of a walker for redesigning low or high GC baits using a fairly textbook genetic algorithm.	2012-01-04 01:10:28 -05:00
David Roazen	fe67276e1e	Merged bug fix from Stable into Unstable	2012-01-04 00:54:02 -05:00
Khalid Shakir	253a07fdb1	Implicits conversion issue/bug: QScript String<==>File shortcuts at compile time do not make String.equals(File) at runtime.	2012-01-03 18:43:45 -05:00
Mauricio Carneiro	9ff8a01da2	Merged bug fix from Stable into Unstable	2012-01-03 18:10:39 -05:00
Mauricio Carneiro	9b55505c03	Fixing PairHMMIndelErrorModel array out of bounds This error was due to the ReadClipper change of contract. Before the read utils would return null if a read was entirely clipped, now it returns an empty (safe) GATKSAMRecord.	2012-01-03 18:08:46 -05:00
Christopher Hartl	2c3a9ce02f	Merge branch 'master' of ssh://tin.broadinstitute.org/humgen/gsa-scr1/chartl/dev/unstable	2012-01-03 17:25:56 -05:00
David Roazen	621ee2b613	Merged bug fix from Stable into Unstable	2012-01-03 16:56:49 -05:00
Christopher Hartl	9093de1132	Cleanup: remove code to calculate the MLE AC in the UGE.	2012-01-03 15:58:51 -05:00
Christopher Hartl	2d093828a4	Final changes to Junky (been frozen for a while, but uncommitted) and the qscript for it. A first cursory implementation of the trellis-based Exact AC-constrained genotyping algorithm in UGE. Nothing calls into it, so this should be entirely safe (and, no surprise, it passes UG integration tests).	2012-01-03 15:33:04 -05:00
David Roazen	ea6e718cb8	SnpEff 2.0.5 support. Re-enabled SnpEff in the HybridSelectionPipeline. For now, we recommend only running with the GRCh37.64 database.	2012-01-03 15:18:36 -05:00
Christopher Hartl	93e1417b6e	Update to the VSS GATK documentation.	2012-01-03 13:39:31 -05:00
David Roazen	4984ca5e31	Merged bug fix from Stable into Unstable	2012-01-03 11:03:30 -05:00
David Roazen	f3f01da1af	Enforce serial dependencies in RecalibrationWalkersIntegrationTest Some tests in this class were intermittently not being executed due to being randomly scheduled before tests whose results they depend on. Now the serial dependencies are enforced to avoid problematic orderings.	2012-01-03 10:42:41 -05:00
David Roazen	055364d786	Always use full, three-part version numbers. Previously, the initial release of a new GATK version had a version number with only one part (eg., "1.4"). This could potentially mislead people into thinking it's the most recent revision of a release, instead of the least recent. Now, initial releases will have full, three-part version numbers (eg., "1.4-0-g472fc94") like everything else.	2012-01-03 10:25:19 -05:00
Eric Banks	ab8d47d9a5	Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2012-01-03 09:38:49 -05:00
Mauricio Carneiro	ca669ae744	Optimizations to the CoverageByRG walker * outputs only the groups of read groups necessary, avoiding multiple pileup creations every call to map * now also counts the number of variants associated with a given ROD (dbSNP) exist in the interval * new column: interval size	2012-01-03 09:36:01 -05:00
Mauricio Carneiro	3d4bf273de	Added getPileupForReadGroups to ReadBackPileup * returns a pileup for all the read groups provided. * saves us from multiple calls to getPileup (which is very inefficient)	2012-01-03 09:35:11 -05:00
Roger Zurawicki	caa5da2fd2	Added parameter to combine RGs in CoverageByRG * -g takes a string of read groups separated by space " " * multiple -g creates multiple sum columns in the table Signed-off-by: Mauricio Carneiro <carneiro@broadinstitute.org>	2012-01-03 09:35:10 -05:00
Mauricio Carneiro	18f06ad913	Script to calculate gc content of intervals independently * necessary for baits because we don't want the overlapping intervals to be merged by the GATK engine	2012-01-03 09:35:10 -05:00
Mauricio Carneiro	0bdeda6f3f	Added single sample option for the ReduceReads calling script	2012-01-03 09:29:47 -05:00
Mauricio Carneiro	4a208c7c06	Refactor of the downsampling machinery to accept different strategies * Implemented Adaptive downsampler * Added integration test * Added option to RRead scala script to choose downsampling strategy	2012-01-03 09:29:47 -05:00
Mauricio Carneiro	cce8511d29	Some WGS performance upgrades for ReduceReads * Do not try to hard clip to the interval when doing WGS * Do not even add reads that have been completely clipped out in WGS	2012-01-03 09:29:46 -05:00
Mauricio Carneiro	21ae3ef5f9	Added downsampling support to ReduceReads * Downsampling is now a parameter to the walker with default value of 0 (no downsampling) * Downsampling selects reads at random at the variant region window and strives to achieve uniform coverage if possible around the desired downsampling value. * Added integration test	2012-01-03 09:29:46 -05:00
Mauricio Carneiro	cd68cc239b	Added knuth-shuffle (KS) and randomSubset using KS to MathUtils * Knuth-shuffle is a simple, yet effective array permutator (hope this is good english). * added a simple randomSubset that returns a random subset without repeats of any given array with the same probability for every permutation. * added unit tests to both functions	2012-01-03 09:29:46 -05:00
Mauricio Carneiro	94791a2a75	Add support for reads starting with insertion * Modified cleanCigarShift to allow insertions in the beginning and end of the read * Allowed cigars starting/ending in insertions in the systematic ReadClipper tests * Updated all ReadClipper unit tests * ReduceReads does not hard clip leading insertions by default anymore * SlidingWindow adjusts start location if read starts with insertion * SlidingWindow creates an empty element with insertions to the right * Fixed all potential divide by zero with totalCount() (from BaseCounts) * Updated all Integration tests * Added new integration test for multiple interval reducing	2012-01-03 09:29:45 -05:00
Mark DePristo	3ecb9a0bf7	Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable	2012-01-02 13:56:55 -05:00
Mark DePristo	b3e613647a	GATKPerformanceOverTime bug fixes -- Don't try to do nt 16, it's just too painful as the threading doesn't work well and it consumes a large chunk of our available slots on gsa4 -- bugfix: only do multi-threaded test for each iteration, not expanding by subiterations, so we no longer try to do 3x3 nt 16 runs	2012-01-02 13:56:44 -05:00
Mark DePristo	188bd48139	runGATKReport only archives and shows errors for last days runs	2012-01-02 10:39:05 -05:00
Mark DePristo	d05f0c2318	GATKPerformanceOverTime script update -- Automatic detection of most recent version of GATK release (just tell the script now to use 1.2, 1.3, and 1.4) -- Uses 1.4 now -- By default we do 9 runs of each non-parallel test -- In PathUtils added convenience utility to find most recent release GATK jar with a specific release number	2012-01-02 09:58:46 -05:00
Mauricio Carneiro	a837970ea2	Merged bug fix from Stable into Unstable	2012-01-01 22:20:53 -05:00
Mauricio Carneiro	1b6d52817e	fixing adaptor clipping effect on recalibration integration test	2012-01-01 22:20:06 -05:00
Ryan Poplin	e45ca8bfa2	Protect against too many alternate alleles in the haplotype caller.	2012-01-01 19:12:48 -05:00
Eric Banks	393993e0c7	Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-12-31 20:42:46 -05:00
Eric Banks	b0d68eb0e3	Merge remote-tracking branch 'unstable/master'	2011-12-31 20:26:44 -05:00

1 2 3 4 5 ...

8529 Commits (90cc17ee2aa3ff9b266d1625da46c264169e4b9f) All Branches Search

8529 Commits (90cc17ee2aa3ff9b266d1625da46c264169e4b9f)

All Branches