gatk-3.8

Commit Graph

Author	SHA1	Message	Date
Mauricio Carneiro	a5e75cd14c	Outputting both consensus base qualities and counts The base qualities of a consensus reads are now the average quality of the bases forming the consensus base (most common base) and the consensus quality tag now carry an array with the counts of each base in the consensus. This should increase file size but improve calling sensitivity/specificity.	2011-09-29 12:54:41 -04:00
Mauricio Carneiro	4086fa768f	Disabling all ReadClipperUnitTests	2011-09-29 12:20:35 -04:00
Mauricio Carneiro	fc86cd6fd8	Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/carneiro/gatk/RR into rr	2011-09-29 00:12:15 -04:00
Roger Zurawicki	4fd5630f6a	Added ReadClipper Unit Test * Includes tests that include HardClip to Read and Reference Coords. * Changed ReadUtils.HardClipByReferenceCoordinates from private to protected to allow for testing	2011-09-28 23:13:50 -04:00
Matt Hanna	9272ed03b5	Merged bug fix from Stable into Unstable	2011-09-28 21:26:43 -04:00
Matt Hanna	0acaf2df65	Fix an embarrassing issue where a specific configuration of minimal coverage over small intervals could cause reads to be dropped from the pileup. Nothing to see here...	2011-09-28 21:23:01 -04:00
Guillermo del Angel	c8d3a720f9	Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-09-28 18:17:34 -04:00
Guillermo del Angel	7e3cb45093	Further performance optim in banded hmm, about 60% speed improvement over current implementation now	2011-09-28 16:27:28 -04:00
Ryan Poplin	1b1ca80df2	Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-09-28 16:17:39 -04:00
Ryan Poplin	3b73dc89fe	Making several esoteric arguments in the BQSR @Hidden. Adding basic support for Complete Genomics machine cycle.	2011-09-28 16:17:31 -04:00
Mauricio Carneiro	ff2f4df043	Fixed hardclipping inside indel (right tail) when hard clipping the right tail of a read falls inside a deletion, clipping should fall back to the last base before the deletion to follow the ReadClipper's contract.	2011-09-28 16:07:34 -04:00
Mauricio Carneiro	3c7b7f74ef	Optimized interval iteration Using a TreedSet to manipulate getToolkit.getIntervals() and being smart about which intervals to test makes interval clipping O(1) instead of O(n).	2011-09-28 16:07:34 -04:00
Mauricio Carneiro	5c9b659c02	clipping both ends of the reads was modifying the original read This goes against the ReadClipper contract, and was affecting the second part of the read that spans over multiple intervals. Fixed.	2011-09-28 16:07:34 -04:00
Guillermo del Angel	fe23e4d10c	Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-09-28 15:53:11 -04:00
Guillermo del Angel	e2b9030e93	First mostly fully functional implementation of banded pair HMM likelihood computation for indel caller. More experimentation to follow but it right now works in small data sets and at least it doesn't break existing things. Disabled by default at this point	2011-09-28 15:51:48 -04:00
Eric Banks	1b45f21774	Removing this command-line tool. Purposely not doing this in stable so that users who may still use it have time to find other options. But the docs are no longer on the wiki.	2011-09-28 13:18:32 -04:00
Eric Banks	1f0e354fae	Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-09-28 13:13:21 -04:00
Eric Banks	bb619a9a3c	Fixing docs	2011-09-28 13:13:03 -04:00
Mark DePristo	5812004e06	Merge branch 'stable'	2011-09-28 11:36:40 -04:00
Mark DePristo	a5006831d7	Shows "" not empty space when default string value is ""	2011-09-28 11:35:52 -04:00
Mark DePristo	1e32281a15	Fix to not show -null when missing short name argument	2011-09-28 11:31:20 -04:00
Mauricio Carneiro	89544c209c	Fixing contracts changed return type to Pair, changing contracts accordingly.	2011-09-28 11:19:17 -04:00
Eric Banks	eacbee3fe5	Merged bug fix from Stable into Unstable	2011-09-27 20:35:18 -04:00
Eric Banks	43b0c98298	Fix docs	2011-09-27 20:34:46 -04:00
Eric Banks	232a6df11c	Add longhand form to the error message.	2011-09-27 20:29:31 -04:00
Eric Banks	1d6fcb6eb1	Revert "Add longhand form to the error message to prevent users from posting borderline dumb posts to GS." This reverts commit 75b2600527cfce05ae683cb394290ff2a80e8552.	2011-09-27 20:27:00 -04:00
Eric Banks	269b9826b6	Add longhand form to the error message to prevent users from posting borderline dumb posts to GS.	2011-09-27 20:26:36 -04:00
Mauricio Carneiro	3b6e43b7c4	Use reads that span multiple intervals * RR will now compress reads that span across multiple intervals correctly and output them in the correct order. * Fixed bug in getReadCoordinateForReferenceCoordinate where if the requested reference coordinate fell inside a deletion in the read the read would be clipped up to one element past the deletion.	2011-09-27 18:39:06 -04:00
Khalid Shakir	84bd355690	Merged bug fix from Stable into Unstable	2011-09-27 14:34:39 -04:00
Khalid Shakir	b090751f62	Fixed Ant / PluginManager issue where reflections was picking up all class files under current working directory due to "." in jar manifest classpaths. Updates to HybridSelectionPipeline: - Added annotations back via snpEff - Minor updates to VQSR paths and lowered memory	2011-09-27 14:33:57 -04:00
Eric Banks	26e71f6688	The Omni files have multiple records (with the same ALT) at a particular location, with one PASSing and the other(s) filtered. Chris, this is why using this file as both eval and comp leads to ref/no-call cells in the GenotypeConcordance table. However, this led to non-determinism in VE because the VCs were placed in a HashSet; we use a LinkedHashMap instead to bring back determinism.	2011-09-27 11:03:17 -04:00
Guillermo del Angel	ceffefa6a6	Intermediate version with banded pair HMM	2011-09-27 10:18:58 -04:00
Mark DePristo	e99ff3caae	Removed lots of old, and not to be used, HMM options -- resulted in massive code cleanup -- GdA will integrate his new banded algorithm here -- Removed: DO_CONTEXT_DEPENDENT_PENALTIES, GET_GAP_PENALTIES_FROM_DATA, INDEL_RECAL_FILE, dovit, GSA_PRODUCTION_ONLY	2011-09-27 10:08:40 -04:00
Mark DePristo	fa0efbc4ca	Refactoring of PairHMM to support reduced reads	2011-09-26 13:28:56 -04:00
Mark DePristo	a6b65d6347	Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-09-26 13:26:21 -04:00
Mark DePristo	4f09453470	Refactored reduced read utilities -- UnitTests for key functions on reduced reads -- PileupElement calls static functions in ReadUtils -- Simple routine that takes a reduced read and fills in its quals with its reduced qual	2011-09-26 12:58:31 -04:00
Eric Banks	234b74dd05	Merged bug fix from Stable into Unstable	2011-09-26 11:47:23 -05:00
Eric Banks	317b95fa57	Fixing some annotator docs	2011-09-26 11:46:45 -05:00
Mauricio Carneiro	b76dbc72f0	Fixed interval navigation bug. If a read was hard clipped away from the current interval, all subsequent reads within that interval (not hardclipped) would be filtered out. Fixed.	2011-09-26 08:13:44 -04:00
Guillermo del Angel	9afccd11b1	Minor refactoring: add ability to MathUtils.normalizeFromLog10 to not go to linear domain but just substract max value from log values and return. Use this function in snp and indel GL computation.	2011-09-25 21:18:56 -04:00
Guillermo del Angel	3eef800889	Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-09-24 21:20:11 -04:00
Guillermo del Angel	4707ab4a7d	Added unit tests to test genotype merges with PL's	2011-09-24 21:17:15 -04:00
Guillermo del Angel	203517fbb7	a) Cleanups/bug fixes to previous commit to CombineVariants. b) Change md5 to reflect records that are now merged correctly. c) Change unit merge alleles test to reflect the fact that a null non-variant vc object is not valid and not supported because there's no way to codify such object in a vcf. The code correctly converts this to a non-variant single-base event with whatever the reference is at that location.	2011-09-24 19:08:00 -04:00
Mauricio Carneiro	c31f4cb2f6	Cleaning leading insertions With the current implementation, a read cannot start with a deletion or an insertion. Maybe this will change in the future, but for now, chop the leading insertion off.	2011-09-24 14:33:32 -04:00
Guillermo del Angel	cd058dd10f	a) Fixed md5 for legit change in UG output that now also no-calls genotypes w/0,0,0 in PL's in SNP case. b) First reimplementation of new vc merger of different types. Previous version did it in two steps, first merging all vc's per type and then trying to see if resulting vc's would be merged if alleles of one type were a subset of another, but this won't work when uniquifying genotypes since sample names would be messed up and GT sample names wouldn't match VC sample names. Now, it's actually simpler: when splitting vc's by type before merging, we check for alleles of one vc being a subset of alleles of vc of another type and if so we put them together in same list.	2011-09-24 13:40:11 -04:00
Mark DePristo	bb11951255	Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-09-24 09:26:45 -04:00
Mark DePristo	8d9e136bba	Merge branch 'stable'	2011-09-24 09:26:28 -04:00
Mark DePristo	6804ab6d2f	Bug fix for NPE in very short GATK runs -- Was already in unstable, but not stable...	2011-09-24 09:25:29 -04:00
Mark DePristo	92acff46e5	Moved Haplotype into Utils root	2011-09-24 09:14:05 -04:00
Mark DePristo	f792353dcd	Framework for genotype unit test	2011-09-24 08:56:45 -04:00

1 2 3 4 5 ...

822 Commits (a5e75cd14cf7a8b6d3c4c8271302468fc5f256b4)