gatk-3.8

Commit Graph

Author	SHA1	Message	Date
Ryan Poplin	a7fe334a3f	calculating the md5s for the new tests.	2013-01-11 15:43:52 -05:00
Ryan Poplin	65afec2a53	Merge branch 'master' of github.com:broadinstitute/gsa-unstable	2013-01-11 15:22:52 -05:00
Mark DePristo	85b529cced	Updating MD5s in HC and UG that changed due to new LIBS -- Resolved what was clearly a bug in UG (GGA mode was returning a neighboring, equivalent indel site that wasn't in input list. Not ideal) -- Trivial read count differences in HC	2013-01-11 15:17:19 -05:00
Mark DePristo	f204908a94	Add some todos for future optimization to LIBS	2013-01-11 15:17:18 -05:00
Mark DePristo	e88dae2758	LocusIteratorByState operates natively on GATKSAMRecords now -- Updated code to reflect this new typing	2013-01-11 15:17:18 -05:00
Mark DePristo	94cb50d3d6	Retire LegacyLocusIteratorByState -- Left in the remaining infrastructure for David to remove, but the legacy downsampler is no longer a functional option in the GATK	2013-01-11 15:17:18 -05:00
Mark DePristo	cc0c1b752a	Delete old LocusIteratorByState, leaving only new LIBS and legacy	2013-01-11 15:17:18 -05:00
Mark DePristo	bd03511e35	Updating AlignmentStateMachinePerformance to include some more useful performance assessments	2013-01-11 15:17:18 -05:00
Mark DePristo	9e23c592e6	ReadBackedPileup cleanup -- Only ReadBackedPileupImpl (concrete class) and ReadBackedPileup (interface) live, moved all functionality of AbstractReadBackedPileup into the impl -- ReadBackedPileupImpl was literally a shell class after we removed extended events. A few bits of code cleanup and we reduced a bunch of class complexity in the gatk -- ReadBackedPileups no longer accept pre-cached values (size, nMapQ reads, etc) but now lazy load these values as needed -- Created optimized calculation routines to iterator over all of the reads in the pileup in whatever order is most efficient as well. -- New LIBS no longer calculates size, n mapq, and n deletion reads while making pileups. -- Added commons-collections for IteratorChain	2013-01-11 15:17:18 -05:00
Mark DePristo	e3e3ae29b2	Final documentation for LocusIteratorByState	2013-01-11 15:17:18 -05:00
Mark DePristo	6a91902aa2	Fix final merge conflicts	2013-01-11 15:17:18 -05:00
Mark DePristo	b9a33d3c66	Split original and optimized ART into largely independent pieces -- Allows us to cleanly run old and new art, which now have different traversal behavior (on purpose). Split unit tests as well.	2013-01-11 15:17:18 -05:00
Mark DePristo	02130dfde7	Cleanup ART -- Initialize routine captures essential information for running the traversal	2013-01-11 15:17:17 -05:00
Mark DePristo	9b2be795a7	Initial working version of new ActiveRegionTraversal based on the LocusIteratorByState read stream -- Implemented as a subclass of TraverseActiveRegions -- Passes all unit tests -- Will be very slow -- needs logical fixes	2013-01-11 15:17:17 -05:00
Mark DePristo	8b83f4d6c7	Near final cleanup of PileupElement -- All functions documented and unit tested -- New constructor interface -- Cleanup some uses of old / removed functionality	2013-01-11 15:17:17 -05:00
Mark DePristo	fb9eb3d4ee	PileupElement and LIBS cleanup -- function to create pileup elements in AlignmentStateMachine and LIBS -- Cleanup pileup element constructors, directing users to LIBS.createPileupFromRead() that really does the right thing	2013-01-11 15:17:17 -05:00
Mark DePristo	2f2a592c8e	Contracts and documentation for AlignmentStateMachine and LocusIteratorByState -- Add more unit tests for both as well	2013-01-11 15:17:17 -05:00
Mark DePristo	cc1d259cac	Implement get Length and Bases of OfImmediatelyFollowingIndel in PileupElement -- Added unit tests for this behavior. Updated users of this code	2013-01-11 15:17:17 -05:00
Mark DePristo	2c38310868	Create LIBS using new AlignmentStateMachine infrastructure -- Optimizations to AlignmentStateMachine -- Properly count deletions. Added unit test for counting routines -- AlignmentStateMachine.java is no longer recursive -- Traversals now use new LIBS, not the old one	2013-01-11 15:17:17 -05:00
Mark DePristo	80d9b7011c	Complete rewrite of low-level machinery of LIBS, not hooked up -- AlignmentStateMachine does what SAMRecordAlignmentState should really do. It's correct in that it's more accurate than the LIB_position tests themselves. This is a non-broken, correct implementation. Needs cleanup, contracts, etc. -- This version is like 6x slower than the original implementation (according to the google caliper benchmark here). Obvious optimizations for future commit	2013-01-11 15:17:16 -05:00
Mark DePristo	b53286cc3c	HaplotypeCaller mode to skip assembly and genotyping for performance testing -- Added HCPerformance evaluation Qscript -- Added some docs about one of the HC integration tests -- HaplotypeCaller / ART performance evaluation script	2013-01-11 15:17:16 -05:00
Mark DePristo	0ac4352614	LIBS can now (optionally) track the unique reads it uses from the underlying read iterator -- This capability is essential to provide an ordered set of used reads to downstream users of LIBS, such as ART, who want an efficient way to get the reads used in LIBS -- Vastly expanded the multi-read, multi-sample LIBS unit tests to make sure this capability is working -- Added createReadStream to ArtificialSAMUtils that makes it relatively easy to create multi-read, multi-sample read streams for testing	2013-01-11 15:17:16 -05:00
Mark DePristo	b3ecfbfce8	Refactor LIBS into component parts, expand unit tests, some code cleanup -- Split out all of the inner classes of LIBS into separate independent classes -- Split / add unit tests for many of these components. -- Radically expand unit tests for SAMRecordAlignmentState (the lowest level piece of code) making sure at least some of it works -- No need to change unit tests or integration tests. No change in functionality. -- Added (currently disabled) code to track all submitted reads to LIBS, but this isn't accessible or tested	2013-01-11 15:17:16 -05:00
Mark DePristo	2e5d38fd0e	Updating to latest google caliper code	2013-01-11 15:17:16 -05:00
Mark DePristo	b2990497e2	Refactor LIBS into utils.locusiterator before refactoring	2013-01-11 15:17:16 -05:00
Ryan Poplin	e952296c10	Adding HC GGA integration test to cover duplicated input alleles.	2013-01-11 15:01:27 -05:00
Ryan Poplin	7f7f40f851	Adding additional HC GGA integration tests to cover more complicated input alleles.	2013-01-11 14:36:21 -05:00
Mauricio Carneiro	9ed922d562	Updating licenses to Eric's last commit - for now we're still running the script by hand, soon automated solution will be in place. GSATDG-5	2013-01-11 14:33:00 -05:00
Mauricio Carneiro	009d2f5705	Removed CMI specific script from GATK repo	2013-01-11 14:33:00 -05:00
Ami Levy-Moonshine	e9a8b1a403	Merge branch 'master' of github.com:broadinstitute/gsa-unstable	2013-01-11 14:18:26 -05:00
Ami Levy-Moonshine	9519c3fd6f	(1) add scala script to merge bam files; (2) few change in Queue script that run the new CoveredByNSamplesSites walker	2013-01-11 13:47:05 -05:00
Mauricio Carneiro	bc64d4240f	Licensing update -- batch #2 - caught all scala files that didn't have proper package information / class names - included all source files in archive as well GSATDG-5	2013-01-11 13:38:11 -05:00
Mauricio Carneiro	4ea2c5df43	Updating updateAllLicenses scripts to include archived files GSATDG-5	2013-01-11 13:38:05 -05:00
Mauricio Carneiro	28235f57f2	Adding package information to scala scripts that were missing it. Including archived ones. GSATDG-5	2013-01-11 13:38:05 -05:00
Mauricio Carneiro	cc9a2aaee7	Script to identify code without package info - package information is critical for the licensing scripts. All java and scala files MUST contain package information. GSATDG-5	2013-01-11 13:38:05 -05:00
Eric Banks	85baf71b39	Merged bug fix from Stable into Unstable	2013-01-11 11:05:27 -05:00
Eric Banks	d78539774f	Another RR bug: off by one error led to ArrayIndexOutOfBoundsException when working with multiple samples and the variant region ended 1 base after the end of the last read for a given sample.	2013-01-11 11:05:09 -05:00
Eric Banks	79b93f659c	Merged bug fix from Stable into Unstable	2013-01-11 09:20:13 -05:00
Eric Banks	67fafbb625	Forgot an include	2013-01-11 09:19:46 -05:00
Eric Banks	6bf0cc32f9	When reducing multiple samples it is possible to try to close a region that for a given sample has no reads. Currently we'd NPE. Fixed.	2013-01-11 09:16:19 -05:00
Eric Banks	e7906713d9	Moving some random walkers back to public as requested by Mark. Mauricio will the licenses get updated automatically?	2013-01-11 02:03:43 -05:00
Eric Banks	3a51823c2a	Clean up imports	2013-01-10 23:35:01 -05:00
Eric Banks	e4b7b1955c	Forgot to add the note about length normalization to the QD docs	2013-01-10 23:34:06 -05:00
Eric Banks	ff5ac986d8	Fix docs for QD	2013-01-10 23:31:46 -05:00
Ami Levy-Moonshine	352cb831d0	Merge branch 'master' of github.com:broadinstitute/gsa-unstable	2013-01-10 21:27:06 -05:00
Ami Levy-Moonshine	fac0bce916	add RunCoveredByNSamplesSites; changes in CoveredByNSamplesSites so it can work in parallel; also, move it to diagnostics	2013-01-10 21:26:49 -05:00
Mauricio Carneiro	ea8c8573d2	Fixing ParseLicense script for scala syntax - Scala allows package objects in its syntax, so the script needs to be aware of that and not add "*/" every time it sees it. GSATDG-5	2013-01-10 18:24:24 -05:00
Mauricio Carneiro	e5913e50b2	Updating licenses for all scala files GSATDG-5	2013-01-10 17:46:10 -05:00
Mauricio Carneiro	9b45e54ae6	Adding package information to stray scala files GSATDG-5	2013-01-10 17:45:36 -05:00
Mauricio Carneiro	6623148838	Removing CMI scripts from the GATK repo	2013-01-10 17:45:36 -05:00

11577 Commits (a7fe334a3fc3fda04e019019dd3a8c671d995bb6)