gatk-3.8/public/java/test/org/broadinstitute/sting/gatk
Mark DePristo 34ea443cdb Better algorithm for choosing which indel alleles are present in samples
-- The previous approach (requiring > 5 copies among all reads) is breaking down in many samples (>1000) just from sequencing errors.
-- This breakdown is producing spurious clustered indels (lots of these!) around real common indels
-- The new approach requires >X% of reads in a sample to carry an indel of any type (no allele matching) to be including in the counting towards 5.  This actually makes sense in that if you have enough data we expect most reads to have the indel, but the allele might be wrong because of alignment, etc.  If you have very few reads, then the threshold is crossed with any indel containing read, and it's counted.
-- As far as I can tell this is the right thing to do in general.  We'll make another call set in ESP and see how it works at scale.
-- Added integration tests to ensure that the system is behaving as I expect on the site I developed the code on from ESP
2012-03-26 16:28:49 -04:00
..
datasources Avoid buffer underflow in GATKBAMIndex by detecting premature EOF in BAM indices 2012-03-08 15:30:44 -05:00
executive Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
filters Taking care of bad cigars in the GATK 2012-03-20 14:32:57 -04:00
iterators a) Bug fix in calling new functions that give indel bases and length from regular pileup in LocusIteratorByState, b) Added unit test to cover these. 2012-02-25 13:57:28 -05:00
refdata Merge with master 2011-11-19 09:56:06 -05:00
report GATK Report code cleanup 2012-03-19 11:53:57 -04:00
samples Added UnitTests for getFamilies() and getChildrenWithParents() 2011-12-09 15:57:07 +01:00
traversals First pass of the asynchronous block loader. 2011-11-18 15:02:59 -05:00
walkers Better algorithm for choosing which indel alleles are present in samples 2012-03-26 16:28:49 -04:00
EngineFeaturesIntegrationTest.java HierarchicalMicroScheduler no longer attempts to wrap exceptions 2012-03-23 11:27:21 -04:00
GenomeAnalysisEngineUnitTest.java Allow processing of empty intervals, but warn user when this case is encountered. 2011-10-28 12:12:14 -04:00
WalkerManagerUnitTest.java Moving more tools over to use the standard VC arg collection. Also, while I'm in there, I removed all of the empty references to @Requires given that it's no longer relevant. 2011-08-10 12:20:18 -04:00