gatk-3.8/public/java/test/org/broadinstitute/sting
Mark DePristo 34ea443cdb Better algorithm for choosing which indel alleles are present in samples
-- The previous approach (requiring > 5 copies among all reads) is breaking down in many samples (>1000) just from sequencing errors.
-- This breakdown is producing spurious clustered indels (lots of these!) around real common indels
-- The new approach requires >X% of reads in a sample to carry an indel of any type (no allele matching) to be including in the counting towards 5.  This actually makes sense in that if you have enough data we expect most reads to have the indel, but the allele might be wrong because of alignment, etc.  If you have very few reads, then the threshold is crossed with any indel containing read, and it's counted.
-- As far as I can tell this is the right thing to do in general.  We'll make another call set in ESP and see how it works at scale.
-- Added integration tests to ensure that the system is behaving as I expect on the site I developed the code on from ESP
2012-03-26 16:28:49 -04:00
..
alignment Reinitialize random seed in the bwa bindings from the fixed seed stored in the 2011-07-22 13:41:53 -04:00
commandline Added ability to specify arguments in files via -args/--arg_file 2011-10-24 15:58:34 -04:00
gatk Better algorithm for choosing which indel alleles are present in samples 2012-03-26 16:28:49 -04:00
jna When using bam list mode in HSP deriving VCF name from bam list instead of requiring an additional parameter. 2011-12-16 18:09:25 -05:00
utils BQSR with GATKReport implementation 2012-03-23 15:42:32 -04:00
BaseTest.java CalibrateGenotypeLikelihoods now emits the position of the variant for debugging 2012-03-09 16:00:07 -05:00
MD5DB.java Merged bug fix from Stable into Unstable 2011-10-13 18:02:06 -04:00
MedianUnitTest.java Adding more unit tests and contracts to PairHMM util class. Updating HaplotypeCaller to use the new PairHMM util class. Now that the HMM result isn't dependent on the length of the haplotype there is no reason to ensure all haplotypes have the save length which simplifies the code considerably. 2012-03-05 08:28:42 -05:00
StingTextReporter.java Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
WalkerTest.java Minor: cleaner way of turning off index-on-the-fly checking in case we want to turn it back on. 2012-03-18 00:53:29 -04:00