gatk-3.8/protected/java/test/org/broadinstitute/sting/gatk/walkers
Eric Banks b16c7ce050 A whole slew of improvements to the Haplotype Caller and related code.
1. Some minor refactorings and claenup (e.g. removing unused imports) throughout.

2. Updates to the KB assessment functionality:
   a. Exclude duplicate reads when checking to see whether there's enough coverage to make a call.
   b. Lower the threshold on FS for FPs that would easily be filtered since it's only single sample calling.

3. Make the HC consistent in how it treats the pruning factor.  As part of this I removed and archived
   the DeBruijn assembler.

4. Improvements to the likelihoods for the HC
   a. We now include a "tristate" correction in the PairHMM (just like we do with UG).  Basically, we need
      to divide e by 3 because the observed base could have come from any of the non-observed alleles.
   b. We now correct overlapping read pairs.  Note that the fragments are not merged (which we know is
      dangerous).  Rather, the overlapping bases are just down-weighted so that their quals are not more
      than Q20 (or more specifically, half of the phred-scaled PCR error rate); mismatching bases are
      turned into Q0s for now.
   c. We no longer run contamination removal by default in the UG or HC.  The exome tends to have real
      sites with off kilter allele balances and we occasionally lose them to contamination removal.

5. Improved the dangling tail merging implementation.
2013-07-12 10:09:10 -04:00
..
annotator Fix bug introduced recently in the VariantAnnotator where only the last -comp was being annotated at a site. 2013-07-05 00:04:52 -04:00
beagle Simpler FILTER and info field encoding for BeagleOutputToVCF 2013-06-14 15:56:13 -04:00
bqsr Removed plots generation from the BaseRecalibration software 2013-06-19 14:47:56 -04:00
compression/reducereads Proper fix for previous RR -cancer_mode fix. 2013-06-26 14:48:09 -04:00
diagnostics Update MD5s and the Diagnose Target scala script 2013-05-13 12:06:17 -04:00
diffengine Fixed issues raised by Appistry QA (mostly small fixes, corrections & clarifications to GATKDocs) 2013-03-12 10:57:14 -04:00
fasta Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
filters Don't allow users to specify keys and IDs that contain angle brackets or equals signs (not allowed in VCF spec). 2013-04-05 00:52:32 -04:00
genotyper A whole slew of improvements to the Haplotype Caller and related code. 2013-07-12 10:09:10 -04:00
haplotypecaller A whole slew of improvements to the Haplotype Caller and related code. 2013-07-12 10:09:10 -04:00
indels Another fix for the Indel Realigner that arises because of secondary alignments. 2013-06-21 16:59:22 -04:00
phasing Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
validation MathUtils.randomSubset() now uses Collections.shuffle() (indirectly, through the other methods 2013-03-29 14:52:10 -04:00
varianteval Move some VCF/VariantContext methods back to the GATK based on feedback 2013-01-29 16:56:55 -05:00
variantrecalibration Update MD5s for VQSR header change 2013-04-16 11:45:45 -04:00
variantutils CombineVariants no longer adds PASS to unfiltered records 2013-05-20 16:53:51 -04:00