gatk-3.8/java/test/org/broadinstitute/sting/gatk/walkers
delangel d534241f35 Major revamp of annotations for indels:
a) All rank sum tests now work for indels including multiallelic sites. For the latter cases, rank sum test is REF vs most common allele
b) Redid computation of HaplotypeScore for indels. It's now trivially easy to do because we are already computing likelihoods of each read vs haplotypes in GL computation so we reuse that if available. For multiallelic case, we score against N haplotypes where N is total called alleles.

Drawback is that all cases need information contained in likelihood table that stores likelihood for each pileup element, for each allele. If this table is not available we dont annotate, so we can only fully annotate indels right now when running UG but not when running VariantAnnotator alone.
 


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@5947 348d0f76-0448-11de-a6fe-93d51630548a
2011-06-04 15:34:24 +00:00
..
annotator After verifying that the changes to these tests were all in the RankSum annotations, I'm commiting fixes to the test md5s. 2011-06-02 13:01:18 +00:00
beagle PLEASE READ ME! In order to prepare for the upcoming changes to VCF4, we felt it was best to split up the vcf3 and vcf4 codecs (vcf4 is not backwards compatible to vcf3 and certain changes are too complex to handle in both codecs). Using the 'VCF' rod type in the GATK will now throw a UserException for vcf3.2 or vcf3.3 files telling you to use the 'VCF3' type instead (and vice versa). Integration/unit tests have been updated. For programmers: note that there is currently a lot of code duplication in the two codecs (although I pulled out the easy stuff to a VCFCodecUtils class); however WE ARE FREEZING THE VCF3 CODEC AND WILL NO LONGER MAKE CHANGES TO IT. All updates/improvements will be targetted to the vcf4 codec only as vcf3 is there only to be able to read legacy files. People should really be using vcf4 files only. 2011-05-11 12:07:44 +00:00
coverage -ct x no longer includes coverage in the previous bin 2011-02-24 15:52:04 +00:00
fasta Changing testing framework from junit -> testng, for its enhanced configurability. 2010-11-01 21:31:44 +00:00
filters PLEASE READ ME! In order to prepare for the upcoming changes to VCF4, we felt it was best to split up the vcf3 and vcf4 codecs (vcf4 is not backwards compatible to vcf3 and certain changes are too complex to handle in both codecs). Using the 'VCF' rod type in the GATK will now throw a UserException for vcf3.2 or vcf3.3 files telling you to use the 'VCF3' type instead (and vice versa). Integration/unit tests have been updated. For programmers: note that there is currently a lot of code duplication in the two codecs (although I pulled out the easy stuff to a VCFCodecUtils class); however WE ARE FREEZING THE VCF3 CODEC AND WILL NO LONGER MAKE CHANGES TO IT. All updates/improvements will be targetted to the vcf4 codec only as vcf3 is there only to be able to read legacy files. People should really be using vcf4 files only. 2011-05-11 12:07:44 +00:00
genotyper Major revamp of annotations for indels: 2011-06-04 15:34:24 +00:00
indels The underlying bam file for this test was busted for many reasons preventing Picard folks from making unrelated changes, so I needed to fix it. Updating md5s accordingly. 2011-06-03 22:26:06 +00:00
phasing Remove the extra trailing tab at the end of the VCF ## header line. Unfortunately, this meant updating every freaking integration test. 2010-12-08 17:22:29 +00:00
qc Reenabling E.coli ValidatingPileup with MV1994 realigned using the BWA/C bindings. 2011-05-23 21:32:53 +00:00
recalibration We now skip over all covering RODs in the BQSR as intended instead of just those which can be converted into a VariantContext. All the integration tests change because of subtleties in how certain dbsnp rod records are being converted into VCs. Added integration test which uses a bed file as the list of known polymorphic sites. 2011-05-27 16:32:17 +00:00
sequenom continuing from last night, the integration tests weren't covering the right behavior either 2011-04-28 13:30:57 +00:00
varianteval PLEASE READ ME! In order to prepare for the upcoming changes to VCF4, we felt it was best to split up the vcf3 and vcf4 codecs (vcf4 is not backwards compatible to vcf3 and certain changes are too complex to handle in both codecs). Using the 'VCF' rod type in the GATK will now throw a UserException for vcf3.2 or vcf3.3 files telling you to use the 'VCF3' type instead (and vice versa). Integration/unit tests have been updated. For programmers: note that there is currently a lot of code duplication in the two codecs (although I pulled out the easy stuff to a VCFCodecUtils class); however WE ARE FREEZING THE VCF3 CODEC AND WILL NO LONGER MAKE CHANGES TO IT. All updates/improvements will be targetted to the vcf4 codec only as vcf3 is there only to be able to read legacy files. People should really be using vcf4 files only. 2011-05-11 12:07:44 +00:00
variantrecalibration Updated VQSR integration tests because of the new Omni file. Fixed overflow condition in FisherStrand when the depth is too high. 2011-06-01 14:20:37 +00:00
variantutils No one should be calling the createHeader method(s) directly, but instead should be going through the full readHeader method (because it first sets the version); therefore I made them package protected and merged them. Updated the various unit tests that were using createHeader and were dangerously assuming that the header version was defaulting to 4.0 (which it no longer does). 2011-06-03 02:17:37 +00:00
BAQIntegrationTest.java Better query start / stop function that directly parses the cigar string, unlike the previous version. Now properly handles H (hard-clipped) reads. Added -baq OFF and -baq RECALCULATE integration tests on all three 1KG technologies. Please let me know if this new code somehow fails. 2011-01-28 15:08:21 +00:00
ClipReadsWalkersIntegrationTest.java Changing testing framework from junit -> testng, for its enhanced configurability. 2010-11-01 21:31:44 +00:00
PileupWalkerIntegrationTest.java Updated to now longer include 2nd-best base output 2011-04-03 20:13:10 +00:00
PrintReadsIntegrationTest.java Adding integration test to cover the case of a read that only covers an insertion (i.e. no M in the CIGAR string). 2011-06-02 21:02:47 +00:00
PrintReadsWalkerUnitTest.java Changing testing framework from junit -> testng, for its enhanced configurability. 2010-11-01 21:31:44 +00:00