gatk-3.8/public/java/test/org/broadinstitute/sting
Eric Banks 593d3469d4 Refactored the het (polyploid) consensus creation in ReduceReads.
* It is now cleaner and easier to test; added tests for newly implemented methods.
 * Many fixes to the logic to make it work
   * The most important change was that after triggering het compression we actually need to back it out if it
      creates reads that incorporated too many softclips at any one position (because they get unclipped).
   * There was also an off-by-one error in the general code that only manifested itself with het compression.
 * Removed support for creating a het consensus around deletions (which was broken anyways).
   * Mauricio gave his blessing for this.
 * Het compression now works only against known sites (with -known argument).
    * The user can pass in one or more VCFs with known SNPs (other variants are ignored).
    * If no known SNPs are provided het compression will automatically be disabled.
 * Added SAM tag to stranded (i.e. het compressed) reduced reads to distinguish their
   strandedness from normal reduced reads.
    * GATKSAMRecord now checks for this tag when determining whether or not the read is stranded.
    * This allows us to update the FisherStrand annotation to count het compressed reduced reads
       towards the FS calculation.
    * [It would have been nice to mark the normal reads as unstranded but then we wouldn't be
       backwards compatible.]
    * Updated integration tests accordingly with new het compressed bams (both for RR and UG).
 * In the process of fixing the FS annotation I noticed that SpanningDeletions wasn't handling
   RR properly, so I fixed it too.
    * Also, the test in the UG engine for determining whether there are too many overlapping
       deletions is updated to handle RR.
 * I added a special hook in the RR integration tests to additionally run the systematic
   coverage checking tool I wrote earlier.
    * AssessReducedCoverage is now run against all RR integration tests to ensure coverage is
       not lost from original to reduced bam.
    * This helped uncover a huge bug in the MultiSampleCompressor where it would drop reads
       from all but 1 sample (now fixed).
    * AssessReducedCoverage moved from private to protected for packaging reasons.
 * #resolve GSA-639

At this point, this commit encompasses most of what is needed for het compression to go live.
There are still a few TODO items that I want to get in before the 2.5 release, but I will save
those for a separate branch because as it is I feel bad for the person who needs to review all
these changes (sorry, Mauricio).
2013-03-25 09:34:54 -04:00
..
commandline Fixed the logic of the @Output annotation and its interaction with 'required'. 2013-03-14 11:58:51 -04:00
gatk Refactored the het (polyploid) consensus creation in ReduceReads. 2013-03-25 09:34:54 -04:00
jna Fix more tests that fail when run in parallel on the farm 2013-03-12 16:53:30 -04:00
utils Refactored the het (polyploid) consensus creation in ReduceReads. 2013-03-25 09:34:54 -04:00
BaseTest.java Replace org.broadinstitute.variant with jar built from the Picard repo 2013-02-05 17:24:25 -05:00
ExampleToCopyUnitTest.java Replace org.broadinstitute.variant with jar built from the Picard repo 2013-02-05 17:24:25 -05:00
MD5DB.java Encrypt GATK AWS keys using the GATK private key, and decrypt as needed as a resource when uploading to AWS logs 2013-01-30 16:42:23 -05:00
MD5Mismatch.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
StingTextReporter.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
TestNGTestTransformer.java Fix tests that were consistently or intermittently failing when run in parallel on the farm 2013-03-06 13:56:54 -05:00
WalkerTest.java Replace org.broadinstitute.variant with jar built from the Picard repo 2013-02-05 17:24:25 -05:00