gatk3的最后一个经典版本3.8
 
 
 
 
Go to file
delangel b45afe5ba8 Several major fixes and changes to new indel likelihood model:
a) Scrapped the way in which we constructed candidate haplotypes because it wasnt fully correct and yielded corner conditions with incorrect genotyping and likelihood computation. Ideally, a haplotype should "cover" the read and the most likely alignments should be such that the ends of the read are inside the ends of the haplotype. This wasn't happening, and if you have a "dangling read off a haplotype" the probabilistic alignment model may prefer to shift a read instead of scoring it correctly - this is especially bad with tandem repeat insertions. 
So now, we build haplotypes based on the reference context and adaptively change them based on read alignment positions, plus some padding and uncertainty in the alignment.

b) Changed the way soft clipped based are dealt with. Instead of either ignoring them or using them, we only use them if the read start or end position (after soft clipping) are within eventDistance of the current location. This is done because it's very common that BWA's strictly local SW implementation will soft clip every single read at an insertion position because it couldn't place that end of the read without too many mismatches, but the read is legit and the bases are good quality. If we don't take these bases into consideration, reads which are informative of an insertion event are essentially discarded because the informative part is clipped away. 

c) Several cleanups and fixes to the context-dependent gap penalty model based on length of HRun.





git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@5464 348d0f76-0448-11de-a6fe-93d51630548a
2011-03-17 18:39:31 +00:00
R Misc changes 2011-02-26 15:35:49 +00:00
analysis/depristo Scripts to create the GATK IAM user and give him/her rights to PutObject (and only PutObject) into the S3 storage instance. Updated the GATKRunReport to now upload using the GATK user, not mark@depristo.com. Running with -et AWS_S3 sends run reports up to the Amazon S3 cloud now. Going to request a few external users try this option so we can see it running at scale. I'm sure S3 can handle a few hundred thousand 1Kb uploads per days, though 2011-01-31 03:48:33 +00:00
archive Moving GLF code to archive 2011-01-15 22:42:42 +00:00
c Bug fixes for the bwa aligner and changes to support compiling against newer releases of the bwa code base. 2010-12-17 14:49:15 +00:00
doc removing the custom reflections library from the libs, and adding a release version. Hopefully this will fix the problem Menachem has been seeing with random JVM crashes. Also 2010-08-19 00:42:37 +00:00
java Several major fixes and changes to new indel likelihood model: 2011-03-17 18:39:31 +00:00
lua forgot to remove a debug line. 2011-02-15 16:25:48 +00:00
matlab Another matlab script -- this time for making power and coverage plots over a specific gene region. Lots of fun file reading, string manipulation, and exploration of the set() function 2009-11-30 20:02:25 +00:00
packages Adding the native jna support libraries from the jna.jar to the Queue package jar. 2011-01-08 10:48:30 +00:00
perl 2 more scripts I found helpful in syncing (and cleaning up) the 1000G mirror 2011-02-22 04:17:36 +00:00
python One last little thing, I swear 2011-03-11 17:37:40 +00:00
ruby accidentally commited an old tool 2010-08-25 15:42:02 +00:00
scala this is a oneoff script to clean the papuans and test TargetCreator and IndelRealigner with scatter gathering. 2011-03-17 17:09:53 +00:00
settings Update Picard / sam-jdk at Tim's request. 2011-01-03 02:17:25 +00:00
shell While generating YAML now warning and skipping TSV rows that don't have all values. 2011-02-25 20:50:03 +00:00
testdata ReplaceReadGroups. Fixes BAM files without read group info. MissingReadGroup points people to this tool now. Please point users on the forum to this tool now. Will migrate to Picard. 2011-02-21 14:02:41 +00:00
LICENSE Adding a license to the root directory in case BOSC checks for one. Has the 2010-04-20 16:04:29 +00:00
build.xml Build.xml contained references to tools now in picard 2011-03-17 18:29:46 +00:00
ivy.xml Added commons math, for Kristian. 2011-02-14 18:57:21 +00:00