-- The previous approach (requiring > 5 copies among all reads) breaks down with many samples (>1000): sequencing errors alone can cross the threshold.
-- This breakdown produces spurious clustered indels (lots of these!) around real common indels.
-- The new approach requires >X% of reads in a sample to carry an indel of any type (no allele matching) for that sample to be included in the count towards 5. This actually makes sense: with enough data we expect most reads to carry the indel, though the allele call may be wrong because of alignment, etc. With very few reads, any single indel-containing read crosses the threshold and is counted.
-- As far as I can tell this is the right thing to do in general. We'll make another call set in ESP and see how it works at scale.
-- Added integration tests to ensure the system behaves as I expect on the ESP site I developed the code against.
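The per-sample rule above can be sketched as follows. This is a minimal illustration, not the actual implementation: the class and method names, and the 5% figure standing in for the unspecified X%, are all hypothetical.

```java
// Hypothetical sketch of the per-sample indel-evidence rule described above.
// The 0.05 threshold stands in for the unspecified X%; names are illustrative.
public class IndelEvidenceSketch {

    // A sample counts toward the cross-sample threshold if more than
    // minFraction of its reads carry an indel of any type (no allele matching).
    // With very few reads, a single indel-containing read crosses the fraction;
    // with deep coverage, a lone sequencing-error read does not.
    static boolean sampleCarriesIndel(int indelReads, int totalReads, double minFraction) {
        if (totalReads == 0) return false;
        return (double) indelReads / totalReads > minFraction;
    }

    // The site is supported if at least minSamples samples pass the per-sample rule.
    static boolean siteSupported(int[][] perSample, double minFraction, int minSamples) {
        int supporting = 0;
        for (int[] s : perSample) {
            // s[0] = reads carrying any indel, s[1] = total reads in the sample
            if (sampleCarriesIndel(s[0], s[1], minFraction)) supporting++;
        }
        return supporting >= minSamples;
    }

    public static void main(String[] args) {
        // A single indel read in a shallow sample counts...
        System.out.println(sampleCarriesIndel(1, 3, 0.05));    // true
        // ...but a single (likely error) read in a deep sample does not.
        System.out.println(sampleCarriesIndel(1, 1000, 0.05)); // false
    }
}
```

The point of the fraction test is exactly the asymmetry in the message: deep samples no longer contribute error reads toward the count of 5, while shallow samples still contribute with any indel-containing read.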