gatk3的最后一个经典版本3.8
 
 
 
 
Go to file
delangel 55b756f1cc First step in major cleanup/redo of VCF functionality. Specifically, now:
a) VCF track name can work again with 3.3 or 4.0 VCF's when specifying -B name,VCF,file. Code will read header and parse automatically the version. 
b) Old VCF codec is deprecated. Reader goes now direct from parsing VCF lines into producing VariantContext objects, with no intermediate VCF records. If anyone can't resist the urge to still input files using the old method, a new VCF3Codec is in place with the old code, but it will be eventually deleted.
c) VCF headers and VCF info fields no longer keep track of the version. They are parsed into an internal representation and will be output only in VCF4.0 format.
d) As a consequence, the existing GATK bug where files are produced with VCF4 body but VCF3.3 headers is solved.
e) Several VCF 4.0 writer bugs are now solved.
f) Integration test MD5's are changed, mostly because of corrected VCF4.0 headers and because validation data mostly uses now VCF4.0.
g) Several VCF files in the ValidationData/ directory have been converted to VCF 4.0 format. I kept the old versions, and the new versions have a .vcf4 extension.

Pending issues:
a) We are still not dealing with indels consistently or correctly when representing them. This will be a second part of the changes.
b) The VCF writer doesn't use VCFRecord but it does still use a lot of leftovers like VCFGenotypeEncoding, VCFGenotypeRecord, etc. This needs to be simplified and cleaned.



git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3813 348d0f76-0448-11de-a6fe-93d51630548a
2010-07-16 22:49:16 +00:00
R Can run R scripts on the command line 2010-07-09 00:13:18 +00:00
archive Deleting/archiving as instructed 2010-07-13 15:23:50 +00:00
c Reduce file handle usage. 2010-01-05 18:03:01 +00:00
doc moved to wiki 2009-07-22 16:35:23 +00:00
java First step in major cleanup/redo of VCF functionality. Specifically, now: 2010-07-16 22:49:16 +00:00
matlab Another matlab script -- this time for making power and coverage plots over a specific gene region. Lots of fun file reading, string manipulation, and exploration of the set() function 2009-11-30 20:02:25 +00:00
packages Update to the latest rev of the picard sharding patch. Includes updates reflecting 2010-06-06 20:33:21 +00:00
perl 1. Updated tests and added integration test for liftover code. 2010-07-12 17:58:18 +00:00
python Improvements to 1KG processing pipeline 2010-07-16 15:33:47 +00:00
ruby Adding the ruby directory, and a simple script to restart Bamboo 2009-12-03 22:48:48 +00:00
scala Better DOT support and updated recalibration pipeline 2010-07-16 20:54:51 +00:00
settings Rev Picard with new IndexedFastaSequenceFile patch. 2010-07-01 03:05:43 +00:00
shell a) Add input argument (-ncrate) to BeagleOutputToVCFWalker. If the genotype posterior error probability is higher than this threshold, we declare No-call at this genotype. 2010-07-06 18:33:28 +00:00
testdata and add changes to the vcf used in testing 2010-06-25 02:56:02 +00:00
LICENSE Adding a license to the root directory in case BOSC checks for one. Has the 2010-04-20 16:04:29 +00:00
build.xml adding Ivy lines for findbug, and adding a build task (to run it locally you need to have installation of findbug). I'll put more information on the wiki when it's up and running. 2010-07-08 19:10:19 +00:00
ivy.xml adding Ivy lines for findbug, and adding a build task (to run it locally you need to have installation of findbug). I'll put more information on the wiki when it's up and running. 2010-07-08 19:10:19 +00:00