gatk-3.8/java/test/org/broadinstitute/sting/utils
delangel 55b756f1cc First step in major cleanup/redo of VCF functionality. Specifically, now:
a) VCF track name can work again with 3.3 or 4.0 VCF's when specifying -B name,VCF,file. Code will read header and parse automatically the version. 
b) Old VCF codec is deprecated. Reader goes now direct from parsing VCF lines into producing VariantContext objects, with no intermediate VCF records. If anyone can't resist the urge to still input files using the old method, a new VCF3Codec is in place with the old code, but it will be eventually deleted.
c) VCF headers and VCF info fields no longer keep track of the version. They are parsed into an internal representation and will be output only in VCF4.0 format.
d) As a consequence, the existing GATK bug where files are produced with VCF4 body but VCF3.3 headers is solved.
e) Several VCF 4.0 writer bugs are now solved.
f) Integration test MD5's are changed, mostly because of corrected VCF4.0 headers and because validation data mostly uses now VCF4.0.
g) Several VCF files in the ValidationData/ directory have been converted to VCF 4.0 format. I kept the old versions, and the new versions have a .vcf4 extension.

Pending issues:
a) We are still not dealing with indels consistently or correctly when representing them. This will be a second part of the changes.
b) The VCF writer doesn't use VCFRecord but it does still use a lot of leftovers like VCFGenotypeEncoding, VCFGenotypeRecord, etc. This needs to be simplified and cleaned.



git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3813 348d0f76-0448-11de-a6fe-93d51630548a
2010-07-16 22:49:16 +00:00
..
bed IndexedFastaSequenceFile is now in Picard; transitioning to that implementation. 2010-07-01 04:40:31 +00:00
collections The copyright tag that I copied/pasted from a LaTeX document into IntelliJ had 2010-04-20 15:26:32 +00:00
genotype First step in major cleanup/redo of VCF functionality. Specifically, now: 2010-07-16 22:49:16 +00:00
interval A simple utility class that implements a merging Iterator<GenomeLoc> built over an interval or bed file (this is NOT a rod, but rather a direct line-by-line file reader that converts strings to genome locs on the fly and merges overlapping intervals) 2010-06-14 15:54:37 +00:00
sam Rev the reservoir downsampler to support partitioning through a functor. 2010-04-21 19:50:26 +00:00
threading A long overdue correction; all unit tests now end in 'UnitTest'. This was something we wanted to do for a while, and now with the performance tests coming, it was a good time to clean-up. Please label any new test appropriately: *UnitTest and *IntegrationTest are the two valid file name patterns for tests. 2010-04-08 06:14:15 +00:00
BaseUtilsUnitTest.java A long overdue correction; all unit tests now end in 'UnitTest'. This was something we wanted to do for a while, and now with the performance tests coming, it was a good time to clean-up. Please label any new test appropriately: *UnitTest and *IntegrationTest are the two valid file name patterns for tests. 2010-04-08 06:14:15 +00:00
GenomeLocParserUnitTest.java Refactoring intervals, separating the process of parsing interval lists, 2010-04-13 15:50:38 +00:00
GenomeLocSortedSetUnitTest.java add a fix so that XL arguments won't cancel out -BTI arguments, fixed a bug for Ben where the ROD -> interval list conversion was throwing an exception, and some old code removal. 2010-04-15 16:31:43 +00:00
GenomeLocUnitTest.java IndexedFastaSequenceFile is now in Picard; transitioning to that implementation. 2010-07-01 04:40:31 +00:00
MathUtilsUnitTest.java The copyright tag that I copied/pasted from a LaTeX document into IntelliJ had 2010-04-20 15:26:32 +00:00
PathUtilsUnitTest.java A long overdue correction; all unit tests now end in 'UnitTest'. This was something we wanted to do for a while, and now with the performance tests coming, it was a good time to clean-up. Please label any new test appropriately: *UnitTest and *IntegrationTest are the two valid file name patterns for tests. 2010-04-08 06:14:15 +00:00
ReservoirDownsamplerUnitTest.java Checking in downsampling iterator alongside LocusIteratorByState, and removing 2010-05-17 21:00:44 +00:00
UtilsUnitTest.java A long overdue correction; all unit tests now end in 'UnitTest'. This was something we wanted to do for a while, and now with the performance tests coming, it was a good time to clean-up. Please label any new test appropriately: *UnitTest and *IntegrationTest are the two valid file name patterns for tests. 2010-04-08 06:14:15 +00:00