gatk-3.8/public/java/test/org/broadinstitute/sting/utils
Mark DePristo fba7dafa0e Finalizing BCF2 mark III commit
-- Moved GENOTYPE_KEY vcf header line to VCFConstants.  This general migration and cleanup is on Eric's plate now
-- Updated HC to initialize the annotation engine in an order that allows it to write a proper VCF header.  Still doesn't work...
-- Updating integration test files.  Moved many more files into public/testdata.  Updated their headers to all work correctly with new strict VCF header checking.
-- Bugfix for TandemRepeatAnnotation that must be unbounded not A count type as it provides info for the REF as well as each alt
-- No longer add FALSE values to flag values in VCs in VariantAnnotatorEngine.  DB = 0 is never seen in the output VCFs now
-- Fixed bug in VCFDiffableReader that didn't differeniate between "." and "PASS" VC filter status
-- Unconditionally add lowQual Filter to UG output VCF files as this is in some cases (EMIT_ALL_SITES) used when the previous check said it wouldn't be
-- VariantsToVCF now properly writes out the GT FORMAT field
-- BCF2 codec explodes when reading symbolic alleles as I literally cannot figure out how to use the allele clipping code.  Eric said he and Ami will clean up this whole piece of instructure
-- Fixed bug in BCF2Codec that wasn't setting the phase field correctly.  UnitTested now
-- PASS string now added at the end of the BCF2 dictionary after discussion with Heng
-- Fixed bug where I was writing out all field values as BigEndian.  Now everything is LittleEndian.
-- VCFHeader detects the case where a count field has size < 0 (some of our files have count = -1) and throws a UserException
-- Cleaned up unused code
-- Fixed bug in BCF2 string encoder that wasn't handling the case of an empty list of strings for encoding
-- Fixed bug where all samples are no called in a VC, in which case we (like the VCFwriter) write out no called diploid genotypes for all samples
-- We always write the number of genotype samples into the BCF2 nSamples header.  How we can have a variable number of samples per record isn't clear to me, as we don't have a map from missing samples to header names...
-- Removed old filtersWereAppliedToContext code in VCF as properly handle unfiltered, filtered, and PASS records internally
-- Fastpath function getDisplayBases() in allele that just gives you the raw bytes[] you'd see for an Allele
-- Genotype fields no longer differentiate between unfiltered, filtered, and PASS values.  Genotype objects are all PASS implicitly, or explicitly filtered.  We only write out the FT values if at least one sample is filtered.  Removed interface functions and cleaned up code
-- Refactored padAllele code from createVariantContextWithPaddedAlleles into the function padAllele so that it actually works.  In general, **** NEVER COPY CODE **** if you need to share funcitonality make a function, that's why there were invented!
-- Increased the default number of records to read for DiffObjects to 1M
2012-06-21 15:16:27 -04:00
..
R Refactoring/fixing up UG HMM code: a) Make code use PairHMM class instead of having duplicated code. That way UG and HaplotypeCaller now use same core code. Changes to be able to do this: 1. Compute context-dependent GOP as a function of read, not of haplotype, b) Extracted code to initialize HMM arrays into separate method, c) Move PairHMM class and unit test to public, d) Reenable banded code in PairHMM, inverted sense of flag (true=enable feature) but leave off in HaplotypeCaller. 2012-04-17 14:22:48 -04:00
activeregion Refactoring/fixing up UG HMM code: a) Make code use PairHMM class instead of having duplicated code. That way UG and HaplotypeCaller now use same core code. Changes to be able to do this: 1. Compute context-dependent GOP as a function of read, not of haplotype, b) Extracted code to initialize HMM arrays into separate method, c) Move PairHMM class and unit test to public, d) Reenable banded code in PairHMM, inverted sense of flag (true=enable feature) but leave off in HaplotypeCaller. 2012-04-17 14:22:48 -04:00
baq Putative fix for BAQ array out of bounds 2011-09-21 11:25:08 -04:00
clipping Refactor on how RR treats soft clips 2012-06-21 14:02:03 -04:00
codecs Finalizing BCF2 mark III commit 2012-06-21 15:16:27 -04:00
collections Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
crypt Do not fail tests that require the GATK private key if the user does not have permission to read it 2012-03-06 15:57:02 -05:00
fasta Reduced the number of combinations being tested here, which was overkill 2011-09-01 10:42:43 -04:00
fragments GATKSAMRecord refactor 2011-11-03 15:43:26 -04:00
interval Feature request from Tim that could be useful to all: there's now an --interval_padding argument that specifies how many basepairs to add to each of the intervals provided with -L (on both ends). This is particularly useful when trying to run over the exome plus flanks and don't want to have to pre-compute the flanks (just use e.g. --interval_padding 50). Added integration test to cover this feature. 2012-06-18 21:36:27 -04:00
io Public-key authorization scheme to restrict use of NO_ET 2012-03-06 00:09:43 -05:00
pileup GATKSAMRecord refactor 2011-11-03 15:43:26 -04:00
recalibration The next round of BQSR optimizations: no more Long[] array creation 2012-06-14 00:05:42 -04:00
report Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
runtime No more hunting down R "resources". As a tradeoff Rscript cannot be specified on the commandline and will be found in the environment path. 2011-10-27 14:17:07 -04:00
sam Bug fix for: https://getsatisfaction.com/gsa/topics/problem_with_indelrealigner_and_l_unmapped 2012-04-27 09:58:38 -04:00
text Refactoring/fixing up UG HMM code: a) Make code use PairHMM class instead of having duplicated code. That way UG and HaplotypeCaller now use same core code. Changes to be able to do this: 1. Compute context-dependent GOP as a function of read, not of haplotype, b) Extracted code to initialize HMM arrays into separate method, c) Move PairHMM class and unit test to public, d) Reenable banded code in PairHMM, inverted sense of flag (true=enable feature) but leave off in HaplotypeCaller. 2012-04-17 14:22:48 -04:00
threading Removed GATK use of distributed parallelism framework. 2011-07-20 16:26:09 -04:00
variantcontext Finalizing BCF2 mark III commit 2012-06-21 15:16:27 -04:00
BaseUtilsUnitTest.java Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
BitSetUtilsUnitTest.java Oops, forgot to push the unit tests 2012-06-12 11:38:30 -04:00
GenomeLocParserUnitTest.java During flanking interval creation merging overlapping flanks so that on scatter the list doesn't accidentally genotype the same site twice. 2011-11-17 13:56:42 -05:00
GenomeLocSortedSetUnitTest.java Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
GenomeLocUnitTest.java Support for list of known CNVs in VariantEval 2011-11-30 17:05:16 -05:00
HaplotypeUnitTest.java Adding genotype given alleles mode to the HaplotypeCaller. 2012-05-30 15:07:01 -04:00
MWUnitTest.java Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
MathUtilsUnitTest.java minor misc optimizations to PairHMM 2012-04-18 15:02:26 -04:00
MedianUnitTest.java Final updates to integration tests for BCF2 2012-05-24 10:58:59 -04:00
PairHMMUnitTest.java Resolve merge conflicts 2012-04-18 16:25:03 -04:00
PathUtilsUnitTest.java Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
QualityUtilsUnitTest.java Caching log calculations cut the non-Map runtime of HaplotypeCaller in half. Moved the qual log cache used in HC and PairHMM into a common place and added unit tests. 2012-03-21 08:45:42 -04:00
ReservoirDownsamplerUnitTest.java Moving reduced read functionality into GATKSAMRecord 2011-10-21 13:28:05 -04:00
SimpleTimerUnitTest.java The right fix for this test is just to delete it. 2011-11-15 14:53:27 -05:00
UtilsUnitTest.java Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00