gatk-3.8/public/java/test/org/broadinstitute/sting/utils
Mark DePristo 43d97c2e00 Rev Tribble to r97, adding binary feature support
From tribble logs:

Binary feature support in tribble

-- Massive refactoring and cleanup
-- Many bug fixes throughout
-- FeatureCodec is now general, with decode etc. taking a PositionBufferedStream
as an argument not a String
-- See ExampleBinaryCodec for an example binary codec
-- AbstractAsciiFeatureCodec provides to its subclass the same String decode,
readHeader functionality before.  Old ASCII codecs should inherit from this base
class, and will work without additional modifications
-- Split AsciiLineReader into a position tracking stream
(PositionalBufferedStream).  The new AsciiLineReader takes as an argument a
PositionalBufferedStream and provides the readLine() functionality of before.
Could potentially use optimizations (its a TODO in the code)
-- The Positional interface includes some more functionality that's now
necessary to support the more general decoding of binary features
-- FeatureReaders now work using the general FeatureCodec interface, so they can
index binary features
-- Bugfixes to LinearIndexCreator off by 1 error in setting the end block
position
-- Deleted VariantType, since this wasn't used anywhere and it's a particularly
clean why of thinking about the problem
-- Moved DiploidGenotype, which is specific to Gelitext, to the gelitext package
-- TabixReader requires an AsciiFeatureCodec as it's currently only implemented
to handle line oriented records
-- Renamed AsciiFeatureReader to TribbleIndexedFeatureReader now that it handles
Ascii and binary features
-- Removed unused functions here and there as encountered
-- Fixed build.xml to be truly headless
-- FeatureCodec readHeader returns a FeatureCodecHeader obtain that contains a
value and the position in the file where the header ends (not inclusive).
TribbleReaders now skip the header if the position is set, so its no longer
necessary, if one implements the general readHeader(PositionalBufferedStream)
version to see header lines in the decode functions.  Necessary for binary
codecs but a nice side benefit for ascii codecs as well
-- Cleaned up the IndexFactory interface so there's a truly general createIndex
function that takes the enumerated index type.  Added a writeIndex() function
that writes an index to disk.
-- Vastly expanded the index unit tests and reader tests to really test linear,
interval, and tabix indexed files.  Updated test.bed, and created a tabix
version of it as well.
-- Significant BinaryFeaturesTest suite.
-- Some test files have indent changes
2012-05-03 07:31:48 -04:00
..
R Refactoring/fixing up UG HMM code: a) Make code use PairHMM class instead of having duplicated code. That way UG and HaplotypeCaller now use same core code. Changes to be able to do this: 1. Compute context-dependent GOP as a function of read, not of haplotype, b) Extracted code to initialize HMM arrays into separate method, c) Move PairHMM class and unit test to public, d) Reenable banded code in PairHMM, inverted sense of flag (true=enable feature) but leave off in HaplotypeCaller. 2012-04-17 14:22:48 -04:00
activeregion Refactoring/fixing up UG HMM code: a) Make code use PairHMM class instead of having duplicated code. That way UG and HaplotypeCaller now use same core code. Changes to be able to do this: 1. Compute context-dependent GOP as a function of read, not of haplotype, b) Extracted code to initialize HMM arrays into separate method, c) Move PairHMM class and unit test to public, d) Reenable banded code in PairHMM, inverted sense of flag (true=enable feature) but leave off in HaplotypeCaller. 2012-04-17 14:22:48 -04:00
baq Putative fix for BAQ array out of bounds 2011-09-21 11:25:08 -04:00
clipping Add support for reads starting with insertion 2012-01-03 09:29:45 -05:00
codecs Rev Tribble to r97, adding binary feature support 2012-05-03 07:31:48 -04:00
collections Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
crypt Do not fail tests that require the GATK private key if the user does not have permission to read it 2012-03-06 15:57:02 -05:00
fasta Reduced the number of combinations being tested here, which was overkill 2011-09-01 10:42:43 -04:00
fragments GATKSAMRecord refactor 2011-11-03 15:43:26 -04:00
genotype/vcf Rev Tribble to r97, adding binary feature support 2012-05-03 07:31:48 -04:00
interval When constructing VariantContexts from symbolic alleles, check for the END tag in the INFO field; if present, set the stop position of the VC accordingly. Added integration test to ensure that this is working properly for use with -L intervals. 2012-04-04 10:57:05 -04:00
io Public-key authorization scheme to restrict use of NO_ET 2012-03-06 00:09:43 -05:00
pileup GATKSAMRecord refactor 2011-11-03 15:43:26 -04:00
recalibration disabling all BQSR unit tests 2012-04-30 14:39:55 -04:00
report Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
runtime No more hunting down R "resources". As a tradeoff Rscript cannot be specified on the commandline and will be found in the environment path. 2011-10-27 14:17:07 -04:00
sam Bug fix for: https://getsatisfaction.com/gsa/topics/problem_with_indelrealigner_and_l_unmapped 2012-04-27 09:58:38 -04:00
text Refactoring/fixing up UG HMM code: a) Make code use PairHMM class instead of having duplicated code. That way UG and HaplotypeCaller now use same core code. Changes to be able to do this: 1. Compute context-dependent GOP as a function of read, not of haplotype, b) Extracted code to initialize HMM arrays into separate method, c) Move PairHMM class and unit test to public, d) Reenable banded code in PairHMM, inverted sense of flag (true=enable feature) but leave off in HaplotypeCaller. 2012-04-17 14:22:48 -04:00
threading Removed GATK use of distributed parallelism framework. 2011-07-20 16:26:09 -04:00
variantcontext Rev Tribble to r97, adding binary feature support 2012-05-03 07:31:48 -04:00
BaseUtilsUnitTest.java Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
BitSetUtilsUnitTest.java BitSets keys to lower BQSR's memory footprint 2012-03-16 13:01:48 -04:00
GenomeLocParserUnitTest.java During flanking interval creation merging overlapping flanks so that on scatter the list doesn't accidentally genotype the same site twice. 2011-11-17 13:56:42 -05:00
GenomeLocSortedSetUnitTest.java Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
GenomeLocUnitTest.java Support for list of known CNVs in VariantEval 2011-11-30 17:05:16 -05:00
HaplotypeUnitTest.java Moved the Smith-Watermaning of haplotypes to earlier in the process so that alleles sent to genotyping would have the exact genomic sequence of the active region they represent. As a side effect cleaned up some edge case problems with variants, both real and false, which show up on the edges of active regions. Removed code that was replicated between the Haplotype class and ReadUtils. Finally figured out how to ensure that the indel calls coming out of the HC were left aligned. 2012-04-04 16:04:29 -04:00
MWUnitTest.java Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
MathUtilsUnitTest.java minor misc optimizations to PairHMM 2012-04-18 15:02:26 -04:00
PairHMMUnitTest.java Resolve merge conflicts 2012-04-18 16:25:03 -04:00
PathUtilsUnitTest.java Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00
QualityUtilsUnitTest.java Caching log calculations cut the non-Map runtime of HaplotypeCaller in half. Moved the qual log cache used in HC and PairHMM into a common place and added unit tests. 2012-03-21 08:45:42 -04:00
ReservoirDownsamplerUnitTest.java Moving reduced read functionality into GATKSAMRecord 2011-10-21 13:28:05 -04:00
SimpleTimerUnitTest.java The right fix for this test is just to delete it. 2011-11-15 14:53:27 -05:00
UtilsUnitTest.java Reorganized the codebase beneath top-level public and private directories, 2011-06-28 06:55:19 -04:00