gatk-3.8

Commit Graph

Author	SHA1	Message	Date
Mark DePristo	2d802e17a4	Delete the CachingPairHMM	2013-02-09 13:06:54 -05:00
Mark DePristo	7dcafe8b81	Preliminary version of LoglessCachingPairHMM that avoids positive likelihoods -- Would have been squashed but could not because of subsequent deletion of Caching and Exact/Original PairHMMs -- Actual working unit tests for PairHMMUnitTest -- Fixed incorrect logic in how I compared hmm results to the theoretical and exact results -- PairHMM has protected variables used throughout the subclasses	2013-02-09 13:06:54 -05:00
Mauricio Carneiro	b7593aeadc	Removing the symlink from the private license file We had identified this problem before, but Dropbox tricked me into pushing it again into the repo.	2013-02-09 12:57:44 -05:00
Mark DePristo	ca76de0619	Move ProcessUtilsUnitTest to private	2013-02-09 12:34:45 -05:00
MauricioCarneiro	f5e52b72ea	Merge pull request #23 from broadinstitute/md_process_utils_unit_tests UnitTests for ProcessUtils	2013-02-09 09:27:31 -08:00
MauricioCarneiro	3ff10ab277	Merge pull request #24 from broadinstitute/md_ngsplatform_unittests Expand NGSPlatform to meet SAM 1.4 spec, with full unit tests	2013-02-09 09:27:03 -08:00
MauricioCarneiro	7dbdc4ea6a	Merge pull request #22 from broadinstitute/md_better_contig_comparer Generalize and fixup ContigComparator	2013-02-09 09:26:37 -08:00
Mark DePristo	b127fc6a1a	Expand NGSPlatform to meet SAM 1.4 spec, with full unit tests -- Added CAPILLARY and HELICOS platforms as required by spec 1.4 -- Added extensive unit tests to ensure NGSPlatform functions work as expected. -- Fixed some NPE bugs for reads that don't have RGs or PLs in their RG fields	2013-02-09 11:16:21 -05:00
Mark DePristo	fc3307a97f	UnitTests for ProcessUtils	2013-02-09 10:13:01 -05:00
Mark DePristo	7fb620dce7	Generalize and fixup ContigComparator -- Now uses a SAMSequenceDictionary to do the comparison of contigs (which is the right way to do it) -- Added unit tests	2013-02-09 09:52:13 -05:00
Mark DePristo	a3dc7dc5cb	Extend AWS timeout for uploads of the GATK run reports to 30 seconds	2013-02-08 17:37:36 -05:00
depristo	db5b5e3482	Merge pull request #21 from broadinstitute/mc_base_coverage_distribution_GSATDG-45 Mc base coverage distribution gsatdg 45	2013-02-08 09:45:14 -08:00
Mauricio Carneiro	d004bfbe6f	walker to calculate per base coverage distribution -- Base distribution optionally includes deletions -- Implemented an optional filtered coverage distribution option -- Integration tests added for every feature of the traversal This walker is specially fast for the task due to the ability to calculate uncovered bases without having to visit the loci. This capability should be made generic in the future for the advantage of DiagnoseTargets and DepthOfCoverage. GSATDG-45 #resolve	2013-02-07 16:33:05 -05:00
Mauricio Carneiro	5f49c95cc1	Added distance across contigs calculation to GenomeLocs -- distance across contigs is calculated given a sequence dictionary (from SAMFileHeader) -- unit test added GSATDG-45	2013-02-07 16:31:41 -05:00
depristo	cd4aec177a	Merge pull request #20 from broadinstitute/aw_reduceread_perf_1_GSA-761 Aw reduceread perf 1 gsa 761	2013-02-07 12:11:05 -08:00
depristo	8f9d317a52	Merge pull request #19 from broadinstitute/eb_add_alignment_utils_tests_GSA-735_GSA-736_GSA-737_GSA-738 Added contracts, docs, and tests for several methods in AlignmentUtils. ...	2013-02-07 12:10:35 -08:00
Eric Banks	9826192854	Added contracts, docs, and tests for several methods in AlignmentUtils. There are over 74K tests being run now for this class! * AlignmentUtils.getMismatchCount() * AlignmentUtils.calcAlignmentByteArrayOffset() * AlignmentUtils.readToAlignmentByteArray(). * AlignmentUtils.leftAlignIndel()	2013-02-07 13:04:24 -05:00
Alec Wysoker	e88bc753aa	Replace map.containsKey followed by map.get with map.get followed by null check.	2013-02-07 11:58:41 -05:00
Alec Wysoker	72e496d6f3	Eliminate unnecessary zeroing out of primitive arrays immediately after new.	2013-02-07 11:57:43 -05:00
depristo	cc7731d61f	Merge pull request #18 from broadinstitute/eb_fix_RR_tests Fixing the failing RR integration tests.	2013-02-06 10:02:13 -08:00
Eric Banks	481982202d	Fixing the failing RR integration tests. * After consulting Tim/David/Mauricio we determined that the md5 changes were due to different encodings of binary arrays in samjdk * However, it made no functional difference to the results (confirmed by Eric) so we agreed to update md5s * Also, the header of one of the test bams was malformed but old picard jar didn't perform checks so it only started failing now * Fixed the bam	2013-02-06 12:40:56 -05:00
depristo	462da7da8f	Merge pull request #17 from broadinstitute/dr_variant_migration_cleanup Minor build.xml cleanup post-variant-migration	2013-02-06 08:32:14 -08:00
David Roazen	df142a389f	Minor build.xml cleanup post-variant-migration -Stop emitting our own (now empty) variant jar -Correct BaseUtils package for the na12878kb jar	2013-02-06 11:16:52 -05:00
eitanbanks	bd0349e570	Merge pull request #12 from broadinstitute/md_exact_fast_path_GSA-726 Fast path for biallelic variants in IndependentAllelesDiploidExactAFCalc	2013-02-06 07:43:02 -08:00
Mark DePristo	59df329776	Fast path for biallelic variants in IndependentAllelesDiploidExactAFCalc -- If the VariantContext is a bi-allelic variant already, don't split up the VC (it doesn't do anything) and then combine it back together. This saves us a lot of work on average -- Be more protective of calls to AFCalc with a VariantContext that might only have ref allele, throwing an exception	2013-02-06 10:34:09 -05:00
eitanbanks	584899329c	Merge pull request #13 from broadinstitute/dr_variant_migration_GSA-692 Replace org.broadinstitute.variant with jar built from the Picard repo	2013-02-06 07:22:30 -08:00
depristo	bee127482b	Merge pull request #16 from broadinstitute/eb_bqsr_fails_on_RR Added check that BaseRecalibrator is not being run on a reduced bam.	2013-02-06 07:20:42 -08:00
Eric Banks	562f2406d7	Added check that BaseRecalibrator is not being run on a reduced bam. - Throws user exception if it is. - Can be turned off with --allow_bqsr_on_reduced_bams_despite_repeated_warnings argument. - Added test to check this is working. - Added docs to BQSRReadTransformer explaining why this check is not performed on PrintReads end. - Added small bug fix to GenomeAnalysisEngine that I uncovered in this process. - Added comment about not changing the program record name, as per reviewer comments. - Removed unused variable.	2013-02-06 10:14:27 -05:00
depristo	c677aa327c	Merge pull request #15 from broadinstitute/eb_hapcaller_dbsnp_fix Bug fix for NPE in HC with --dbsnp argument.	2013-02-06 04:29:23 -08:00
Eric Banks	4e5ff3d6f1	Bug fix for NPE in HC with --dbsnp argument. - I had added the framework in the VA engine but should not have hooked it up to the HC yet since the RefMetaDataTracker is always null. - Added contracts and docs to the relevant methods in the VA engine so that this doesn't happen in the future.	2013-02-05 21:59:19 -05:00
Eric Banks	e7c35a907f	Fixes to BQSR for the --maximum_cycle_value argument. - It's now written into the recal report so that it can be used in the PrintReads step. - Note that we also now write the --deletions_default_quality value which accidentally wasn't being written before! - Added tests to make sure that the value of the --maximum_cycle_value is being used properly by PR with -BQSR. (This is my last non-branch commit; all future pushes will follow new GATK practices)	2013-02-05 17:38:03 -05:00
David Roazen	e7e76ed76e	Replace org.broadinstitute.variant with jar built from the Picard repo The migration of org.broadinstitute.variant into the Picard repo is complete. This commit deletes the org.broadinstitute.variant sources from our repo and replaces it with a jar built from a checkout of the latest Picard-public svn revision.	2013-02-05 17:24:25 -05:00
Ryan Poplin	cb2dd470b6	Moving the random number generator over to using GenomeAnalysisEngine.getRandomGenerator in the logless versus exact pair hmm unit test. We don't believe this will fix the problem with the non-deterministic test failures but it will give us more information the next time it fails.	2013-02-05 12:56:20 -05:00
Mauricio Carneiro	f6bc5be6b4	Fixing license on Yossi's file Somebody needs to set up the license hook ;-)	2013-02-05 11:14:43 -05:00
MauricioCarneiro	050c4794a5	Merge pull request #11 from yfarjoun/per_sample2 -Added Per-Sample Contamination Removal to UnifiedGenotyper: Added an @A...	2013-02-05 08:04:29 -08:00
Eric Banks	00c98ff0cf	Need to reset the static counter before tests are run or else we won't be deterministic. Also need to give credit where credit is due: David was right that this was not a non-deterministic Bamboo failure...	2013-02-05 10:41:46 -05:00
Eric Banks	23c6aee236	Added in some basic unit tests for polyploid consensus creation in RR. - Uncovered small bug in the fix that I added yesterday, which is now fixed properly. - Uncovered massive general bug: polyploid consensus is totally busted for deletions (because of call to read.getReadBases()[readPos]). - Need to consult Mauricio on what to do here (are we supporting het compression for deletions? Insertions are definitely not supported).	2013-02-05 10:35:45 -05:00
Yossi Farjoun	de03f17be4	-Added Per-Sample Contamination Removal to UnifiedGenotyper: Added an @Advanced option to the StandardCallerArgumentCollection, a file which should contain two columns, Sample (String) and Fraction (Double) that form the Sample-Fraction map for the per-sample AlleleBiasedDownsampling. -Integration tests to UnifiedGenotyper (Using artificially contaminated BAMs created from a mixture of two broadly consented samples) were added -includes throwing an exception in HC if called using per-sample contamination file (not implemented); tested in a new integration test. -(Note: HaplotypeCaller already has "Flat" contamination--using the same fraction for all samples--what it doesn't have is _per-sample_ AlleleBiasedDownsampling, which is what has been added here to the UnifiedGenotyper.) -New class: DefaultHashMap (a Defaulting HashMap...) and new function: loadContaminationFile (which reads a Sample-Fraction file and returns a map). -Unit tests to the new class and function are provided. -Added tests to see that malformed contamination files are found and that spaces and tabs are now read properly. -Merged the integration tests that pertain to biased downsampling, whether HaplotypeCaller or UnifiedGenotyper, into a new IntegrationTest class.	2013-02-04 18:24:36 -05:00
Eric Banks	70f3997a38	More RR tests and fixes. * Fixed implementation of polyploid (het) compression in RR. * The test for a usable site was all wrong. Worked out details with Mauricio to get it right. * Added comprehensive unit tests in HeaderElement class to make sure this is done right. * Still need to add tests for the actual polyploid compression. * No longer allow non-diploid het compression; I don't want to test/handle it, do you? * Added nearly full coverage of tests for the BaseCounts class.	2013-02-04 15:55:15 -05:00
Mark DePristo	a281fa6548	Resolves Genome Sequence Analysis GSA-750 Don't print an endless series of starting messages from the ProgressMeter -- The progress meter isn't started until the GATK actually calls execute on the microscheduler. Now we get a message saying "Creating shard strategy" while this (expensive) operation runs	2013-02-04 15:47:30 -05:00
Tad Jordan	eb847fa102	Message "script failed" moved to the correct place in the code GSA-719 fixed	2013-02-04 15:37:23 -05:00
Ryan Poplin	79ef41e7b1	Added some docs, unit test, and contracts to SimpleDeBruijnAssembler. -- Testing that cycles in the reference graph fail graph construction appropriately. -- Minor bug fix in assembly with reduced reads. Added some docs and contracts to SimpleDeBruijnAssembler Added a unit test to SimpleDeBruijnAssembler	2013-02-04 15:17:22 -05:00
Geraldine Van der Auwera	43e3a040b6	Updated UnifiedGenotyper GATKDoc (note on ploidy model)	2013-02-04 14:18:56 -05:00
Chris Hartl	41a030f4b7	Apparently I'm a failure at rebasing...there should have been only one commit message to write. But whatever, here it is again: Part 1 of Variant Annotator Unit tests: PerReadAlleleLikelihoodMap - Added contract enforcement for public methods - Refactored the conversion from read -> (allele -> likelihood) to allele -> list[read] into its own method - added method documentation for non getters/setters - finals, finals everywhere - Add in a unit test for the PerReadAlleleLikelihoodMap. Complete coverage except for .clear() and a method that is a straight call into a separately-tested utility class.	2013-02-04 14:16:28 -05:00
Chris Hartl	3c99010be4	Part 1 of Variant Annotator Unit tests: PerReadAlleleLikelihoodMap - Added contract enforcement for public methods - Refactored the conversion from read -> (allele -> likelihood) to allele -> list[read] into its own method - added method documentation for non getters/setters - finals, finals everywhere - Add in a unit test for the PerReadAlleleLikelihoodMap. Complete coverage except for .clear() and a method that is a straight call into a separately-tested utility class.	2013-02-04 14:16:06 -05:00
Ryan Poplin	d9fd89ecaa	Somehow these md5 updates got lost in my previous git rebase disaster. Sorry for the trouble.	2013-02-04 13:26:18 -05:00
Eric Banks	2d518f3063	More RR-related updates and tests. - ReduceReads by default now sets up-front ReadWalker downsampling to 40x per start position. - This is the value I used in my tests with Picard to show that memory issues pretty much disappeared. - This should hopefully take care of the memory issues being reported on the forum. - Added javadocs to SlidingWindow (the main RR class) to follow GATK conventions. - Added more unit tests to increase coverage of BaseCounts class. - Added more unit tests to test I/D operators in the SlidingWindow class.	2013-02-04 12:57:43 -05:00
Guillermo del Angel	971ded341b	Swap java Random generator for GATK one to ensure test determinism	2013-02-04 10:57:34 -05:00
Guillermo del Angel	5521bf3dd7	Fix bad contract implementation	2013-02-03 16:15:14 -05:00
Guillermo del Angel	f31bf37a6f	First step in better BQSR unit tests for covariates (not done yet): more test coverage in basic covariates, test logging several read groups/read lengths and more combinations simultaneously. Add basic Javadoc headers for PerReadAlleleLikelihoodMap.	2013-02-03 15:31:30 -05:00

11837 Commits (2d802e17a476cef1b13989c5f75e66ccb40d688e)