Commit Graph

1528 Commits (2d06f332276fdb25e8ca59c8d660d3118d2a09a4)

Author SHA1 Message Date
Geraldine Van der Auwera bd2626bea2 Add colt > cern.jet.normal dependency 2016-06-01 13:24:50 -04:00
Ron Levine 30665c7dbc Move Move htsjdk and picard to version 2.4.1 2016-05-31 22:36:38 -04:00
Geraldine Van der Auwera a76cb052e2 Ability to retry building VQSR model (contributed by mdp) 2016-05-31 18:57:55 -04:00
Geraldine Van der Auwera d87345cd1d GATKDocs overhaul
- Fixed displaying of default values
    - Removed code cruft
    - Reorganized tooldoc categories and improved names
    - Reorganized tools within categories where applicable
    - Touched up various tool docs
    - Switched default gatkdocs output to html
    - Added parameter in agrregator pom to control output type
    - Set gatkdocs publishing script to output php
    - Deprecated GenotypeAndValidate walker
    - Added back PhoneHome arguments with @Deprecated annotations
2016-05-29 16:35:08 -04:00
Geraldine Van der Auwera efbbbb1bd9 Add M2 to the HC annotations check 2016-05-27 13:49:31 -04:00
Geraldine Van der Auwera c4a06ad20a Move indel realignment to public 2016-05-27 12:39:58 -04:00
Valentin Ruano Rubio 9d32dec9cd Fix for the sum(AD) > DP bug.
Closes issue #1340
2016-05-26 15:04:52 -04:00
Yossi Farjoun 25fa25b618 Added option to validate gvcf (for ValidateVariants) (#1379)
* with option --gvcf CLP will now put extra checks that a gvcf must adhere to (existance of <NON_REF> allele at every variant, and that the variants in total cover the entire requested intervals, or the whole genome if no intervals have been specified)
* works on gvcf produced by HC when using either GVCF or BP_RESOLUTION mode
* added positive and negative tests
2016-05-26 06:42:45 -04:00
Steve Huang e1fadae139 Fix error in InfiniteRandomMatingPopulationModel.getLikelihoodsCalculator
Same issue noticed in GATK4 [here](https://github.com/broadinstitute/gatk/issues/1856)
2016-05-25 17:23:26 -04:00
Geraldine Van der Auwera 2c8356519c Merge pull request #1375 from broadinstitute/gvda_fix_offbyone_maxAltAlls_M2_#1297
Fixed M2 max alt alleles threshold evaluation error
2016-05-19 13:59:02 -04:00
samuelklee 7fdd3c2a0c Merge pull request #1358 from broadinstitute/sl_issue_1327
Changed calls with GQ=0 to no-call for HaplotypeCaller in normal mode.
2016-05-19 12:26:35 -04:00
Laura Gauthier 644076b1e1 Add fix and test for finalizing MQ annotation at BP resolution for variant and ref samples
Addresses issue #1356
2016-05-19 08:15:30 -04:00
Geraldine Van der Auwera f5456a3761 Fixed M2 max alt alleles threshold evaluation error
Also clarified some argument docs
2016-05-18 21:54:30 -04:00
Samuel Lee bf4b1a5421 Changed calls for GQ=0 from 0/0 to ./. for HaplotypeCaller in normal mode. 2016-05-18 13:17:27 -04:00
Ron Levine 35a06879f1 Move htsjdk and picard to version 2.3.0 2016-05-16 14:50:00 -04:00
David Benjamin 8623830267 Fixed bug in which consecutive SPAN_DELS were merged into a ** MNP.# 2016-05-06 01:36:14 -04:00
samuelklee aa0c76a166 Merge pull request #1326 from broadinstitute/sl_issue_1293
Added maxNumPLValues argument to allow users to set maximum number of PL values in output.
2016-04-28 10:52:27 -04:00
David Benjamin aecaa6d38e Allow GenotypeGVCFs to emit ref sites. 2016-04-27 15:53:44 -04:00
Geraldine Van der Auwera 14fe8b1e0e Moved BQSRGatherer and dependencies to the public module 2016-04-27 07:15:28 -04:00
Samuel Lee e08940a5a8 Added maxNumPLValues argument to allow users to set maximum number of PL values in output. 2016-04-26 23:30:25 -04:00
Ron Levine f337b45724 Move htsjdk and picard to version 2.0.0
Conflicts:
	protected/gatk-tools-protected/src/test/java/org/broadinstitute/gatk/tools/walkers/genotyper/UnifiedGenotyperGeneralPloidySuite1IntegrationTest.java
	protected/gatk-tools-protected/src/test/java/org/broadinstitute/gatk/tools/walkers/genotyper/UnifiedGenotyperGeneralPloidySuite2IntegrationTest.java
	protected/gatk-tools-protected/src/test/java/org/broadinstitute/gatk/tools/walkers/genotyper/UnifiedGenotyperIndelCallingIntegrationTest.java
	protected/gatk-tools-protected/src/test/java/org/broadinstitute/gatk/tools/walkers/haplotypecaller/HaplotypeCallerIntegrationTest.java
2016-04-25 14:51:25 -04:00
meganshand 509400495b Changes edge case calculation for RankSumTest #1341 2016-04-22 14:41:05 -04:00
David Benjamin c040da427d Replace string literals for annotation groups. Closes #1216. 2016-04-19 15:54:44 -04:00
Ron Levine e2828104b1 SelectVariants and VariantFiltration not updating AC, AN and AF for --setFilteredGtToNocall 2016-04-17 10:24:05 -04:00
Ron Levine 0eba8822e2 Change HashMap to LinkedHashMap for predictable iteration 2016-04-10 20:10:38 -04:00
Laura Gauthier d573fc4adf Add some comments to AFCalculationResult
Add note that VC may be null (in this case because there are too many alts)
Add todo for possible inefficient code
2016-04-04 09:35:54 -04:00
Geraldine Van der Auwera 68b068d2b3 Merge pull request #1329 from broadinstitute/cn_murmur_hash
Use older version of murmur hash implementation included in gatk…
2016-03-31 13:00:43 -04:00
Ron Levine edc1b20132 Output a summary of WARN messages 2016-03-29 11:39:18 -04:00
Chris Norman 017828e4b7 Use the older murmur hash implementation that is included in the gatk jar. 2016-03-29 10:20:44 -04:00
Chris Norman c81795cf50 Fix license in DroppedReadsTracker. 2016-03-22 13:39:13 -04:00
Chris Norman a8ebf21ac3 Implementation of dropped read tracking. 2016-03-21 18:36:29 -04:00
Ron Levine b873467756 Merge pull request #1321 from broadinstitute/rhl_fix_hr_logger_name
Change logger class to HomopolymerRun
2016-03-16 11:44:40 -04:00
Ron Levine 5c0e97def4 Change logger class to HomopolymerRun 2016-03-16 10:01:21 -04:00
Laura Gauthier 9ffdfeccd5 Add test case for fix 2016-03-15 11:06:48 -04:00
Laura Gauthier 31fc64f82c To address weird case with all hom-refs, but alt allele is present, skip AS_QD for all-ref sites and remove raw annotations whether or not they can be successfully finalized 2016-03-15 09:34:47 -04:00
Ron Levine 6d7ec7377c Set --reference_window_stop if homopolymer is greater than window size 2016-03-14 18:09:41 -04:00
Geraldine Van der Auwera c81d5b898e Clarify VQSR inputs documentation 2016-03-11 16:38:33 -05:00
Geraldine Van der Auwera 4990ed706a Fixup for licensing update 2016-03-11 16:23:02 -05:00
Takuto Sato 6308b0f036 Add MuTect2 Tumor-only test. 2016-03-11 09:10:43 -05:00
ldgauthier d0432713e0 Merge pull request #1311 from broadinstitute/ldg_BetaTestAnnotationsGroup
Add classes from "annotation party" to BetaTesting group
2016-03-11 08:13:32 -05:00
Geraldine Van der Auwera 16ef36088e Merge pull request #1308 from broadinstitute/gvda_fix_license_quotes_#1307
Update licenses
2016-03-10 12:37:53 -05:00
Laura Gauthier 28cfb06513 Add AS_culprit and AS_VQSLOD to VCF header in ApplyRecalibration so output passes VCF validation 2016-03-09 08:22:58 -05:00
Laura Gauthier d9f9bd1d56 Add classes from "annotation party" to BetaTesting group 2016-03-09 08:17:44 -05:00
Geraldine Van der Auwera 9a306ca221 Update licenses 2016-03-05 01:09:43 -08:00
Geraldine Van der Auwera 2b70f14740 Misc documentation improvements
Added caveat to VariantFiltration documentation
  Fixed PON creation example in M2 doc
  Improved MalformedReadFilter doc
  Updated N CIGAR error message
2016-03-03 15:48:54 -08:00
Ron Levine e5c5804141 Modify MD5s to correct RankSum annotations 2016-03-01 14:05:51 -05:00
Ron Levine 625941dc50 Merge pull request #1290 from broadinstitute/rhl_sac_nonref
StrandAlleleCountsBySample selects most likely only from VCF alleles
2016-02-29 17:05:57 -05:00
Ron Levine 5e2ffc188b Merge pull request #1295 from broadinstitute/rhl_sv_error_output_1194
Correct error messages and error handling in multiple tools
2016-02-29 17:05:24 -05:00
Ron Levine 40a5adf767 Change error output to use the correct argument 2016-02-29 13:21:03 -05:00
Ron Levine 80a22aad77 StrandAlleleCountsBySample selects most likely only from VCF alleles 2016-02-29 12:41:55 -05:00
meganshand c7e0f5b225 Removes Dithering from Rank Sum Test
Fixing empty group case

Fixing MD5s

First comments addressed

Added permutation test

Adding new RankSum to AS_RankSum

Speeding up permutation algorithm and updating MD5s

Missed a few tests

Addressing comments

Changing md5s
2016-02-29 11:45:27 -05:00
Takuto Sato 243a0fcb74 Allele-specific insert size ranksum annotation 2016-01-28 16:03:57 -05:00
Laura Gauthier 5592e4ead0 Add new -AS mode to run VQSR (both VariantRecalibrator and ApplyRecalibration) in an allele-specific manner 2016-01-22 13:18:21 -05:00
Ron Levine ed933013fe Remove variant contig order check 2016-01-16 19:32:28 -05:00
Eric Banks c57c32b915 Merge pull request #1270 from broadinstitute/eb_small_genotyping_optimizations
Small optimizations to the joint calling code.
2016-01-13 08:46:57 -05:00
Eric Banks ab2f541d1f Small optimizations to the joint calling code.
Thanks to profiling I noticed that the determineCoefficient() method was being called too often.
Because it returns a constant result in half of the invocations, its value should be cached when possible.
Also, the various calls to getLog10Likelihoods() showed up in the profiler, so I pulled those out too.

All told, it speeds up the genotyping by about 10 percent according to the profiler.
2016-01-12 21:48:20 -05:00
Laura Gauthier 593c9ddf01 Allow VariantsToTable to evaluate the type of each split variant when -F TYPE and -SMA are specified 2016-01-12 08:12:29 -05:00
Laura Gauthier 204cad3646 Remove "mem_free" from resident memory request params for Queue because it doesn't work and wouldn't actually reserve memory anyway 2016-01-08 10:27:56 -05:00
Ron Levine d16ed98c9e Backport maxNoCall functionality from GATK4 2016-01-06 11:09:38 -05:00
ldgauthier 1d72ab099c Merge pull request #1247 from broadinstitute/ldg_VQSRmodelOutput
Add optional argument for VQSR to output the model to a file as a GAT…
2016-01-05 08:35:18 -05:00
Ron Levine fa1d90d236 Merge consecutive SNPs on the same read 2016-01-04 13:48:59 -05:00
Laura Gauthier 2ddc48914e Add optional argument for VQSR to output the model to a file as a GATKReport
GATKReport output also has mean and variance for annotation normalization info
2016-01-04 08:37:08 -05:00
ldgauthier 71c6709765 Merge pull request #1145 from broadinstitute/ldg_M2_HapMapSensitivity
Fix no-normal bug; add HapMap sensitivity benchmarking
2016-01-04 08:27:37 -05:00
Ron Levine aa5e88a393 Fix exception when writing gVCF to stdout 2015-12-29 15:30:53 -05:00
meganshand eb6bdb2a62 MQ of Mate RankSum annotation
Intermediate commit for tests

Adding tests

Fixing tests after rebase

Fixing one MD5

Fixing documentation

Removing annotation from standard group

Adding documentation
2015-12-23 10:24:40 -05:00
Laura Gauthier f9e9d2e273 Fix no-normal bug; add HapMap sensitivity benchmarking 2015-12-22 08:29:01 -05:00
Ron Levine 9c8f035780 LeftAlignAndTrimVariants --splitMultiallelics keeps GT if valid 2015-12-14 10:42:32 -05:00
Geraldine Van der Auwera 4767a83d8a Update pom versions to mark the start of GATK 3.6 development 2015-11-25 01:52:51 -05:00
Geraldine Van der Auwera bf875974d1 Prep MuTect2 and ContEst for release
Renamed M2 to MuTect2
    Renamed ContaminationWalker to ContEst
    Refactored related tests and usages (including in Queue scripts)
    Moved M2 and ContEst + accompanying classes from private to protected
    Made QSS a StandardSomaticAnnotation (new annotation group/interface) to prevent it from being sucked in with the rest of the StandardAnnotation group
2015-11-24 16:43:20 -05:00
Mark Fleharty 5a3756410c Merge pull request #1231 from broadinstitute/mf_fixBQSRIntegrationTest
Fixes testPRWithConflictingArguments_qqAndSQQ to use -ql rather than -q1
2015-11-23 17:08:30 -05:00
Geraldine Van der Auwera b0730c2b81 Merge pull request #1239 from broadinstitute/gvda_straggler_doc_fixes_1237
Improve doc block of GatherBqsrReports
Annotation doc enhancements (QD, InbreedingCoeff, ExcessHet and AS versions where applicable)
2015-11-22 13:58:20 -05:00
Geraldine Van der Auwera a7748368f8 Yet more doc improvements prior to 3.5 release
Improve doc block of GatherBqsrReports
    Annotation doc enhancements (QD, InbreedingCoeff, ExcessHet and AS versions where applicable)
2015-11-22 10:59:24 -05:00
Geraldine Van der Auwera 46ba0e519e Restore FindCoveredIntervals + add docs 2015-11-22 10:19:04 -05:00
Geraldine Van der Auwera 22fa1511be Merge pull request #1235 from broadinstitute/gvda_deprecate_useless_tools_1192
Deprecate tools that were outdated or redundant
2015-11-21 14:58:00 -05:00
Geraldine Van der Auwera 1cf66addaa Deprecate tools that were outdated or redundant
ReadAdaptorTrimmer (unsound and untested)
BaseCoverageDistribution (redundant with DiagnoseTargets)
CoveredByNSamplesSites (redundant with DiagnoseTargets)
FindCoveredIntervals (redundant with DiagnoseTargets)
VariantValidationAssessor (has a scary TODO -- REWRITE THIS TO WORK WITH VARIANT CONTEXT comment and zero tests)
LiftOverVariants, FilterLiftedVariants and liftOverVCF.pl (in #1106) (use Picard liftover tool)
sortByRef.pl (use Picard SortVCF)
ListAnnotations (useless)

Also deleted the java archive from the private repository (old junk we never use)
2015-11-20 22:49:40 -05:00
meganshand 2570cab24c Assorted documentation fixes, enhancements and reorganization.
See issues referenced by the pull request for details.
2015-11-20 22:44:46 -05:00
Mark Fleharty 1443ee8c7f Fixes testPRWithConflictingArguments_qqAndSQQ to use -ql rather than -q1 2015-11-20 11:23:02 -05:00
Ron Levine ccaddefa19 Validate VCF with sequence dictionary 2015-11-20 09:23:24 -05:00
Yossi Farjoun 4da0d1300c adding fraction informative reads annotation. 2015-11-18 08:39:47 -05:00
David Roazen 9d5be24778 Move GatherBqsrReports from private to protected 2015-11-10 17:40:58 -05:00
Laura Gauthier 25b8ba45f4 More allele-specific annotations: AS_QD and AS_InbreedingCoeff
Grouped default output annotations to keep them from getting dropped when -A is specified; addresses #918
Also refactored code shared by ExcessHet and InbreedingCoeff
2015-11-09 16:38:31 -05:00
vruano e3d5d96076 Added the AF indepdent calculator for any ploidy but seems that is not doing a good job for haploid
Addresses issue #1078 by implementing a any-ploidy version of the independent-allele-exact-ac-calculator already available for diploids.

Notice that this will change result somewhat when dealing with noisy data (low GQs).
2015-11-07 16:17:30 -05:00
Mark Fleharty 8857bc9b3f Resolves issue #1061 to use testid1 rather than testid in two integrationtests. 2015-11-05 22:20:20 -05:00
Eric Banks 975f8a8502 Merge pull request #1206 from broadinstitute/eb_suppress_alt_allele_warnings
Suppress emission of the scary warning message from genotyping to no …
2015-11-05 16:12:50 -05:00
Eric Banks 2cc7de4886 Suppress emission of the scary warning message from genotyping to no more than once in
anything but DEBUG logging mode.  Otherwise it fills up our output logs.
2015-11-05 14:19:21 -05:00
meganshand e4627ed5c3 Addressing comments 2015-11-04 11:00:01 -05:00
meganshand b5165b8d30 Fix for out of date VCF version output 2015-11-03 17:35:47 -05:00
ldgauthier 3d1dc303b3 Merge pull request #1197 from broadinstitute/ts_ve_nullPointer
Prevent null pointer exception in PrintMissingComp module
2015-11-02 14:42:50 -05:00
Takuto Sato 33462c7b50 Removed the line that caused a null pointer, as the information it logged was not useful. Updated docs and added an integration test to ensure the code no longer throws the exception. 2015-11-02 12:45:09 -05:00
Laura Gauthier f7eb5d3082 Enable family-level stratification (if a ped file is provided) 2015-10-28 09:55:04 -04:00
Laura Gauthier 68a2f1243d Finished draft of code for new map-combine-reduce annotation framework
All VQSR annotations can be generated in allele-specific mode
Pull out allele-specific annotations in AS_Standard annotation group
2015-10-27 09:44:49 -04:00
Laura Gauthier fcaf37279c Finished draft of code for new map-combine-reduce annotation framework
All VQSR annotations can be generated in allele-specific mode
Pull out allele-specific annotations in AS_Standard annotation group
2015-10-27 09:23:29 -04:00
Ron Levine 36ca9fe898 Allow LeftAlignAndTrimVariants to handle alleles longer than the default processing window 2015-10-25 20:33:56 -04:00
Ron Levine 795fe75886 Update doc for multiallelics, trimming is the default behavior 2015-10-22 04:04:09 -04:00
Takuto Sato df7a482335 VariantAnnotator now supports annotating FILTER field from an external resource.
Updated the docs.
2015-10-14 14:26:21 -04:00
Ron Levine 2bcded11cb VariantAnnotator checks alleles when annotationg with external resource 2015-10-08 17:01:30 -04:00
Eric Banks 622ec352bb Fix for combining records in which one has a spanning deletion and needs a padded reference allele.
This was erroring out and not working.
2015-10-02 16:28:16 -04:00
Kate Noblett 506958a0b7 Implemented a new VariantEval evaulation module, MetricsCollection. Fixed null pointer exception, updated tests. 2015-09-30 17:21:30 -04:00
Ron Levine 792142ec50 Implement BaseCounts per-sample 2015-09-30 08:59:11 -04:00
Samuel Lee c7f76b945e addressing PR comments 2015-09-24 15:42:51 -04:00