Commit Graph

4585 Commits (ed7ff65b2ecbc1ccd7f02cf066ea3caa00cc6775)

Author SHA1 Message Date
Geraldine Van der Auwera 118c559278 Trivial doc typo fix 2015-09-25 18:15:29 -04:00
Ami Levy Moonshine 1ad00cc9d4 fix typo in the ASEReadCounter document 2015-09-21 15:30:06 -04:00
meganshand cdfe0d7b7c Adding PER_TARGET_COVERAGE option
Comments addressed
2015-09-18 09:34:51 -04:00
Ron Levine 3ecabf7e45 Allow overriding ValidateVariants' hard-coded cutoff for allele length 2015-09-17 10:49:14 -04:00
ldgauthier 5870225f83 Merge pull request #1153 from broadinstitute/ms_excess_het
Excess Het P-value
2015-09-15 11:52:25 -04:00
Khalid Shakir 24e24b9468 Using `SamIndexes.asBaiSeekableStreamOrNull()` to support `.cram.crai`.
Updated other IntelliJ IDEA warnings in GATKBAMIndex.
Updated example .cram files to match versions generated by current GATK/HTSJDK.
Bumped HTSJDK and Picard to 1.139 releases.
Added support for using `-SNAPSHOT` of HTSJDK in the future.
2015-09-14 12:20:36 -04:00
meganshand d767e1722e Excess Het P-value
Added input exception

Added header line

Updated MD5s

Changing more MD5s

Made edge case clearer

Fixed formatting

Changed mid-point to mode
2015-09-14 12:00:44 -04:00
Laura Gauthier 53b506a0b8 Make sure inputPriors get used if they are specified
Fix usage of AF prior (i.e. theta) in probability of non-reference calculation
Refactored duplicate functions
Updated docs for heterozygosity
2015-09-10 10:08:03 -04:00
Ron Levine 83a7012d69 Mask snps with --snpmask 2015-09-09 16:20:48 -04:00
Eric Banks 5f76ae6a37 Don't have the Indel Realigner change IUPAC reference bases.
This change doesn't affect the performance of the Indel Realigner at all (as per tests).
This is just a request from the Picard side (where further testing is happening).
2015-09-04 13:42:23 -04:00
Ron Levine 29ac64f6ce Calculate GenotypeAnnotations before InfoFieldAnnotations 2015-09-03 09:22:46 -04:00
Laura Gauthier a86f3909ca Update md5s for BAM header version change in Queue test output 2015-08-28 14:19:25 -04:00
Laura Gauthier 3dc68732fb Little changes to M2 code and docs
Make MQ threshold a parameter (compare to M1 by setting to zero)
Add logic for multiple alternate alleles in tumor
Exclude MQ0 normal reads from normal LOD calculation
Fix path errors in Dream_Evaluations.md
Move M2 eval scripts out of walkers package so they run
2015-08-27 15:31:27 -04:00
Mark Fleharty daeb55429e Adding Static Binning to BQSR 2015-08-24 13:36:17 -04:00
Ron Levine 2afe3f7a21 Make GenotypeGVCFs subset Strand Allele Counts intelligently 2015-08-22 08:33:09 -04:00
Bertrand Haas 158477ea6c Re-ran the updateAllLicenses.sh script 2015-08-21 11:32:51 -04:00
Ron Levine 900fe3f675 Merge pull request #1132 from broadinstitute/rhl_rev_htsjdk
Move htsjdk & picard to rev 1.138
2015-08-20 11:58:41 -04:00
Bertrand Haas eae4c875a9 Logistic transform of MQ + jitter to capped MQ in VariantDataManager 2015-08-20 11:10:45 -04:00
Ron Levine beec624a63 Move htsjdk & picard to rev 1.138 2015-08-20 10:42:25 -04:00
meganshand 5c9935ba10 Adding CollectWgsMetrics wrapper for queue
Fix license

Fixed IncludeBQHistogram
2015-08-14 10:18:12 -04:00
Yossi Farjoun 69fd4af15a Merge pull request #1111 from jsilter/overclippedreadfilter_endsoption
Add additional option to OverclippedReadFilter
2015-08-12 10:43:52 -04:00
Jacob Silterra 62625b4bc6 Add option to not require soft-clips on both ends
Previous version of OverclippedReadFilter would only filter a read if both ends of a read had a soft-clipped block.
This adds a boolean option to relax that requirement, and only require 1 soft-clipped block, while also filtering on read length - softclipped length
2015-08-12 10:38:27 -04:00
Khalid Shakir 9bee183f6c Switched to using CRAM's SamReader.Indexing implementation.
CRAM now requires .bai index, just like BAM.
Test updates:
- Updated existing MD5s, as TLEN has changed.
- Tests multiple contigs.
- Tests several intervals per contig.
- Tests when `.cram.bai` is missing, even when `.cram.crai` is present.
Updated gatk docs for CRAM support, including:
- Arguments that work for both BAM and CRAM listed as such.
- Arguments that don't work for CRAM either explicitly say "BAM" or "doesn't work for CRAM".
- Instructions on how to recreate a `.cram.bai` using cramtools.
Cleaned up IntelliJ IDEA warnings regarding `Arrays.asList()` -> `Collections.singletonList()`.
2015-08-11 17:52:49 -03:00
Geraldine Van der Auwera 19bbe45cbc Updated licenses for 2015 2015-08-06 15:23:11 -04:00
David Benjamin ddb01058d3 moved DiffObjects 2015-08-05 21:19:02 -04:00
Geraldine Van der Auwera 875c7ffa1a Fixed typos and made some argument docs improvements 2015-07-29 23:06:19 -04:00
Louis Bergelson 9d9827f176 Merge pull request #1031 from broadinstitute/lb_update_for_java8
Updated gatk so it compiles with java 8
2015-07-28 11:09:19 -04:00
vruano 8f6daf70db Refactoring of ReferenceConfidenceModel likelihood calculation in non variant sites
Changed a division by -10.0 to a multiplication by -.1 in QualUtils (typically multiplication is faster than division).

Addresses performance issue #1081.
2015-07-26 08:33:46 -04:00
David Roazen 5fd3d2be76 Move swapExt() methods to QScriptUtils, have versions in QScript class call into the util versions 2015-07-23 10:23:55 -04:00
Valentin Ruano Rubio 66cf22b28f Merge pull request #1069 from broadinstitute/vrr_ad_genotype_gvcfs_bugfix
Fix AD propagation when subsetting alleles in non-diploid GenotypeGVCF.
2015-07-22 18:53:43 -04:00
vruano 315e193e51 Fix AD propagation when subsetting alleles in non-diploid GenotypeGVCF.
Addresses issue #913.

Also remove some commented out code and toxic debugging code that uses System.out/err.println.
2015-07-22 17:08:13 -04:00
Joseph White 3bd988825f Removed walkers for handling Beagle data
Added deprecation statements to DeprecatedToolChecks.java
    Removed integration test for Beagle walker
    Added URL for Beagle documentation
2015-07-21 18:36:08 -04:00
Eric Banks 178bf12b27 Merge pull request #1046 from broadinstitute/rhl_catvariants_sort
Fix for mis-sorted VCF files in CatVariants
2015-07-21 17:37:27 -04:00
Valentin Ruano Rubio 9360e1d293 Merge pull request #1059 from broadinstitute/vrr_true_false_list_removal
More efficient implementation of the indel read qualities recalculati…
2015-07-21 17:13:45 -04:00
vruano 82f1236633 More efficient implementation of the indel read qualities recalculation for the PCR error model.
Addresses #1054.
2015-07-21 14:25:11 -04:00
Ron Levine 6e46b3696e Merge contiguous intervals properly 2015-07-14 15:23:37 -04:00
John Wallace 8fc631b7ae Fix for mis-sorted VCF files in CatVariants
When using CatVariants, VCF files were being sorted solely on the base
pair position of the first record, ignoring the chromosome.  This can
become problematic when merging files from different chromosomes,
espeically if you have multiple VCFs per chromosome.

As an example, assume the following 3 lines are all in separate files:
1       10
1       100
2       20

The merged VCF from CatVariants (without -assumeSorted) would read:
1       10
2       20
1       100

This has the potential to break tools that expect chromosomes to be
contiguous within a VCF file.

This commit changes the comparator from one of Pair<Integer, File> to
one of Pair<VariantContext, File>.  We construct a
VariantContextComparator from the provided reference, which will sort
the first record by chromosome and position properly.  Additionally, if
-assumeSorted is given, we simply use a null VariantContext as the first
record, which will all be equal (as all will be null)
2015-07-14 14:12:31 -04:00
Louis Bergelson e1c41b2c38 Updated gatk so it compiles on java 8
updated cofoja to 1.2 from 1.0
added explicit type casts in places that java 8 required them
2015-06-26 15:59:46 -04:00
Ron Levine 09686f4595 Make VQSLOD definition accurate 2015-06-25 16:47:50 -04:00
Geraldine Van der Auwera 719bb15340 Merge pull request #1019 from broadinstitute/rhl_var_index_param_gz
Indexing parameters not required if output file has the g.vcf.gz exte…
2015-06-17 14:30:20 -04:00
Geraldine Van der Auwera 697c4b0cf1 Added else clause to handle symbolic alleles
Add test for createAlleleMapping
2015-06-17 10:52:56 -04:00
Laura Gauthier ce5ecf1383 Enable contamination correction via downsampling (as for HaplotypeCaller), added test
Add oxoG read count annotation and add as default annotation
Add ##SAMPLE VCF header line in accordance with TCGA VCF spec, specifying "File" line in sample header with BAM file name and "SampleName" with BAM sample name (Don't print sample file path if --no_cmdline_in_header is specified to help with test consistency)
Turn on active region assembly-based physical phasing for M2
Clean up M2-related annotations so UG doesn't crash if M2 annotations are called
2015-06-15 07:59:15 -04:00
Ron Levine b35085ca28 Indexing parameters not required if output file has the g.vcf.gz extensionv 2015-06-13 11:46:56 -04:00
Ron Levine dbed660183 Add spannning deletions allele 2015-06-12 16:43:06 -04:00
Joseph White 398dc7a123 Changed error message for Contigs Out of Order
Changed confusing error message for out of order contigs

Updated Exception message.
2015-06-11 21:46:06 -04:00
Geraldine Van der Auwera 2a7f95eddb Merge pull request #1009 from broadinstitute/gg_patch_depthofcoverage_#1002
User (mnw21cam) patch to fix DoC slowdown in 3.4
2015-06-10 11:16:08 -04:00
droazen 5e3f3d69db Merge pull request #1012 from broadinstitute/rhl_build_vec_pairhmm_lib
Built VectorLoglessPairHMM lib with icc with gcc 4.4.7
2015-06-08 15:25:57 -04:00
Geraldine Van der Auwera 95f2899f05 User (mnw21cam) patch to fix DoC slowdown in 3.4 2015-06-05 21:12:46 -04:00
Louis Bergelson ebdda72c88 fix typo in queue arguments 2015-06-05 17:06:23 -04:00
Ron Levine 40d8fb99a3 Built VectorLoglessPairHMM lib with icc with gcc 4.4.7 2015-06-05 15:38:25 -04:00