Commit Graph

13891 Commits (919c3eaa2e0c4038f3a09ef8440dadd814de4e31)

Author SHA1 Message Date
Geraldine Van der Auwera 919c3eaa2e Numerous doc fixes; mostly formatting and clarifications 2015-05-03 00:28:46 +02:00
droazen 9c2dd29bd0 Merge pull request #952 from broadinstitute/dr_queue_qsub_broad
Queue: add -qsub-broad argument
2015-04-27 20:24:50 -04:00
David Roazen 19ceca5e86 Queue: add -qsub-broad argument
When -qsub-broad is specified instead of -qsub, use the "h_vmem" parameter
instead of "h_rss" to specify memory limit requests.

Also cause the GridEngine native arguments to be output by default to the logger,
instead of only when in debug mode.
2015-04-27 17:43:25 -04:00
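The parameter switch described in this commit can be sketched as below. This is a minimal illustration, not Queue's actual code: the function name, the megabyte unit, and the `-l resource=value` string layout are assumptions made for the example. Only the `h_vmem` and `h_rss` resource names come from the commit message.

```python
def memory_limit_argument(memory_mb, qsub_broad=False):
    """Build a GridEngine-style memory request string.

    Hypothetical helper: with qsub_broad=True the request uses "h_vmem",
    otherwise "h_rss", mirroring the behaviour the commit message describes.
    The "-l name=value" layout and the "M" suffix are assumptions.
    """
    resource = "h_vmem" if qsub_broad else "h_rss"
    return f"-l {resource}={memory_mb}M"


# Plain -qsub style request versus -qsub-broad style request.
print(memory_limit_argument(4096))
print(memory_limit_argument(4096, qsub_broad=True))
```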
Geraldine Van der Auwera cb4c3ea09e Merge pull request #943 from broadinstitute/ldg_fixAllowNonUniqueKmers
Fix implementation of allowNonUniqueKmersInRef so that it applies to all...
2015-04-27 04:08:50 +02:00
Laura Gauthier 97caf94807 Fix implementation of allowNonUniqueKmersInRef so that it applies to all kmer sizes 2015-04-23 13:01:47 -04:00
Eric Banks d4753f189f Merge pull request #930 from broadinstitute/rhl_bad_edge_pileup
Bypass edge alignment reads while making pileup
2015-04-21 19:02:14 -04:00
Ron Levine d5f98e99f0 Bypass reads with a bad CIGAR length 2015-04-21 11:55:56 -04:00
Geraldine Van der Auwera bfcac455c9 Merge pull request #932 from broadinstitute/yf_fix_picard_md
Fix the scala wrapper for Picard MarkDuplicates
2015-04-16 12:08:39 -04:00
kshakir 8329fe46d1 Merge pull request #934 from broadinstitute/ks_cat_variants_extensions
CatVariants now allows different input / output file types.
2015-04-15 11:44:56 -03:00
Khalid Shakir 90b579c78e CatVariants now allows different input / output file types.
Escaping the CatVariantsIntegrationTest classpaths for possible spaces in the directory names.
2015-04-13 14:39:46 -03:00
Yossi Farjoun a7487e282a Since Picard MarkDuplicates moved to a different package, this class was broken. Here's the fix. It would be good to have tests for all the Scala Picard wrappers, but that is out of scope for this commit. 2015-04-13 08:44:30 -04:00
kcibul 6adf91d49f Merge pull request #922 from broadinstitute/kc_refactor_acs
Refactoring of HC & M2 Argument Collections
2015-04-08 09:52:11 -04:00
Kristian Cibulskis 45610a142c initial refactoring of arguments into individual argument collections
fix blasted license blurbs

updates based on PR comments (abstractify HaplotypeCallerArgumentCollection into AssemblyBasedCallerArgumentCollection)

comments on comments from PR review
2015-04-07 16:55:32 -04:00
Geraldine Van der Auwera 2053afe52a Merge pull request #914 from broadinstitute/ldg_fixDitheringRandomness
Initialize annotations so that --disableDithering actually works
2015-04-06 15:40:30 -04:00
Geraldine Van der Auwera 7cc802f3d0 Merge pull request #915 from broadinstitute/yf_fix_missing_filename_in_error
added the missing file to the error message
2015-04-06 15:33:22 -04:00
Yossi Farjoun d30a6258bc added the missing file to the error message 2015-04-06 08:21:55 -04:00
Laura Gauthier 9c842df3a3 Initialize annotations so that --disableDithering actually works 2015-04-02 17:34:46 -04:00
droazen 761e456d07 Merge pull request #911 from broadinstitute/ab_vcf_cmd_line_header_909
Modify GATK command line header for unique keys
2015-04-02 16:52:42 -04:00
Alex Baumann 024ec69e97 Modify GATK command line header for unique keys
The GATK command line header keys were being repeated in the VCF and
subsequently lost to a single key value by HTSJDK.  This resolves
the issue by appending the name of the walker after the text
"GATKCommandLine" and a number after that if the same walker was
used more than once in the form: GATKCommandLine.(walker name) for
the first occurrence of the walker, and GATKCommandLine.(walker name).#
where # is the number of the occurrence of the walker (e.g.
GATKCommandLine.SomeWalker.2 for the second occurrence of SomeWalker).
Integration test added to EngineFeaturesIntegrationTest to verify
two runs of same walker follow expected form.

Resolves #909
See also: HTSJDK #43
2015-04-02 13:56:11 -04:00
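The key-naming rule described in this commit message can be sketched as follows. This is an illustrative reconstruction from the message text only, not the GATK implementation: the function name and the list-of-walker-names input are assumptions. The first occurrence of a walker gets `GATKCommandLine.(walker name)`, and later occurrences get the occurrence number appended.

```python
from collections import Counter


def command_line_header_keys(walker_names):
    """Return one unique header key per walker run, in order.

    Hypothetical helper following the scheme in the commit message:
    first run of a walker  -> GATKCommandLine.<walker>
    n-th run (n >= 2)      -> GATKCommandLine.<walker>.<n>
    """
    seen = Counter()
    keys = []
    for name in walker_names:
        seen[name] += 1
        occurrence = seen[name]
        if occurrence == 1:
            keys.append(f"GATKCommandLine.{name}")
        else:
            keys.append(f"GATKCommandLine.{name}.{occurrence}")
    return keys


# Two runs of SomeWalker with a different walker in between.
print(command_line_header_keys(["SomeWalker", "OtherWalker", "SomeWalker"]))
```

Because each key is distinct, a header store that keeps only one value per key no longer collapses repeated runs of the same walker into a single entry.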
Geraldine Van der Auwera c2c0037969 Merge pull request #910 from broadinstitute/rhl_mv_example_doc
Update -mv example documentation
2015-04-01 14:12:47 -04:00
Ron Levine fe87484074 Update -mv example documentation
Made general doc fixes
2015-04-01 02:37:42 -04:00
Geraldine Van der Auwera d7f7022dce Merge pull request #904 from broadinstitute/pd_orig_dp
Added keepOriginalDP argument to SelectVariants
2015-03-30 09:01:33 -04:00
ldgauthier 0101003138 Merge pull request #899 from broadinstitute/ldg_M2_tandemRepeatsAndContamination
Lots of changes to M2:
2015-03-30 07:58:35 -04:00
Geraldine Van der Auwera 87b3dddb39 Merge pull request #894 from broadinstitute/gg_ami_docs_license
Edited ASEReadCounter documentation
2015-03-28 13:15:24 -04:00
Laura Gauthier 9a73196813 Fix license 2015-03-27 18:26:20 -04:00
Laura Gauthier 5a10758e2e Annotation changes for M2:
Build a ReferenceContext in ActiveRegionWalkers to pass in to annotation engine so we can call the TandemRepeatAnnotator from M2
Make TandemRepeatAnnotator default annotation for M2.
Setup (but don't use yet) HC-style contamination downsampling.
New HC integration test with TandemRepeatAnnotator
2015-03-27 18:25:23 -04:00
droazen 1f81c033f5 Merge pull request #890 from broadinstitute/rhl_writers_use_index_args
Automatically choose indexing strategy by file extension
2015-03-27 15:00:58 -04:00
Ron Levine aef0a83c52 Automatically choose indexing strategy by file extension 2015-03-27 11:10:35 -04:00
Geraldine Van der Auwera 9b812308b1 Edited ASEReadCounter documentation
Also changed output file variable type from String to Enum
2015-03-26 02:43:53 -04:00
Geraldine Van der Auwera b47bebb5a3 Merge pull request #900 from broadinstitute/ami-fix-MQ-after-Indel-Realingment-palantir
create a new read filter that can revert the MQ of reads that were reali...
2015-03-26 01:08:05 -04:00
Ami Levy-Moonshine bb76821942 New (private) read filter that can revert the MQ of reads affected by indel realignment.
In the process of realignment we add +10 to the MQ of each realigned read, breaking the MQ 60 cap. That may contribute to problems observed by Bertrand (a step in the ROC curve plot of the VQSR results)
2015-03-26 01:07:08 -04:00
Phillip Dexheimer c97c253ec8 Added keepOriginalDP argument to SelectVariants
Fixes #830
2015-03-25 22:45:31 -04:00
Geraldine Van der Auwera a77c8dd421 Merge pull request #895 from broadinstitute/pd_indel_qd
Remove indel-length normalization of QD for GGVCFs

Following up on the VA question in #901
2015-03-25 11:25:53 -04:00
Geraldine Van der Auwera dfa18a8fc6 Merge pull request #887 from broadinstitute/pd_vcf_cmdline_hdr
Fixed logging of 'out' command line parameter in VCF headers
2015-03-25 00:48:55 -04:00
Phillip Dexheimer 9e63696315 Remove indel-length normalization of QD for GGVCFs
* Fixes #848
* length normalization is now only applied if the annotation is calculated in UG
2015-03-24 08:22:19 -04:00
Geraldine Van der Auwera 0a45b2d79d Merge pull request #883 from broadinstitute/rhl_hc_mq0
Exclude MappingQualityZero from default annotations
2015-03-23 12:59:08 -04:00
amilev 244dbe8634 Merge pull request #886 from broadinstitute/ami-RNA-ASEtools-march2015
Ami RNA ASE tools
2015-03-21 18:31:24 -04:00
Ami Levy-Moonshine c5fc5c4f8c Create 2 new tools:
- ASEReadCounter (public tool) replaces Tuuli's script to produce the input to Manny's tool.
   It counts the number of reads that support the ref allele and the alt allele, filtering low-quality reads and bases and keeping only properly paired reads
- ASECaller (private tool) takes both RNA and DNA and produces contingency tables ** still under development **

minor changes in other tools:
- update RNA HC variant calling scala script
- expose FS method pValueForContingencyTable to be able to call it from ASEcaller

In ASEReadCounter:
- allow different options to deal with overlapping reads from the same fragment
- add option to ignore or include indels in the pileups
- add option to disable DuplicateRead

add ASEReadCounterIntegrationTest.java and files for the test
2015-03-21 16:56:00 -04:00
Phillip Dexheimer 3b567d7a98 Fixed logging of 'out' command line parameter in VCF headers 2015-03-18 23:12:13 -04:00
Ron Levine 46668d469a Exclude MappingQualityZero from default annotations 2015-03-17 21:46:18 -04:00
Geraldine Van der Auwera bf0c72aab0 Merge pull request #881 from broadinstitute/gg_fix_gsalib_failures
Fixes the gsalib build failure that caused the master meltdown
2015-03-17 07:47:18 -04:00
Geraldine Van der Auwera a75e1d4ce4 Fixes the test that was failing due to gsalib build failure 2015-03-17 04:26:03 -04:00
Geraldine Van der Auwera c374d126d7 Merge pull request #808 from broadinstitute/pd_gsalib_concordance
Added gsa.reshape.concordance.table function to gsalib
2015-03-17 00:05:30 -04:00
Phillip Dexheimer 4d4d33404e Added gsa.reshape.concordance.table function to gsalib 2015-03-16 22:52:27 -04:00
Geraldine Van der Auwera 517320092c Merge pull request #863 from broadinstitute/kc_m2_initial_commit
Seeking comments on visibility changes to HaplotypeCaller-related classes

Welcome to GATK-master, MuTect2!
2015-03-13 21:05:39 -04:00
Kristian Cibulskis ab1053e83c It compiles, and produces results!
fixed NPE when normal contains no reads

first integration test (micro) and unit tests, also rename of MuTectHC -> M2

adding in standard GATK license terms

incorporated HOSTILE mode to PCR Error Correction

removed tumor and normal name parameters and cleaned up internal name handling

changes to allow for calling without a matched normal (technically, not true 'tumor-only' calling).  Used for panel-of-normals creation

additional regression tests, based on DREAM data.  Removed accidental addition of TandemRepeatAnnotator to default annotations

updated MD5 based on run from GSA4 to fix bamboo issue

reverted unneeded visibility changes
2015-03-13 18:28:01 -04:00
Geraldine Van der Auwera 1d39ed9156 Merge pull request #814 from broadinstitute/biocyberman_maven_patches
Biocyberman maven patches
2015-03-13 16:26:02 -04:00
Geraldine Van der Auwera 39a972f348 Merge pull request #872 from broadinstitute/eb_create_rgq_format_field
Added the RGQ format annotation to monomorphic sites in the VCF output of GenotypeGVCFs. Fixes #870
2015-03-13 13:59:53 -04:00
Geraldine Van der Auwera 7681e89454 Merge pull request #869 from broadinstitute/gg_fix_vqsr_plots_GSA-860
Switched VQSR tranches plot ordering rule
2015-03-13 10:46:55 -04:00
Geraldine Van der Auwera 3276a964f4 Merge pull request #871 from broadinstitute/pd_queue_unmapped_regression
Regression: The new 'includeUnmapped' PartitionBy annotation was incorrectly set for HC
2015-03-13 10:42:52 -04:00