Commit Graph

13550 Commits (0b72f7a16d42c27e2f26275d4bcc784cdd67f569)

Author SHA1 Message Date
Eric Banks 0b72f7a16d Merge pull request #685 from broadinstitute/ldg_inbreedingCoeffForMultiallelics
Update inbreeding coefficient calculation to give a better estimate for ...
2014-08-07 09:00:09 -04:00
Laura Gauthier 74affcc077 Update inbreeding coefficient calculation to give a better estimate for multialleleic sites
Add unit test for compound het and for multiallelic hets
2014-08-07 08:12:47 -04:00
Eric Banks b9486f5b4d Merge pull request #693 from broadinstitute/ldg_SORfromHC
Allow SOR to be calculated from HC
2014-08-06 21:48:09 -04:00
Eric Banks 5f31e54d67 Merge pull request #696 from broadinstitute/pd_DoC_sorting
Fix sample sort order bug in DepthOfCoverage
2014-08-06 08:35:35 -04:00
Eric Banks 6fa4764fc1 Merge pull request #694 from broadinstitute/pd_check_missing_arg_values
Improved detection of missing argument values
2014-08-06 08:32:22 -04:00
Phillip Dexheimer b0c026e671 Fix sample sort order bug in DepthOfCoverage
Rare bug triggered by hash collision between sample names
 PT 66183936
2014-08-05 21:55:34 -04:00
Phillip Dexheimer 593663d9b6 Improved detection of missing argument values
In particular, it was possible to specify arguments for Files or Compound types without values
 Added a special "none" value for annotations, since a bare "-A" is no longer allowed
 Delivers PT 71792842 and 59360374
2014-08-05 20:31:31 -04:00
Eric Banks 03e7ee6e9c Merge pull request #695 from broadinstitute/pd_genotype_concordance_doc
Documentation fix (closed HTML tag)
2014-08-05 07:15:40 -04:00
Phillip Dexheimer 359fe150c9 Documentation fix (closed HTML tag) 2014-08-04 23:19:16 -04:00
Laura Gauthier 5533199402 Allow SOR to be calculated from HC
Refactor StrandBiasTest classes
2014-08-01 20:47:58 -04:00
Ryan Poplin e69b0d6316 Merge pull request #692 from broadinstitute/rp_typo_analyze_covariates
Fixing typos in AnalyzeCovariates
2014-07-31 10:39:31 -04:00
Ryan Poplin 63b3f7dfd3 Fixing typos in AnalyzeCovariates 2014-07-31 10:36:18 -04:00
Eric Banks 66ccc636b8 Merge pull request #691 from broadinstitute/ldg_update_htsjdk
Update GATK to work with latest htsjdk
2014-07-30 14:32:23 -04:00
Laura Gauthier 4373922ee6 Update GATK to work with latest htsjdk
ValidationStringency was moved from htsjdk.samtools.SAMFileReader to htsjdk.samtools
samtools find BAM index file method was also moved (and made public!)
2014-07-30 12:05:14 -04:00
Valentin Ruano Rubio d69af637bf Merge pull request #690 from broadinstitute/vrr_fix_non_diploid_unsupproted_message_in_haplotype_caller
Add diploid only support message to HaplotypeCaller
2014-07-29 22:50:00 -04:00
Valentin Ruano-Rubio 750eb4b5a6 Add diploid only support message to HaplotypeCaller
Story:

  https://www.pivotaltracker.com/story/show/73440292

Changes:

  - Just add the conditional in HaplotypeCaller#initialize

Testing:

  - Nothing added, checked locally, trivial change that would eventually be removed anyway.
2014-07-29 17:05:36 -04:00
Eric Banks 43bd6b4436 Merge pull request #689 from broadinstitute/eb_fix_samrecord_test_constructor
The copy constructor for a GATKSAMRecord (used for testing only) should ...
2014-07-23 23:03:30 -04:00
Eric Banks 84af1fc75f The copy constructor for a GATKSAMRecord (used for testing only) should use the actual read's contig index, not its mate's. 2014-07-23 15:31:03 -04:00
Valentin Ruano Rubio 881fc9cc35 Merge pull request #687 from broadinstitute/vrr_omniploidy_assesment_na12878
Further changes to AssessNA12878 for evaluating Omniploidy
2014-07-22 14:45:59 -04:00
Valentin Ruano-Rubio 8c14609477 Added -gtType argument to AssessNA12878 to stratify assessment based on true genotype call in NA12878 (or input call if the former is not in KB).
Story:

  https://www.pivotaltracker.com/story/show/75028590

Changes:

  Added the possibility of indicating the genotype type to consider (with argument [-gtType TYPE]*, where TYPE is HET, HOM_REF or HOM_VAR)
  Removed conditional Het evaluation based on input ploidy; now you need to use -gtType explictly.

Tests:

  Added integration tests to check on new argument (-gtType) behaviour in AssessNA12878KnowledgeBaseTest
2014-07-22 11:02:36 -04:00
David Roazen 0798a4b768 Update pom versions to mark the start of GATK 3.3 development 2014-07-17 12:09:33 -04:00
David Roazen 323f22f852 Update pom versions for the 3.2 release 2014-07-17 12:06:22 -04:00
Eric Banks 98d88eb07e Fixed IndexOutOfBounds error associated with tail merging.
Don't expand out source nodes for tail merging, since that's a head merging action only.
This shows up as a bug only because we now allow merging tails against non-reference paths.
2014-07-17 12:04:22 -04:00
David Roazen 799071b520 Merge remote-tracking branch 'unstable/master' 2014-07-14 14:07:16 -04:00
Geraldine Van der Auwera 104d94a1ac Merge pull request #686 from broadinstitute/gg_gatkdocs_updates
Various documentation improvements
2014-07-14 12:10:36 -04:00
Geraldine Van der Auwera a6f632874b Various documentation improvements
- Edited intervals merging docs for correctness & clarity
- Edited VQSR arg docs and made mode required (+added -mode SNP to VQSR tests)
- Moved PaperGenotyper to Toy Walkers to declutter the actually useful docs
- Moved GenotypeGVCFs to Variant Discovery category and clarified a few points
- Clarified that the -resource argument depends on using the -V:tag format
- Clarified how the pcr indel model works
- Added caveat for -U ALLOW_N_CIGAR_READS
- Added MathJax support for displaying equations in GATKDocs
- Updated HC example commands and caveats
2014-07-14 12:03:03 -04:00
droazen db53d096c9 Merge pull request #684 from broadinstitute/ks_add_cofoja_to_gatk_packages
Added cofoja to the gatk packages for tests to pass.
2014-07-14 11:15:49 -04:00
Eric Banks 2138aa135c Merge pull request #683 from broadinstitute/eb_disable_complex_variant_merging_PT74816146
Disable the complex variant merging for now, as requested by ATGU
2014-07-13 08:57:13 -04:00
Eric Banks ecefcb383d Disable the complex variant merging for now, as requested by ATGU 2014-07-11 17:27:40 -04:00
Khalid Shakir c7e357eb59 Added cofoja to the gatk packages for tests to pass. 2014-07-11 23:19:42 +08:00
droazen b8751ad598 Merge pull request #680 from broadinstitute/ldg_VQSRscript
Update VQSR Rnd BQSR  script generation code for compatibility with late...
2014-07-11 10:16:37 -04:00
kshakir 4f4ff5c327 Merge pull request #682 from broadinstitute/ks_revert_md5_db_per_test
Reverting md5 db per test
2014-07-11 03:42:18 +08:00
Ryan Poplin 193e389b41 Merge pull request #679 from broadinstitute/eb_better_tail_merging_PT74222522
Improved tail merging: now tails can be merged to branches that are not ...
2014-07-10 13:54:33 -04:00
Valentin Ruano Rubio 598b481733 Merge pull request #671 from broadinstitute/vrr_omniploidy_assesment_na12878
Now AssessNA12878 can handle different input ploidy for omniploidy asses...

Story:

  https://www.pivotaltracker.com/story/show/74469346
2014-07-10 13:19:32 -04:00
Khalid Shakir 18f6d56b4c Revert "Using the base directory for each test run when outputting MD5DB mismatches."
This reverts commit f192f032a153755a84b1d682f6e652a7c6787fb9.
2014-07-11 01:11:25 +08:00
Khalid Shakir cc09ef9190 Revert "Appending to md5db in the gatkdir, with additional logging."
This reverts commit 0aa2884f7b006f5d48c325bf942b92c183e45074.
2014-07-11 01:11:20 +08:00
droazen a105cc4c0a Merge pull request #681 from broadinstitute/dr_md5mismatch_generator_script
Shell script to generate md5 mismatches file from a Bamboo log of an integration test run
2014-07-10 10:40:09 -04:00
David Roazen af0aab8791 Shell script to generate md5 mismatches file from a Bamboo log of an integration test run 2014-07-10 10:32:10 -04:00
Eric Banks 1d97b4a191 Improved tail merging: now tails can be merged to branches that are not entirely reference.
This is useful for e.g. cases where there are SNPs on insertions.  Before tails were forced to be merged
(incorrectly) only to a reference node, but now they can be merged to any path in the graph from which they
directly branch.

Also, I've transferred over Ryan's code to refuse to process kmer sizes such that there are non-unique kmers
in the reference sequence with them.
2014-07-10 08:57:01 -04:00
kshakir aecd34d274 Merge pull request #677 from broadinstitute/ks_md5_db_per_test_type
Appending to md5db in the gatkdir, with additional logging.
2014-07-10 17:53:24 +08:00
Ryan Poplin 5eee065133 Merge pull request #674 from broadinstitute/rp_improve_genotyping
Improvements to genotyping accuracy.
2014-07-09 16:03:09 -04:00
Khalid Shakir a7d1904c63 Appending to md5db in the gatkdir, with additional logging. 2014-07-10 03:58:47 +08:00
Laura Gauthier 99026eb51b Update VQSR Rnd BQSR script generation code for compatibility with latest ggplot version. Update queueJobReport.R and public/gsalib/src/R/R/gsa.variantqc.utils.R also 2014-07-09 15:36:58 -04:00
Valentin Ruano-Rubio cd2c5ce1b2 Now AssessNA12878 can handle different input ploidy for omniploidy assessment where we may run HC as if it was a haploid, tetraploid (and so forth) organism.
For example, when the input is Haploid it is considered ok to have a FN if the actual genotype is 0/1 as there is 50% chance to not call it at all.

Also it considers that the genotype call is concordant as long as the AC is as close as it can be to the 50% percent given the ploidy. So for a 0/1 true call is it ok
to have a 0 or 1 call in haploids and also 0/0/1/1 in tetraploid, and also 0/0/1 or 0/1/1 with triploid input, but it is not a 0/0/0/1 in tetraploids or 0/0/0/0/1/1 with hextaploid input.

Story:

  http://www.pivotaltracker.com/story/show/72090992

Changes:

  AssessNA12878 has a new argument (-ploidy / --inputPloidy) to indicate the expected ploidy of the input.
  By default this is the obvious choice of 2 as NA12878 is human.

  In the input has calls with a different ploidy it will complain with an user exception.

  Also some refactoring has been done to make the code a bit more concise in some parts.
2014-07-09 14:42:33 -04:00
droazen a85bc0b577 Merge pull request #678 from broadinstitute/dr_remove_junit_imports
Remove junit imports in the test suite
2014-07-09 14:40:38 -04:00
Ryan Poplin 74a7674d70 Improvements to genotyping accuracy.
-- Global mismapping penalty was only applied to the reference haplotype. This led to problems with overlapping events, mostly STR haplotypes. Now the penalty is applied to every haplotype.
-- We subset the reads down to only those which overlap the event (after assembly based realignment) for likelihood calculations.
2014-07-09 13:11:07 -04:00
David Roazen 719e685759 Remove junit imports in the test suite 2014-07-09 12:09:27 -04:00
droazen 465c5ed9b0 Merge pull request #676 from broadinstitute/cww_junit_assert
Replace improper use of org.junit imports with org.testng.
2014-07-08 16:12:34 -04:00
Chris Whelan 635b9bd1af Replace improper use of org.junit imports with org.testng. 2014-07-08 16:07:39 -04:00
droazen b2e51838d7 Merge pull request #673 from broadinstitute/ks_md5_db_per_test_run
Creating and MD5DB per each test, instead of overwriting in the top dir
2014-07-08 10:58:59 -04:00