Commit Graph

13529 Commits (323f22f852c90a6cf53ece5e72c165b1166ad8c7)

Author SHA1 Message Date
David Roazen 323f22f852 Update pom versions for the 3.2 release 2014-07-17 12:06:22 -04:00
Eric Banks 98d88eb07e Fixed IndexOutOfBounds error associated with tail merging.
Don't expand out source nodes for tail merging, since that's a head merging action only.
This shows up as a bug only because we now allow merging tails against non-reference paths.
2014-07-17 12:04:22 -04:00
David Roazen 799071b520 Merge remote-tracking branch 'unstable/master' 2014-07-14 14:07:16 -04:00
Geraldine Van der Auwera 104d94a1ac Merge pull request #686 from broadinstitute/gg_gatkdocs_updates
Various documentation improvements
2014-07-14 12:10:36 -04:00
Geraldine Van der Auwera a6f632874b Various documentation improvements
- Edited intervals merging docs for correctness & clarity
- Edited VQSR arg docs and made mode required (+added -mode SNP to VQSR tests)
- Moved PaperGenotyper to Toy Walkers to declutter the actually useful docs
- Moved GenotypeGVCFs to Variant Discovery category and clarified a few points
- Clarified that the -resource argument depends on using the -V:tag format
- Clarified how the pcr indel model works
- Added caveat for -U ALLOW_N_CIGAR_READS
- Added MathJax support for displaying equations in GATKDocs
- Updated HC example commands and caveats
2014-07-14 12:03:03 -04:00
droazen db53d096c9 Merge pull request #684 from broadinstitute/ks_add_cofoja_to_gatk_packages
Added cofoja to the gatk packages for tests to pass.
2014-07-14 11:15:49 -04:00
Eric Banks 2138aa135c Merge pull request #683 from broadinstitute/eb_disable_complex_variant_merging_PT74816146
Disable the complex variant merging for now, as requested by ATGU
2014-07-13 08:57:13 -04:00
Eric Banks ecefcb383d Disable the complex variant merging for now, as requested by ATGU 2014-07-11 17:27:40 -04:00
Khalid Shakir c7e357eb59 Added cofoja to the gatk packages for tests to pass. 2014-07-11 23:19:42 +08:00
droazen b8751ad598 Merge pull request #680 from broadinstitute/ldg_VQSRscript
Update VQSR Rnd BQSR  script generation code for compatibility with late...
2014-07-11 10:16:37 -04:00
kshakir 4f4ff5c327 Merge pull request #682 from broadinstitute/ks_revert_md5_db_per_test
Reverting md5 db per test
2014-07-11 03:42:18 +08:00
Ryan Poplin 193e389b41 Merge pull request #679 from broadinstitute/eb_better_tail_merging_PT74222522
Improved tail merging: now tails can be merged to branches that are not ...
2014-07-10 13:54:33 -04:00
Valentin Ruano Rubio 598b481733 Merge pull request #671 from broadinstitute/vrr_omniploidy_assesment_na12878
Now AssessNA12878 can handle different input ploidy for omniploidy asses...

Story:

  https://www.pivotaltracker.com/story/show/74469346
2014-07-10 13:19:32 -04:00
Khalid Shakir 18f6d56b4c Revert "Using the base directory for each test run when outputting MD5DB mismatches."
This reverts commit f192f032a153755a84b1d682f6e652a7c6787fb9.
2014-07-11 01:11:25 +08:00
Khalid Shakir cc09ef9190 Revert "Appending to md5db in the gatkdir, with additional logging."
This reverts commit 0aa2884f7b006f5d48c325bf942b92c183e45074.
2014-07-11 01:11:20 +08:00
droazen a105cc4c0a Merge pull request #681 from broadinstitute/dr_md5mismatch_generator_script
Shell script to generate md5 mismatches file from a Bamboo log of an integration test run
2014-07-10 10:40:09 -04:00
David Roazen af0aab8791 Shell script to generate md5 mismatches file from a Bamboo log of an integration test run 2014-07-10 10:32:10 -04:00
Eric Banks 1d97b4a191 Improved tail merging: now tails can be merged to branches that are not entirely reference.
This is useful for e.g. cases where there are SNPs on insertions.  Before tails were forced to be merged
(incorrectly) only to a reference node, but now they can be merged to any path in the graph from which they
directly branch.

Also, I've transferred over Ryan's code to refuse to process kmer sizes such that there are non-unique kmers
in the reference sequence with them.
2014-07-10 08:57:01 -04:00
kshakir aecd34d274 Merge pull request #677 from broadinstitute/ks_md5_db_per_test_type
Appending to md5db in the gatkdir, with additional logging.
2014-07-10 17:53:24 +08:00
Ryan Poplin 5eee065133 Merge pull request #674 from broadinstitute/rp_improve_genotyping
Improvements to genotyping accuracy.
2014-07-09 16:03:09 -04:00
Khalid Shakir a7d1904c63 Appending to md5db in the gatkdir, with additional logging. 2014-07-10 03:58:47 +08:00
Laura Gauthier 99026eb51b Update VQSR Rnd BQSR script generation code for compatibility with latest ggplot version. Update queueJobReport.R and public/gsalib/src/R/R/gsa.variantqc.utils.R also 2014-07-09 15:36:58 -04:00
Valentin Ruano-Rubio cd2c5ce1b2 Now AssessNA12878 can handle different input ploidy for omniploidy assessment where we may run HC as if it was a haploid, tetraploid (and so forth) organism.
For example, when the input is Haploid it is considered ok to have a FN if the actual genotype is 0/1 as there is 50% chance to not call it at all.

Also it considers that the genotype call is concordant as long as the AC is as close as it can be to the 50% percent given the ploidy. So for a 0/1 true call is it ok
to have a 0 or 1 call in haploids and also 0/0/1/1 in tetraploid, and also 0/0/1 or 0/1/1 with triploid input, but it is not a 0/0/0/1 in tetraploids or 0/0/0/0/1/1 with hextaploid input.

Story:

  http://www.pivotaltracker.com/story/show/72090992

Changes:

  AssessNA12878 has a new argument (-ploidy / --inputPloidy) to indicate the expected ploidy of the input.
  By default this is the obvious choice of 2 as NA12878 is human.

  In the input has calls with a different ploidy it will complain with an user exception.

  Also some refactoring has been done to make the code a bit more concise in some parts.
2014-07-09 14:42:33 -04:00
droazen a85bc0b577 Merge pull request #678 from broadinstitute/dr_remove_junit_imports
Remove junit imports in the test suite
2014-07-09 14:40:38 -04:00
Ryan Poplin 74a7674d70 Improvements to genotyping accuracy.
-- Global mismapping penalty was only applied to the reference haplotype. This led to problems with overlapping events, mostly STR haplotypes. Now the penalty is applied to every haplotype.
-- We subset the reads down to only those which overlap the event (after assembly based realignment) for likelihood calculations.
2014-07-09 13:11:07 -04:00
David Roazen 719e685759 Remove junit imports in the test suite 2014-07-09 12:09:27 -04:00
droazen 465c5ed9b0 Merge pull request #676 from broadinstitute/cww_junit_assert
Replace improper use of org.junit imports with org.testng.
2014-07-08 16:12:34 -04:00
Chris Whelan 635b9bd1af Replace improper use of org.junit imports with org.testng. 2014-07-08 16:07:39 -04:00
droazen b2e51838d7 Merge pull request #673 from broadinstitute/ks_md5_db_per_test_run
Creating and MD5DB per each test, instead of overwriting in the top dir
2014-07-08 10:58:59 -04:00
Khalid Shakir 2129aa05d8 Bug fix for poms missing package test artifacts. 2014-07-08 06:34:26 +08:00
Khalid Shakir e5be9c7073 Using the base directory for each test run when outputting MD5DB mismatches. 2014-07-08 06:34:25 +08:00
Ryan Poplin f084fa5ab2 Merge pull request #669 from broadinstitute/eb_improve_decomposition_of_haplotypes_PT71966916
When converting a haplotype to a set of variants we now check for cases that are overly complex.
2014-07-02 10:08:32 -04:00
Eric Banks bad7865078 When converting a haplotype to a set of variants we now check for cases that are overly complex.
In these cases, where the alignment contains multiple indels, we output a single complex
variant instead of the multiple partial indels.

We also re-enable dangling tail recovery by default.
2014-07-01 14:18:59 -04:00
ldgauthier 297c2b0651 Merge pull request #670 from broadinstitute/rp_SBBS_min_count
SB tables should be created even if the ref or alt columns have no count...
2014-07-01 08:34:23 -04:00
Ryan Poplin e14bff212d SB tables should be created even if the ref or alt columns have no counts. This is so that FS/SOR will still be calculated when the variant is extremely high or low frequency.
-- Removed long running HC integration test... sorry
2014-06-30 15:19:15 -04:00
Ryan Poplin 5c45641051 Merge pull request #668 from broadinstitute/rp_AD_realigned_reads_8b9d1c1
Reads are now realigned to the most likely haplotype before being used b...
2014-06-30 12:35:40 -04:00
Ryan Poplin 0127799cba Reads are now realigned to the most likely haplotype before being used by the annotations.
-- AD,DP will now correspond directly to the reads that were used to construct the PLs
-- RankSumTests, etc. will use the bases from the realigned reads instead of the original alignments
-- There is now no additional runtime cost to realign the reads when using bamout or GVCF mode
-- bamout mode no longer sets the mapping quality to zero for uninformative reads, instead the read will not be given an HC tag
2014-06-30 10:35:50 -04:00
jmthibault79 dc93507023 Merge pull request #667 from broadinstitute/ks_refactor_doc_scala_package
Refactored DoC scala package
2014-06-25 16:05:30 -04:00
Khalid Shakir f5345e903a Fixed minor pom.xml line that was out of place after sortpom. 2014-06-26 01:01:19 +08:00
Khalid Shakir 7b5f88a49c Refactored DoC custom Queue wrappers to a non-package object.
Now, "mvn verify && mvn verify" should work again.
2014-06-26 00:59:18 +08:00
droazen b935ed0df1 Merge pull request #665 from broadinstitute/ks_force_delete_bad_symlinks
Executing a version of the delete_maven_links.sh
2014-06-25 00:13:05 -04:00
Eric Banks 92028027c9 Merge pull request #666 from broadinstitute/pd_merge_selectvariants_test
Removed redundant SelectVariantsIntegrationTest, merged its only test i...
2014-06-24 21:19:46 -04:00
Phillip Dexheimer 06d619e9aa Removed redundant SelectVariantsIntegrationTest, merged it's only test into protected version 2014-06-24 18:59:59 -04:00
Khalid Shakir 45d819a00e For now, executing the delete_maven_links.sh just ahead of creating the symbolic links during the process-test-resources phase.
Better than running it during the "clean" phase, since these users may not run "mvn clean" before attempting to build.
2014-06-25 02:32:15 +08:00
kshakir 42cfa3b53b Merge pull request #664 from broadinstitute/ks_picard_maven_gatk_root
Fixed gatk-root path in private/picard-maven/pom.xml
2014-06-24 18:48:40 +08:00
Khalid Shakir bffc9fbabd Fixed gatk-root path in private/picard-maven/pom.xml 2014-06-24 18:45:51 +08:00
Eric Banks c191103326 Merge pull request #663 from broadinstitute/pd_jexl_user_exception
Recast the "Invalid JEXL expression detected" error in SelectVariants fr...
2014-06-20 17:17:54 -04:00
Phillip Dexheimer 65eeb4a7ab Recast the "Invalid JEXL expression detected" error in SelectVariants from a RuntimeException to a UserException
- PT 68931448
2014-06-20 00:05:23 -04:00
Eric Banks db7dc8ab5f Merge pull request #660 from broadinstitute/pd_catvariants_list
Added functionality to CatVariants to process .list files with -V
2014-06-19 23:54:03 -04:00
Phillip Dexheimer da5e567b73 Added functionality to CatVariants to process .list files with -V
- Pivotal 70305712
2014-06-19 21:46:13 -04:00