Commit Graph

13439 Commits (b77589696e5e6273cb2db779f4dd9d3df6d6d38b)

Author SHA1 Message Date
Eric Banks b77589696e Merge pull request #643 from broadinstitute/rp_remove_hwp
Removing HWP from GenotypeSummaries because of integer overflow issues w...
2014-05-27 17:21:19 -04:00
Eric Banks 3f694d61ee Merge pull request #641 from broadinstitute/dr_post_test_cleanup
Improve test suite tmp file cleanup
2014-05-27 15:59:03 -04:00
Eric Banks 400d19a126 Merge pull request #642 from broadinstitute/dr_maven_symlink_cleanup_script
Script to manually delete maven-created symlinks to testdata/qscripts
2014-05-27 15:52:18 -04:00
David Roazen 6ca483e574 Script to manually delete maven-created symlinks to testdata/qscripts
This script can be run when "mvn clean" fails due to an inability to
remove existing symlinks.
2014-05-27 15:33:11 -04:00
David Roazen 74b51c5c7a Improve test suite tmp file cleanup
-Make BaseTest.createTempFile() mark any possible corresponding index files for deletion on exit

-Make WalkerTest mark shadow BCF files and auxiliary for deletion on exit

-Make VariantRecalibrationWalkersIntegrationTest mark PDF files for deletion on exit
2014-05-27 13:41:44 -04:00
Ryan Poplin b24cff780b Removing HWP from GenotypeSummaries because of integer overflow issues with 91K samples. Removing CCC because it is redundant. 2014-05-27 10:14:49 -04:00
Eric Banks 3696c2d817 Merge pull request #639 from broadinstitute/rp_dont_merge_tails_in_exomes
Unfortunately dangling tail recovery is dangerous in exome data. Turning...
2014-05-27 09:37:49 -04:00
Ryan Poplin ec7c4ea2ba Unfortunately dangling tail recovery is dangerous in exome data. Turning it off by default for now.
-- disabling HC+VA integration test because, as noted in the comments, it keeps switching PairHMM implementations and giving different results at a particular site used in that particular test
2014-05-23 14:33:44 -04:00
Valentin Ruano Rubio cfde6e72bf Merge pull request #633 from broadinstitute/vrr_sw_pairwise_alignment
Fix for inconsistent indel variant location based on active-region size
2014-05-23 09:27:55 -04:00
Valentin Ruano-Rubio 979ab0453e Moved GlobalEdgeGreedySWPairwiseAlignment to the archive 2014-05-23 01:48:48 -04:00
Valentin Ruano-Rubio 7c8a1ae892 Fix for SW to make double comparisons with a tolerance
Stories:

  - https://www.pivotaltracker.com/story/show/69577868

Changes:

  - Added a epsilon difference tolerance in weight comparisons.

Tests:

  - Added HaplotypeCallerIntegrationTest#testDifferentIndelLocationsDueToSWExactDoubleComparisonsFix
  - Updated md5 due to minor likelihood changes.
  - Disabled a test for PathUtils.calculateCigar since does not work and is unclear what is causing the error (needs original author input)
2014-05-23 01:48:48 -04:00
kshakir d4340de179 Merge pull request #637 from broadinstitute/ks_package_rename_patches
Fixed GATK docs artifact, moved protected ExampleUG tests.
2014-05-22 21:06:25 -04:00
Khalid Shakir b7e98bdae9 Fixed GATK docs artifact, moved protected ExampleUG tests. 2014-05-22 21:03:55 -04:00
David Roazen 248f7394ca Update manual private testdata linking script for maven changes 2014-05-20 14:16:33 -04:00
David Roazen 56445f6eef Update nightly build and internal build release scripts for maven changes 2014-05-20 13:20:29 -04:00
Ryan Poplin f24fb4e518 Merge pull request #635 from broadinstitute/rp_HC_gvcf_banding_docs
Minor updates to HC docs.
2014-05-20 11:01:41 -04:00
Ryan Poplin 581843d994 Minor updates to HC docs. 2014-05-20 10:01:11 -04:00
kshakir 40f289c622 Merge pull request #632 from broadinstitute/ks_package_rename
Sting to GATK renaming.
For more info see http://gatkforums.broadinstitute.org/discussion/4173/sting-to-gatk-renaming
2014-05-19 18:05:07 -04:00
Khalid Shakir 88d7e23c44 After talking with Mauricio and Karthik, updated MD5s and added a note about PairHMM causing test variability. 2014-05-19 17:36:41 -04:00
Karthik Gururaj 972a82d386 Changed 'sting' to 'gatk' in the VectorLoglessPairHMM classes and the
C++ code
2014-05-19 17:36:41 -04:00
Khalid Shakir 3939971d78 After renaming the packages, instead of updating the JNI library used for testing bwa, moving the classes to the archive.
NOTE: The migrated READEME.md has been added that will allow others to possibly ressurect this code as needed.
2014-05-19 17:36:41 -04:00
Khalid Shakir 2c854e554a Refactored maven directories and java packages replacing "sting" with "gatk".
To reduce merge conflicts, this commit modifies contents of files, while file renamings are in previous commit.
See previous commit message for list of changes.
2014-05-19 17:36:39 -04:00
Khalid Shakir 4e6d43d003 Refactored maven directories and java packages replacing "sting" with "gatk".
To reduce merge conflicts, this commit only renames files, while file modifications are in next commit.
Some updates/fixes here are actually included in the next commit.
= Maven updates
Moved artifacts to new package names:
* private/queue-private -> private/gatk-queue-private
* private/gatk-private -> private/gatk-tools-private
* public/gatk-package -> protected/gatk-package-distribution
* public/queue-package -> protected/gatk-queue-package-distribution
* protected/gatk-protected -> protected/gatk-tools-protected
* public/queue-framework -> public/gatk-queue
* public/gatk-framework -> public/gatk-tools-public
New poms for new artifacts and packages:
* private/gatk-package-internal
* private/gatk-queue-package-internal
* private/gatk-queue-extensions-internal
* protected/gatk-queue-extensions-distribution
* public/gatk-engine
Updated references to StingText.properties to GATKText.properties.
Updated ant-bridge.sh to use gatk.* properties instead of sting.*.
= Engine updates
Renaming files containing engine parts from o.b.gatk.tools to o.b.gatk.engine.
Changed package references from tools to engine for CommandLineGATK, GenomeAnalysisEngine, ReadMetrics, ReadProperties, and WalkerManager.
Changed package reference tools.phonehome to engine.phonehome.
Renamed classes *Sting* to *GATK*, such as ReviewedGATKException.
= Test updates
Moved gatk example resources.
Moved test engine files from tools to engine packages.
Moved resources for phonehome to proper package.
Moved test classes under o.b.gatk into packages:
* o.b.g.utils.{BaseTest,ExampleToCopyUnitTest,GATKTextReporter,MD5DB,MD5Mismatch,TestNGTestTransformer}
* o.b.g.engine.walkers.WalkerTest
Updated package names in DependencyAnalyzerOutputLoaderUnitTest's data.
= Queue updates
Moving queue scripts to location where generated extensions can be used.
Renamed *.q to *.scala, updating licenses previously missed by git hooks.
Moved queue extensions to new artifact gatk-queue-extensions.
Fixed import statments frequently merge-conflicting on FullProcessingPipeline.scala.
= BWA
Added README on how to obtain and include bwa as a library.
Updated libbwa build.
Fixed packaged names under bwa/java implementation.
Updated contents of BWCAligner native implementation.
= Other fixes
Don't duplicate the resource bundle entries by both unpacking *and* appending.
(partial fix) Staged engine and utils poms to build GATKText.properties, once Utils random generator dependency on GATK engine is fixed.
Re-enabled custom testng listeners/reporters and moved testng dependencies to the gatk-root.
Updated comments referencing Sting with GATK.
Moved a couple untangled classes from gatk-tools-public to gatk-utils and gatk-engine.
2014-05-19 16:43:47 -04:00
Eric Banks 6dff219bb1 Merge pull request #634 from broadinstitute/ldg_newSingleSampleScript
Updated single sample exome calling script to use HaplotypeCaller and Ge...
2014-05-19 15:14:16 -04:00
Laura Gauthier 3dea8fed63 Updated single sample exome calling script to use HaplotypeCaller and GenotypeGVCFs 2014-05-19 15:08:51 -04:00
jmthibault79 d15dbadeca Merge pull request #631 from broadinstitute/pd_queue_final_status_line
Revised final Queue status line to display number of jobs in each state ...
2014-05-16 12:03:28 -04:00
Phillip Dexheimer a5abc079dc Revised final Queue status line to display number of jobs in each state when the script fails
* Addresses PT 61552466
* Included a simple scala script in private/testdata that will always fail
2014-05-15 21:30:44 -04:00
Eric Banks 1b2d6ae5e4 Merge pull request #629 from broadinstitute/jg_add_gather_bqsr
Add a (picard) command line wrapper around BQSRGatherer
2014-05-14 12:09:15 -04:00
jmthibault79 78560212d0 Merge pull request #630 from broadinstitute/pd_blank_lines_in_listfile
Allow blank lines in a (non-BAM) list file
2014-05-14 11:32:44 -04:00
droazen 8297cd1a1a Merge pull request #619 from broadinstitute/pd_intervalmerge_doc
Made IntervalSharder respect the IntervalMergingRule specified on the co...
2014-05-14 11:22:18 -04:00
Phillip Dexheimer 77449961ab Allow blank lines in a (non-BAM) list file
* Addresses PT Bug 67841052
 * Added Unit Test
2014-05-13 23:14:15 -04:00
geoffjentry 88ae203075 Add a (picard) command line wrapper around BQSRGatherer 2014-05-13 11:39:38 -04:00
kshakir cca3d167db Merge pull request #627 from broadinstitute/ks_picard_htsjdk_package_rename
picard/htsjdk package rename
2014-05-08 13:12:22 -04:00
Khalid Shakir 67e44985b1 Java/Scala imports updated for new package names.
Fourth of four commits for picard/htsjdk package rename.
2014-05-08 19:13:31 +08:00
Khalid Shakir cc3f1f2b96 Revved picard libraries.
Third of four commits for picard/htsjdk package rename.
2014-05-08 19:13:27 +08:00
Khalid Shakir a894a2dddb Updates to GATK classes and POMs that need updating, plus RodSystemValidation md5 updates.
GATK classes accessing package protected htsjdk classes changed to new package names.
POMs updated to support merging of sam/tribble/variant -> htsjdk and changes to picard artifact.
RodSystemValidation outputs changed due to variant codec packages changes, requiring test md5 updates.
Second of four commits for picard/htsjdk package rename.
2014-05-08 19:13:27 +08:00
Khalid Shakir 3ce3e27aa1 Moved GATK classes and POMs that will need updating.
GATK classes accessing package protected htsjdk classes will need new package names.
POMs will merge sam/tribble/variant into htsjdk.
Move only, contents updated in next commit.
First of four commits for picard/htsjdk package rename.
2014-05-08 19:13:27 +08:00
ldgauthier 53b2364cf8 Merge pull request #623 from broadinstitute/ldg_concordanceSites
Add ability to output to a file discordant loci and their respective gen...
2014-05-07 10:19:09 -04:00
Laura Gauthier bf7b97393e Add ability to output to a file discordant loci and their respective genotypes for each sample 2014-05-07 10:12:45 -04:00
MauricioCarneiro f03a12263a Merge pull request #625 from broadinstitute/intel_updateCell_inlined
(Optional) Inlined the code from updateCell
2014-05-07 10:11:09 -04:00
MauricioCarneiro bd33c3334e Merge pull request #626 from broadinstitute/intel_pairhmm
Removed scary warning messages for VectorPairHMM
2014-05-06 19:23:35 -04:00
Karthik Gururaj d9c489f928 Removed scary warning messages for VectorPairHMM 2014-05-06 10:59:24 -07:00
Karthik Gururaj fb8578ec8e Inlined the code from updateCell - helps Java JIT to detect hotspots and
produce good native code
2014-05-06 10:37:10 -07:00
MauricioCarneiro 587e81fbd9 Merge pull request #573 from broadinstitute/intel_pairhmm
Intel pairhmm
2014-05-05 16:27:04 -04:00
Karthik Gururaj f6ea25b4d1 Parallel version of the JNI for the PairHMM
The JNI treats shared memory as critical memory and doesn't allow any
parallel reads or writes to it until the native code finishes. This is
not a problem *per se* it is the right thing to do, but we need to
enable **-nct** when running the haplotype caller and with it have
multiple native PairHMM running for each map call.

Move to a copy based memory sharing where the JNI simply copies the
memory over to C++ and then has no blocked critical memory when running,
allowing -nct to work.

This version is slightly (almost unnoticeably) slower with -nct 1, but
scales better with -nct 2-4 (we haven't tested anything beyond that
because we know the GATK falls apart with higher levels of parallelism

* Make VECTOR_LOGLESS_CACHING the default implementation for PairHMM.
* Changed version number in pom.xml under public/VectorPairHMM
* VectorPairHMM can now be compiled using gcc 4.8.x
* Modified define-* to get rid of gcc warnings for extra tokens after #undefs
* Added a Linux kernel version check for AVX - gcc's __builtin_cpu_supports function does not check whether the kernel supports AVX or not.
* Updated PairHMM profiling code to update and print numbers only in single-thread mode
* Edited README.md, pom.xml and Makefile for users to pass path to gcc 4.8.x if necessary
* Moved all cpuid inline assembly to single function Changed info message to clog from cinfo
* Modified version in pom.xml in VectorPairHMM from 3.1 to 3.2
* Deleted some unnecessary code
* Modified C++ sandbox to print per interval timing
2014-05-02 19:12:48 -04:00
Ryan Poplin 38b7cfbda9 Merge pull request #621 from broadinstitute/vrr_combine_gvcf_bugfix
Fix for CombineGVCFs and GenotypeGVCFs recurrent exception about missing...
2014-05-02 11:52:32 -04:00
Ryan Poplin 4957359963 Merge pull request #620 from broadinstitute/ldg_fixCGL
Fixed GP annotation reading bug and updated integration test
2014-05-02 11:38:06 -04:00
Valentin Ruano-Rubio d563072282 Fix for CombineGVCFs and GenotypeGVCFs recurrent exception about missing PLs
Story:

  https://www.pivotaltracker.com/story/show/68220438

Changes:

   - PL-less input genotypes are now uncalled and so non-variant sites when combining GVCFs.
   - HC GVCF/BP_RESOLUTION Mode now outputs non-variant sites in sites covered by deletions.
   - Fixed existing tests

Test:

   - HaplotypeCallerGVCFIntegrationTest
   - ReferenceConfidenceModelUnitTest
   - CombineGVCFsIntegrationTest
2014-05-02 09:21:06 -04:00
Laura Gauthier e64479d4f5 Fixed GP annotation reading bug and updated integration test 2014-05-02 09:08:28 -04:00
jmthibault79 cb97afd7d8 Merge pull request #618 from broadinstitute/mm_java_8
Java 8 compatability fix: `Reflections` NPE.
2014-05-01 10:33:31 -04:00