gatk-3.8

Commit Graph

Author	SHA1	Message	Date
Karthik Gururaj	6e98e9e589	Removed g_haplotype* global variables in native code so that it works with multi-threading in Java. Modified VectorLoglessPairHMM.java so that jniInitializeRegion and jniFinalizeRegion are empty	2014-03-06 22:08:35 -08:00
Karthik Gururaj	3999677c93	Changed to delete[] where applicable	2014-03-06 12:23:08 -08:00
Karthik Gururaj	a29777765d	Binary library	2014-03-06 11:14:46 -08:00
Karthik Gururaj	7844d956ac	Modified delete to delete[]	2014-03-06 11:13:34 -08:00
Karthik Gururaj	27e640d640	Modified SSE4.1 and 4.2 checks with _may_i_use_cpu_feature()	2014-03-06 08:51:11 -08:00
Karthik Gururaj	37f107cb3a	Using Mustafa's function _may_i_use_cpu_feature() for AVX check	2014-03-06 08:37:48 -08:00
Karthik Gururaj	ec54528605	Fixed error in Sandbox.java	2014-03-05 09:36:55 -08:00
Karthik Gururaj	8fcbf9272c	Merge branch 'intel_pairhmm' of /data/broad/gsa-unstable into intel_pairhmm Conflicts: protected/gatk-protected/src/main/java/org/broadinstitute/sting/gatk/walkers/haplotypecaller/PairHMMLikelihoodCalculationEngine.java public/VectorPairHMM/src/main/c++/Sandbox.java	2014-03-05 09:35:50 -08:00
Intel Repocontact	d81116eb1d	Added vectorized PairHMM implementation by Mohammad and Mustafa into the Maven build of GATK. C++ code has PAPI calls for reading hardware counters Followed Khalid's suggestion for packing libVectorLoglessCaching into the jar file with Maven Native library part of git repo 1. Renamed directory structure from public/c++/VectorPairHMM to public/VectorPairHMM/src/main/c++ as per Khalid's suggestion 2. Use java.home in public/VectorPairHMM/pom.xml to pass environment variable JRE_HOME to the make process. This is needed because the Makefile needs to compile JNI code with the flag -I<JRE_HOME>/../include (among others). Assuming that the Maven build process uses a JDK (and not just a JRE), the variable java.home points to the JRE inside maven. 3. Dropped all pretense at cross-platform compatibility. Removed Mac profile from pom.xml for VectorPairHMM Moved JNI_README 1. Added the catch UnsatisfiedLinkError exception in PairHMMLikelihoodCalculationEngine.java to fall back to LOGLESS_CACHING in case the native library could not be loaded. Made VECTOR_LOGLESS_CACHING as the default implementation. 2. Updated the README with Mauricio's comments 3. baseline.cc is used within the library - if the machine supports neither AVX nor SSE4.1, the native library falls back to un-vectorized C++ in baseline.cc. 4. pairhmm-1-base.cc: This is not part of the library, but is being heavily used for debugging/profiling. Can I request that we keep it there for now? In the next release, we can delete it from the repository. 5. I agree with Mauricio about the ifdefs. I am sure you already know, but just to reassure you the debug code is not compiled into the library (because of the ifdefs) and will not affect performance. 1. Changed logger.info to logger.warn in PairHMMLikelihoodCalculationEngine.java 2. Committing the right set of files after rebase Added public license text to all C++ files Added license to Makefile Add package info to Sandbox.java Conflicts: protected/gatk-protected/src/main/java/org/broadinstitute/sting/gatk/walkers/haplotypecaller/HaplotypeCaller.java protected/gatk-protected/src/main/java/org/broadinstitute/sting/gatk/walkers/haplotypecaller/PairHMMLikelihoodCalculationEngine.java protected/gatk-protected/src/main/java/org/broadinstitute/sting/utils/pairhmm/DebugJNILoglessPairHMM.java protected/gatk-protected/src/main/java/org/broadinstitute/sting/utils/pairhmm/JNILoglessPairHMM.java protected/gatk-protected/src/main/java/org/broadinstitute/sting/utils/pairhmm/VectorLoglessPairHMM.java public/VectorPairHMM/src/main/c++/.gitignore public/VectorPairHMM/src/main/c++/LoadTimeInitializer.cc public/VectorPairHMM/src/main/c++/LoadTimeInitializer.h public/VectorPairHMM/src/main/c++/Makefile public/VectorPairHMM/src/main/c++/Sandbox.cc public/VectorPairHMM/src/main/c++/Sandbox.h public/VectorPairHMM/src/main/c++/Sandbox.java public/VectorPairHMM/src/main/c++/Sandbox_JNIHaplotypeDataHolderClass.h public/VectorPairHMM/src/main/c++/Sandbox_JNIReadDataHolderClass.h public/VectorPairHMM/src/main/c++/baseline.cc public/VectorPairHMM/src/main/c++/define-double.h public/VectorPairHMM/src/main/c++/define-float.h public/VectorPairHMM/src/main/c++/define-sse-double.h public/VectorPairHMM/src/main/c++/define-sse-float.h public/VectorPairHMM/src/main/c++/headers.h public/VectorPairHMM/src/main/c++/jnidebug.h public/VectorPairHMM/src/main/c++/org_broadinstitute_sting_utils_pairhmm_DebugJNILoglessPairHMM.cc public/VectorPairHMM/src/main/c++/org_broadinstitute_sting_utils_pairhmm_DebugJNILoglessPairHMM.h public/VectorPairHMM/src/main/c++/org_broadinstitute_sting_utils_pairhmm_VectorLoglessPairHMM.cc public/VectorPairHMM/src/main/c++/org_broadinstitute_sting_utils_pairhmm_VectorLoglessPairHMM.h public/VectorPairHMM/src/main/c++/pairhmm-template-kernel.cc public/VectorPairHMM/src/main/c++/pairhmm-template-main.cc public/VectorPairHMM/src/main/c++/run.sh public/VectorPairHMM/src/main/c++/shift_template.c public/VectorPairHMM/src/main/c++/utils.cc public/VectorPairHMM/src/main/c++/utils.h public/VectorPairHMM/src/main/c++/vector_function_prototypes.h	2014-03-05 09:30:29 -08:00
Joel Thibault	57747ad35e	Logger output should go to STDERR instead of STDOUT	2014-03-05 10:01:06 -05:00
Joel Thibault	b4dde6a78c	Add WARN to the valid log types error message - order if statements and error message in increasing severity	2014-03-05 10:01:06 -05:00
Valentin Ruano Rubio	243d1bc07a	Merge pull request #542 from broadinstitute/vrr_efficient_find_best_haplotypes Added a more efficient implementation of the KBest haplotype finder code...	2014-03-05 09:44:50 -05:00
David Roazen	58905e8fe0	Disable the intermittently-failing and flawed ProgressMeterDaemonUnitTest -created a Pivotal ticket to eventually redesign this test	2014-03-05 09:15:26 -05:00
Valentin Ruano-Rubio	69bf2b3247	Added a more efficient implementation of the KBest haplotype finder code (CONT.) Changes: 1. Addressed review comments on new K-best haplotype assembly graph finder. 2. Generalize KBestHaplotypeFinder to deal with multiple source and sink vertices. 3. Updated test to use KBestHaplotypeFinder instead of KBestPaths 4. Retired KBestPaths to the archive. 5. Small improvements to the code and documentation.	2014-03-04 23:22:27 -05:00
Valentin Ruano-Rubio	7acf2eb0e7	Added a more efficient implementation of the KBest haplotype finder code. Story: https://www.pivotaltracker.com/story/show/66238286 Changes: 1. Created a new k-best haplotype search implementation in class KBestHaplotypeFinder. 2. Changed HC code to use the new implementation. This seems to fix the original problem without causing significant changes in outputs using some empirical data test cases 3. Moved haplotype's cigar calculation code from Path to CigarUtils; need that in order to gain independence from Path in some parts of the code. In any case that seems like a more natural location for that functionality.	2014-03-04 12:22:14 -05:00
Karthik Gururaj	a893765ae2	Added license to Makefile	2014-03-03 09:11:02 -08:00
Karthik Gururaj	7cd23543a1	Added public license text to all C++ files	2014-03-03 09:04:00 -08:00
Eric Banks	22ad18b919	Moving Reduce Reads to the archive. The GATK now fails with a user error if you try to run with a reduced bam. (I added a unit test for that; everything else here is just the removal of all traces of RR)	2014-03-02 02:03:14 -05:00
Khalid Shakir	387188e5bb	Attempting to limit gc during Maven tests, using defaults found in JavaCommandLineFunction	2014-03-01 15:24:45 +08:00
Karthik Gururaj	1b395a871a	1. Changed logger.info to logger.warn in PairHMMLikelihoodCalculationEngine.java 2. Committing the right set of files after rebase	2014-02-28 16:08:28 -08:00
Karthik Gururaj	37526dfad5	1. Added the catch UnsatisfiedLinkError exception in PairHMMLikelihoodCalculationEngine.java to fall back to LOGLESS_CACHING in case the native library could not be loaded. Made VECTOR_LOGLESS_CACHING as the default implementation. 2. Updated the README with Mauricio's comments 3. baseline.cc is used within the library - if the machine supports neither AVX nor SSE4.1, the native library falls back to un-vectorized C++ in baseline.cc. 4. pairhmm-1-base.cc: This is not part of the library, but is being heavily used for debugging/profiling. Can I request that we keep it there for now? In the next release, we can delete it from the repository. 5. I agree with Mauricio about the ifdefs. I am sure you already know, but just to reassure you the debug code is not compiled into the library (because of the ifdefs) and will not affect performance.	2014-02-28 08:59:55 -08:00
Chris Whelan	e61ba8b340	Added command line checks for duplicate files in ROD lists -- Keep a list of processed files in ArgumentTypeDescriptor.getRodBindingsCollection -- Throw user exception if a file name duplicates one that was previously parsed -- Throw user exception if the ROD list is empty -- Added two unit tests to RodBindingCollectionUnitTest	2014-02-27 13:32:18 -05:00
Karthik Gururaj	2d0ce45bb0	Moved JNI_README	2014-02-27 10:12:23 -08:00
Karthik Gururaj	c645725fc3	1. Renamed directory structure from public/c++/VectorPairHMM to public/VectorPairHMM/src/main/c++ as per Khalid's suggestion 2. Use java.home in public/VectorPairHMM/pom.xml to pass environment variable JRE_HOME to the make process. This is needed because the Makefile needs to compile JNI code with the flag -I<JRE_HOME>/../include (among others). Assuming that the Maven build process uses a JDK (and not just a JRE), the variable java.home points to the JRE inside maven. 3. Dropped all pretense at cross-platform compatibility. Removed Mac profile from pom.xml for VectorPairHMM	2014-02-26 15:17:15 -08:00
Karthik Gururaj	bd71ba35e5	Moved pom.xml to VectorPairHMM and updated artifactId	2014-02-26 14:01:46 -08:00
Khalid Shakir	da587d48ed	Using absolute paths in generated diff commands, to ease running them from any directory.	2014-02-27 04:43:39 +08:00
Khalid Shakir	c163e6d0d2	Separate failsafe directories for each of the integration test types [#66515572 ]	2014-02-27 04:43:39 +08:00
Karthik Gururaj	b81e2c2948	Native library part of git repo	2014-02-26 11:47:42 -08:00
Karthik Gururaj	0fe843bfd9	Followed Khalid's suggestion for packing libVectorLoglessCaching into the jar file with Maven	2014-02-26 11:47:42 -08:00
Karthik Gururaj	15fe244e4b	Now has PAPI values	2014-02-26 11:47:42 -08:00
Intel Repocontact	e32e9e6af6	Merge branch 'master' of github.com:broadinstitute/gsa-unstable	2014-02-26 11:47:01 -08:00
Intel Repocontact	ff2a972ab5	Merge branch 'master' of github.com:broadinstitute/gsa-unstable Conflicts: .gitignore	2014-02-25 20:56:28 -08:00
Khalid Shakir	f02ce6eca7	Added tests for cleaning up scattered .bai files, and using the log directory. Re-added import java.io.File for BamGatherFunction. Other cleanup to resolve scala syntax warnings from intellij. Moved Example UG script to from protected to public.	2014-02-26 02:11:28 +08:00
pdexheimer	0405afeab2	Inherit BamGatherFunction from MergeSamFiles rather than PicardBamFunction - This change means that BamGatherFunction will now have an @Output field for the BAM index, which will allow the bai to be deleted for intermediate functions Signed-off-by: Khalid Shakir <kshakir@broadinstitute.org>	2014-02-26 02:11:28 +08:00
pdexheimer	504c125c26	Ensure .out files are saved into logDirectory Signed-off-by: Khalid Shakir <kshakir@broadinstitute.org>	2014-02-26 02:11:28 +08:00
pdexheimer	51dcd364a5	Added logDirectory argument Signed-off-by: Khalid Shakir <kshakir@broadinstitute.org>	2014-02-26 02:11:28 +08:00
Khalid Shakir	7e516b294f	Replaced local drmaa and Jama artifacts with versions from maven central. Removed unused caliper binary from local repo.	2014-02-22 01:21:35 +08:00
Khalid Shakir	a75043b207	When git describe fails use "exported" instead of "unknown".	2014-02-22 01:21:35 +08:00
Khalid Shakir	4670c87313	Fixed mvn run for packagetests over external-example.	2014-02-22 01:21:34 +08:00
Khalid Shakir	70ecce2a0f	Fixed scope for test-jar depedencies.	2014-02-22 01:21:34 +08:00
Eric Banks	235f0c6fa0	Merge pull request #528 from broadinstitute/eb_fix_cat_variants_usage_message Fix the usage message for CatVariants to make it accurate.	2014-02-19 22:45:22 -05:00
Eric Banks	341d1bf2dd	Fix the usage message for CatVariants to make it accurate. It just hit a user on our forum...	2014-02-19 20:42:08 -05:00
Valentin Ruano-Rubio	c167fb5fdf	Fixing GenotypesGVCF. Bug uncovered by some untrimmed alleles in the single sample pipeline output. Notice however does not fix the untrimmed alleles in general. Story: https://www.pivotaltracker.com/story/show/65481104 Changes: 1. Fixed the bug itself. 2. Fixed non-working tests (sliently skipped due to exception in dataProvider).	2014-02-19 14:20:39 -05:00
Ryan Poplin	43c20264b0	Initial commit of the random forest classifier.	2014-02-17 13:07:27 -05:00
Khalid Shakir	a505db79f5	Fixed build bug in ./ant-bridge.sh unittest -Dsingle=..., due to external-example. pipeline.run property no longer required to be passed by test executor.	2014-02-15 13:52:20 +08:00
droazen	1e82f117ad	Merge pull request #518 from broadinstitute/ks_skashin_gatkdocs_arguments Ks skashin gatkdocs arguments	2014-02-14 13:57:19 -05:00
Eric Banks	f6022a944b	Merge pull request #513 from broadinstitute/eb_clean_up_genotype_posteriors Various small fixes for CalculateGenotypePosteriors based on feedback fr...	2014-02-14 13:50:46 -05:00
Eric Banks	3724d4e5f3	Various small fixes for CalculateGenotypePosteriors based on feedback from guys in Ben Neale's group. Note that this tool is still a work in progress and very experimental, so isn't 100% stable. Most of the features are untested (both by people and by unit/integration tests) because Chris Hartl implemented it right before he left, and we're going to need to add tests at some point soon. I added a first integration test in this commit, but it's just a start. The fixes include: 1. Stop having the genotyping code strip out AD values. It doesn't make sense that it should do this so I don't know why it was doing that at all. Updated GenotypeGVCFs so that it doesn't need to manually recover them anymore. This also helps CalculateGenotypePosteriors which was losing the AD values. Updated code in LeftAlignAndTrimVariants to strip out PLs and AD, since it wasn't doing that before. Updated the integration test for that walker to include such data. 2. Chris was calling Math.pow directly on the normalized posteriors which isn't safe. Instead, the normalization routine itself can revert back to log scale in a safe manner so let's use it. Also, renamed the variable to posteriorProbabilities (and not likelihoods). 3. Have CGP update the AC/AF/AN counts after fixing GTs.	2014-02-14 13:48:14 -05:00
kshakir	8b136d53b9	Merge pull request #524 from broadinstitute/ks_symlink_bin_jar Create symlinks target/GenomeAnalysisTK.jar and target/Queue.jar	2014-02-15 02:32:59 +08:00
Khalid Shakir	bc9ac93b6c	Adding the external example to the build.	2014-02-15 01:26:07 +08:00

4249 Commits (e7d6db033bc099cd6abf800f759afe6fa8a64481)