gatk-3.8

Commit Graph

Author	SHA1	Message	Date
droazen	7cd304fb41	Merge pull request #470 from broadinstitute/mf_new_RBP Mf new rbp	2014-01-13 08:46:27 -08:00
Ryan Poplin	3b8209f3b2	Merge pull request #467 from broadinstitute/rp_fix_names_NA12878ROCCurve The ROC Curve report lists the name as the name of the vcf file now inst...	2014-01-09 06:56:34 -08:00
MauricioCarneiro	50cd6781b3	Merge pull request #465 from broadinstitute/eb_improvements_to_ref_confidence_merger Improvements to ref confidence merger	2014-01-08 10:51:01 -08:00
Ryan Poplin	8881926bc6	The ROC Curve report lists the name as the name of the vcf file now instead of project+name.	2014-01-08 09:44:21 -05:00
Ryan Poplin	c86e36c909	Merge pull request #466 from broadinstitute/rp_phase3_vqsr_scala Adding here the Qscript used to perform the VQSR for 1000 Genomes Projec...	2014-01-08 06:39:46 -08:00
Ryan Poplin	7d5a710ea6	Adding here the Qscript used to perform the VQSR for 1000 Genomes Project phase 3	2014-01-08 09:38:13 -05:00
Eric Banks	553b3e56bd	Merge pull request #463 from broadinstitute/eb_fix_realigner_bugs_from_pearson Fixed edge condition in the realigner where a realigned read can sometim...	2014-01-08 05:36:11 -08:00
Eric Banks	0323caefc8	Added some bug fixes to the gVCF merging code after finally getting some real data to play with. Still under construction, awaiting more test data from Valentin.	2014-01-08 08:34:35 -05:00
Eric Banks	f172c349f6	Adding the functionality to enable users to input a file of VCFs for -V. To do this I have added a RodBindingCollection which can represent either a VCF or a file of VCFs. Note that e.g. SelectVariants allows a list of RodBindingCollections so that one can intermix VCFs and VCF lists. For VariantContext tags with a list, by default the tags for the -V argument are applied unless overridden by the individual line. In other words, any given line can have either one token (the file path) or two tokens (the new tags and the file path). For example: foo.vcf VCF,name=bar bar.vcf Note that a VCF list file name must end with '.list'. Added this functionality to CombineVariants, CombineReferenceCalculationVariants, and VariantRecalibrator.	2014-01-08 00:45:00 -05:00
Eric Banks	c133909d32	Fixed edge condition in the realigner where a realigned read can sometimes get partially aligned off the end of the contig. Now we ignore such reads (which is much easier than trying to figure out when to soft-clip). Added unit test.	2014-01-08 00:37:28 -05:00
Menachem Fromer	e33d3dafc6	Add documentation for RBP, and also update the MD5 for the tests now that the output uses HP tags instead of '|', which is now reserved for trio-based phasing	2014-01-03 12:04:47 -05:00
Menachem Fromer	d1275651ae	Merge remote-tracking branch 'origin/master' into mf_new_RBP	2014-01-03 01:13:40 -05:00
Eric Banks	f6a44afa3a	Merge pull request #464 from broadinstitute/eb_rev_variant_jar_for_bcf_fixes Rev'ing the Variant jar to incorporate some patches to the BCF encoder t...	2014-01-02 21:05:13 -08:00
Eric Banks	856c17868b	Rev'ing the Variant jar to incorporate some patches to the BCF encoder that Menachem needs.	2014-01-02 23:33:17 -05:00
Ryan Poplin	5c32ad174a	Merge pull request #452 from broadinstitute/rp_vqsr_aggregate_model Allow for additional input data to be used in the VQSR for clustering bu...	2014-01-02 12:54:45 -08:00
Ryan Poplin	856c1f87c1	Allow for additional input data to be used in the VQSR for clustering but don't carry it forward into the output VCF file. -- New -a argument in the VQSR for specifying additional data to be used in the clustering -- New NA12878KB walker which creates ROC curves by partitioning the data along VQSLOD and calculating how many KB TP/FP's are called.	2014-01-02 14:46:04 -05:00
Ryan Poplin	c82501ac35	Merge pull request #462 from broadinstitute/rp_SingleSampleHC_exome_scala Adding SingleSampleHC_exome.scala for Valentin to use as a jumping off p...	2014-01-02 08:57:27 -08:00
Ryan Poplin	15372c4873	Adding SingleSampleHC_exome.scala for Valentin to use as a jumping off point.	2014-01-02 11:56:17 -05:00
amilev	f81a38f596	Merge pull request #446 from broadinstitute/ami-RNAseq-tools Write a new tool for splitting reads that have N cigar string.	2014-01-01 21:06:25 -08:00
MauricioCarneiro	1223345726	Merge pull request #459 from broadinstitute/eb_fix_bad_hmm_clipping Fixed up edge condition for clipping long reads in the HMM.	2014-01-01 20:00:34 -08:00
Ami Levy-Moonshine	6da53aea09	Write a new tool for splitting reads that have N cigar string. For example, this tool can be used for processing bowtie RNA-seq data. Each read with k N-cigar elements is split into k+1 reads. The split is done by hard clipping the rest of the bases. In order to do it, a few changes were introduced to some other clipping methods: - make a significant change in ClippingOp.hardClip() that prevents the splitting of a read with cigar: 1M2I1N1M3I. - change getReadCoordinateForReferenceCoordinate in ReadUtils to recognize Ns create unit tests for that walker: - change ReadClipperTestUtils to be more general in order to use its code and avoid code duplication - move some useful methods from ReadClipperTestUtils to CigarUtils create integration test for that class small change in a comment in FullProcessingPipeline last commit: Address review comments: - move to protected under walkers/rnaseq - change the read splitting methods to be more readable and more efficient - change (minor changes) some methods in ReadClipper to allow the changes in split reads - add (minor change) one method to CigarUtils to allow the changes in split reads - change ReadUtils.getReadCoordinateForReferenceCoordinate to include possible N in the cigar - address the rest of the review comments (minor changes) - fix ReadUtilsUnitTest.testReadWithNs according to the default behaviour of getReadCoordinateForReferenceCoordinate (in case of a reference index that falls into a deletion, return the read index of the base before the deletion). - add another test to ReadUtilsUnitTest.testReadWithNs - Allow the user to print the split positions (not working properly currently)	2014-01-01 22:21:36 -05:00
Eric Banks	bb4c4b1fcd	Fixed up edge condition for clipping long reads in the HMM. MD5s change because some reads were incorrectly getting clipped before. [delivers #62584746]	2014-01-01 19:05:09 -05:00
Eric Banks	ece346689c	Merge pull request #460 from broadinstitute/mc_document_readclippingstats Better documentation for ReadClippingStats walker	2014-01-01 16:00:40 -08:00
Eric Banks	154bab0849	Merge pull request #461 from broadinstitute/mc_make_comparebams_private Move CompareBAMs to private	2014-01-01 15:58:05 -08:00
Mauricio Carneiro	d52bd44867	Move CompareBAMs to private This is a tool that we use internally to validate the ReduceReads development. I think it should be private. There is no need to improve docs. [delivers #54703398]	2014-01-01 14:33:23 -05:00
Mauricio Carneiro	d1febb89c8	Better documentation for ReadClippingStats walker * add overall walker GATKDocs * add explanation for skip parameter and make it advanced * reverse the logic on excluding unmapped reads for clarity * fix read length calculation to no longer include indels ps: I am not sure how useful this walker is (I didn't write it) but the skip logic is poor and calculates the entire statistic for the reads it is eventually going to skip. This would be an easy fix, but only worth our time if people actually use this.	2014-01-01 14:26:26 -05:00
Eric Banks	9355598129	Merge pull request #458 from broadinstitute/eb_dont_fail_when_using_incompatible_annotation Don't fail in annotations if the wrong tools are calling them, just silently skip them.	2013-12-31 21:22:26 -08:00
Eric Banks	050ca8ae09	Merge pull request #457 from broadinstitute/eb_rev_variant_for_doc_updates Updating variant jar.	2013-12-31 20:49:20 -08:00
Eric Banks	9665f75ad4	Don't fail in annotations if the wrong tools are calling them, just silently skip them. This is important for cases when users want to use annotation groups (like all experimental annotations).	2013-12-31 23:45:21 -05:00
Eric Banks	f82a7c3f4c	Updating variant jar. The update contains: 1. documentation changes for VariantContext and Allele (which used to discuss the now obsolete null allele) 2. better error messages for VCFs containing complex rearrangements with breakends 3. instead of failing badly on format field lists with '.'s, just ignore them Also, there is a trivial change to use a more efficient method to remove a bunch of attributes from a VC. Delivers PT#s 59675378, 59496612, and 60524016.	2013-12-31 22:48:29 -05:00
Eric Banks	5a1564d1f2	Merge pull request #456 from broadinstitute/eb_unify_hc_combination_steps Created a new walker to do the full combination of N gVCFs from the HC single-sample ref calc pipeline.	2013-12-31 18:57:27 -08:00
Eric Banks	83e09b1f64	Created a new walker to do the full combination of N gVCFs from the HC single-sample ref calc pipeline. Basically, it does 3 things (as opposed to having to call into 3 separate walkers): 1. merge the records at any given position into a single one with all alleles and appropriate PLs 2. re-genotype the record using the exact AF calculation model 3. re-annotate the record using the VariantAnnotatorEngine In the course of this work it became clear that we couldn't just use the simpleMerge() method used by CombineVariants; combining HC-based gVCFs is really a complicated process. So I added a new utility method to handle this merging and pulled any related code out of CombineVariants. I tried to clean up a lot of that code, but ultimately that's out of the scope of this project. Added unit tests for correctness testing. Integration tests cannot be used yet because the HC doesn't output correct gVCFs.	2013-12-31 12:07:56 -05:00
Eric Banks	9394af1230	Merge pull request #454 from jsilter/master Make na12878kb functionality more transparent to users	2013-12-19 08:47:24 -08:00
Menachem Fromer	48ef7a1a2f	Merge remote-tracking branch 'origin/master' into mf_new_RBP	2013-12-19 10:42:20 -05:00
Eric Banks	26a7082018	Merge pull request #455 from broadinstitute/dr_add_min_max_argument_values Add ability to specify min/max required/recommended values for numeric arguments in the @Argument annotation	2013-12-18 20:40:06 -08:00
David Roazen	4a79831adc	Add ability to specify min/max required/recommended values for numeric arguments in the @Argument annotation -You can now add "minValue", "maxValue", "minRecommendedValue", and "maxRecommendedValue" attributes to @Argument annotations for command-line arguments -"minValue" and "maxValue" specify hard limits that generate an exception if violated -"minRecommendedValue" and "maxRecommendedValue" specify soft limits that generate a warning if violated -Works only for numeric arguments (int, double, etc.) with @Argument annotations -Only considers values actually specified by the user on the command line, not default values assigned in the code As requested by Geraldine	2013-12-18 18:09:08 -05:00
Jacob Silterra	0c7ea2d823	Add label and specVersion fields to MongoDBManager.Locator Add "BLANK" option for DBType Want to get away from adding extensions to dbname	2013-12-18 17:21:53 -05:00
Eric Banks	d32c900018	Merge pull request #453 from broadinstitute/eb_rev_variant_for_validation_bug Updated the variant jar to grab a bug fix that I made to it	2013-12-18 11:13:11 -08:00
Eric Banks	265cb3eb5b	Updated the variant jar to grab a bug fix that I made to it	2013-12-17 11:52:34 -05:00
Valentin Ruano Rubio	5ed627d448	Merge pull request #450 from broadinstitute/vrr_graphLikelihoods_fix250PCRFree Fixed issue > 0 log likelihoods using GraphBased likelihood engine reported by Mauricio	2013-12-13 09:22:46 -08:00
Valentin Ruano-Rubio	5db520c6fa	Fixed issue > 0 log likelihoods using GraphBased likelihood engine reported by Mauricio Added some integration test to check on the fix	2013-12-13 11:19:57 -05:00
Eric Banks	3e8feff429	Merge pull request #451 from broadinstitute/jt_mongo_migration Move the SelectVariantsFromMongo helper classes to archive	2013-12-13 07:40:35 -08:00
Joel Thibault	58217a5c4b	Move the SelectVariantsFromMongo helper classes to archive	2013-12-12 18:50:10 -05:00
Bertrand	d6169a28cd	Merge pull request #448 from broadinstitute/eb_add_stuff_to_the_bundle Eb add stuff to the bundle	2013-12-12 07:31:59 -08:00
Eric Banks	400e7c1404	Fixed bug in the filtering of lifted over variants where a deletion at the end of a contig could cause it to error out. Added a unit test.	2013-12-11 14:07:18 -05:00
Eric Banks	ab33db625f	Merge pull request #449 from broadinstitute/eb_move_calc_posteriors_to_protected Moved CalculatePosteriors from private to protected, in preparation for 3.0	2013-12-07 22:18:46 -08:00
Eric Banks	f1970b923e	Moved CalculatePosteriors from private to protected, in preparation for 3.0. Renamed it CalculateGenotypePosteriors. Also, moved the utility code to a proper utility class instead of where Chris left it. No actual code modifications made in this commit.	2013-12-08 00:08:34 -05:00
Eric Banks	418fbdfbab	Added HC trio calls and NA12878 KB snapshot to resource bundle. Also, don't touch the current link until the resources are finished being produced.	2013-12-07 22:08:34 -05:00
David Roazen	932cd3ada7	Fix 3rd-party library dependency issues in the HC/PairHMM tests In general, test classes cannot use 3rd-party libraries that are not also dependencies of the GATK proper without causing problems when, at release time, we test that the GATK jar has been packaged correctly with all required dependencies. If a test class needs to use a 3rd-party library that is not a GATK dependency, write wrapper methods in the GATK utils/* classes, and invoke those wrapper methods from the test class.	2013-12-06 13:16:55 -05:00
Eric Banks	70e2d21e12	Merge remote-tracking branch 'unstable/master'	2013-12-06 11:45:12 -05:00
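
Commit f172c349f6 above describes the '.list' file format accepted by -V: each line holds either one token (the file path) or two tokens (the new tags and the file path), and the tags given on the -V argument apply unless a line overrides them. A minimal Python sketch of that rule follows. It is not the GATK implementation; the names parse_vcf_list and is_vcf_list, and the choice to skip blank lines, are assumptions made for illustration.

```python
from typing import Iterable, List, Tuple


def is_vcf_list(path: str) -> bool:
    """Per the commit message, a VCF list file name must end with '.list'."""
    return path.endswith(".list")


def parse_vcf_list(lines: Iterable[str], default_tags: str) -> List[Tuple[str, str]]:
    """Return (tags, path) pairs for each entry of a VCF list file.

    One token  -> the file path; the default (-V) tags are applied.
    Two tokens -> the new tags followed by the file path.
    Blank lines are skipped (an assumption, not stated in the commit).
    """
    entries = []
    for raw in lines:
        tokens = raw.split()
        if not tokens:
            continue
        if len(tokens) == 1:
            entries.append((default_tags, tokens[0]))
        elif len(tokens) == 2:
            entries.append((tokens[0], tokens[1]))
        else:
            raise ValueError(f"expected one or two tokens, got: {raw!r}")
    return entries


# The two example lines from the commit message, with hypothetical default tags "VCF".
example = parse_vcf_list(["foo.vcf", "VCF,name=bar bar.vcf"], default_tags="VCF")
```

Here the first entry inherits the default tags and the second carries its own, which mirrors the override behaviour the commit message describes.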

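Commit 6da53aea09 above states that a read with k N-cigar elements is split into k+1 reads. The sketch below illustrates only that counting rule by cutting a CIGAR string at each N element. It does not perform the hard clipping the commit describes, and split_cigar_at_n is a hypothetical helper, not part of GATK.

```python
import re
from typing import List

# One CIGAR element: a length followed by an operator from the SAM specification.
CIGAR_ELEMENT = re.compile(r"(\d+)([MIDNSHP=X])")


def split_cigar_at_n(cigar: str) -> List[str]:
    """Split a CIGAR string into the segments that lie between N elements."""
    elements = CIGAR_ELEMENT.findall(cigar)
    # Reject input that the pattern did not consume completely.
    if "".join(length + op for length, op in elements) != cigar:
        raise ValueError(f"malformed CIGAR string: {cigar!r}")

    segments = []
    current = []
    for length, op in elements:
        if op == "N":
            # An N element closes the current segment; the N itself is dropped.
            segments.append("".join(current))
            current = []
        else:
            current.append(length + op)
    segments.append("".join(current))
    return segments


# Two N elements (k = 2) give three segments (k + 1).
parts = split_cigar_at_n("3M5N4M2N6M")
```

A CIGAR that starts or ends with N would yield an empty segment under this sketch; a real tool would need to decide how to treat that case.
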
12936 Commits (7cd304fb41d9e7bfc0bf41edc74b87f1ef943373)