Commit Graph

7318 Commits (0037b61e5d206e3eb94de7076bae22c6f001c34f)

Author SHA1 Message Date
Mark DePristo 0037b61e5d Class of scala file should be close to filename, not MDP 2011-09-07 14:48:54 -04:00
Mark DePristo 3bcbfa6e06 Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-09-07 13:13:17 -04:00
Mark DePristo 430da23446 At least 2 minutes must pass before a status message is printed, further stabilizing time estimates 2011-09-07 13:13:07 -04:00
Mauricio Carneiro 6857d0324e Merge branch 'master' into rr 2011-09-07 12:59:08 -04:00
Mark DePristo 7e9e20fed0 Forgot to delete previous call 2011-09-07 12:54:52 -04:00
Mark DePristo d23d620494 Pushing traversal engine timer start to as close to actual start as possible
-- Should make initial timings more accurate
2011-09-07 12:52:33 -04:00
Mark DePristo 6ff432e1f2 BugFix for TF argument to VariantEval, actually making it work properly 2011-09-07 12:50:17 -04:00
Mauricio Carneiro 131cb7effd Bringing Reduce Reads bug fixes to the main repository 2011-09-07 12:25:53 -04:00
Mark DePristo a1920397e8 Major bugfix for per sample VariantEval
-- per sample stratification was not being calculated correctly.  The alt allele was always remaining, even if the genotype of the sample was hom-ref.  Although conceptually fine, this breaks the assumptions of all of the eval modules, so per sample stratifications actually included all variants for everything.  Eric is going to fix the system in general, so this commit may break the build.
2011-09-07 12:18:11 -04:00
Mark DePristo d5641cfac5 Merge branch 'variantEvalST' 2011-09-07 10:44:23 -04:00
Mark DePristo 2f4cf82e3b VariantEval cleanup. Added VariantType Stratification
-- ArrayList are List where possible
-- states refactored into VariantStratifier base class (reduces many lines of duplicate code)
-- Added VariantType stratification that partitions report by VariantContext.Type
2011-09-07 10:43:53 -04:00
Christopher Hartl 436f6eb52b Reverting Eric's change and pushing in some command-line-option documentation. 2011-09-07 08:53:30 -04:00
Eric Banks 1ef8a1750a I asked nicely and got nothing. Then I threatened and still got nothing. So I am carrying through on my threats. Guillermo, you have a short reprieve because you were away on vacation, but let's get yours done tomorrow afternoon. 2011-09-06 21:07:49 -04:00
Eric Banks da9c8ab386 Revving the Tribble jar where the DbsnpCodec class was renamed to OldDbsnpCodec. Updating GATK code accordingly. 2011-09-06 20:39:42 -04:00
Mark DePristo 9559115ad5 Bugfix for singleton runs. Now with histograms where possible 2011-09-06 16:54:01 -04:00
Mark DePristo 388d377677 Merge branch 'rrOpt' 2011-09-06 15:11:49 -04:00
Mark DePristo 3db7ecb920 ReducedRead flag cached in GATKSAMRecord. 20% performance improvement 2011-09-06 15:11:38 -04:00
Mark DePristo 2d5509e8a6 Now includes the RQ flag in the consensus reads 2011-09-06 15:11:38 -04:00
Mark DePristo 284f83469b ReducedRead flag cached in GATKSAMRecord. 20% performance improvement 2011-09-06 15:09:37 -04:00
Mark DePristo f3ab7d7c0d Now includes the RQ flag in the consensus reads 2011-09-06 14:42:27 -04:00
Roger Zurawicki 47607a7eff Fixed bug where deletions messed up interval clipping
- Instead of using readLength, the ReadUtil functions are used to get a proper read coordinate
- Added debug info in interval clipping (with -dl)
NOTE: method might not be safe for production and checks need to be added to the ClippingOp code
2011-09-06 14:25:57 -04:00
Mark DePristo b0b803aa00 Fix for default value of maximum_consensus_base_qual, which can be at most 94 2011-09-06 14:12:54 -04:00
Khalid Shakir 0adb388dee Fixed bug in SelectVariants that was annotating sample_file / exclude_sample_file as @Argument instead of @Input meaning they weren't tracked in Queue.
Updates for HybridSelectionPipeline:
- Use VQSR on SNPs for projects using bait set whole_exome_agilent_1 and applying cut at 98.5.
- If a whole_exome_agilent_1 project has less than 50 samples also mixing in 1000G samples to reach VQSR thresholds.
- Updated SNP hard filters based on analysis done with ebanks to approximate VQSR results on small target batches.
- Removed GSA_PRODUCTION_ONLY flag from indel caller.
- Updated indel hard filters based on delangel's analysis.
- Updated HybridSelectionPipelineTest to use HARD SNP filters only, for now.
2011-09-06 12:41:46 -04:00
Mauricio Carneiro 28d782b4c7 Allowing multiple dbsnp and indel files in the DPP 2011-09-02 13:38:47 -04:00
Ryan Poplin 8da36a965e Moving the ReadClipping further upstream into the HaplotypeCaller 2011-09-02 13:27:47 -04:00
Mauricio Carneiro 08ae6c0c61 ReadClipper is now handling unmapped reads 2011-09-02 11:32:30 -04:00
Eric Banks d241f0e903 Adding docs for the pcr error rate argument. 2011-09-01 21:57:02 -04:00
Mauricio Carneiro ad4ea0b80b Merged bug fix from Stable into Unstable 2011-09-01 18:14:45 -04:00
Mauricio Carneiro e253f6f05d Fixing typo in DPP
platform and library were exchanged when rebuilding the read group information
2011-09-01 18:13:52 -04:00
Mauricio Carneiro d2a33beff7 Added WGS/WEX b37-decoy CEU trio datasets 2011-09-01 13:14:40 -04:00
Eric Banks 827fe6130c Adding hidden printing option. Also, always run UG in mode GENOTYPE_GIVEN_ALLELES given that we don't actually test for the correct alleles (otherwise UG may choose a different allele and we may falsely validate the wrong one). 2011-09-01 11:40:35 -04:00
Mark DePristo 0b794b5491 Revving Tribble to 23 2011-09-01 10:43:03 -04:00
Mark DePristo 1aa4b12ff0 Reduced the number of combinations being tested here, which was overkill 2011-09-01 10:42:43 -04:00
Mark DePristo ac49b8d26b Conditional support for PerformanceTrackingQuerySource to measure Tribble / GATK bridge performance
-- Removed DEBUG option, instead use MEASURE_TRIBBLE_QUERY_PERFORMANCE in RMDTrackerBuilder
2011-09-01 10:41:55 -04:00
Mark DePristo 2b97017871 Fix @Output that should be @Argument 2011-09-01 10:37:20 -04:00
Ryan Poplin a522488121 First pass at using the indel realigner code to realign reads to the newly assembled haplotypes. Misc bug fix in genotyping engine when both haplotype's alt alleles were deletions. 2011-09-01 09:55:30 -04:00
Ryan Poplin 8529075e60 misc cleanup 2011-08-31 18:53:41 -04:00
Ryan Poplin 70a1ba6fa1 Using ReadClipper to clip the reads instead of manually looking for strings of consecutive high quality bases 2011-08-31 18:52:08 -04:00
Ryan Poplin f57547ad63 Enabling soft clipped bases in the assembly engine of the haplotype caller 2011-08-31 18:21:56 -04:00
Matt Hanna dd89755e74 Merged bug fix from Stable into Unstable 2011-08-31 17:28:44 -04:00
Matt Hanna 65a9159ac6 Point ivy to the maven repo instead of the default ibiblio repo. Drastically simplify ivy config by completely cutting out module specifications.
2011-08-31 17:27:25 -04:00
Mauricio Carneiro 4b5a7046c5 Making ReadLengthDistribution Public
Found this neat little walker Kiran wrote stashed in the private tree. Very useful. Generalized it a bit, added GATKDocs and moved it to public. I might include it as a QC step on the pacbio processing pipeline.
* generalize it so it works with non pair ended reads.
* generalize it to work with no read group information
2011-08-31 15:52:28 -04:00
Mark DePristo d604019362 Finished my broken tribble code. Updated to rev 22 2011-08-30 16:56:48 -04:00
David Roazen 2585728f03 Fixing the build 2011-08-30 12:45:53 -04:00
Mark DePristo bdf04b8057 Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-30 11:08:23 -04:00
Mark DePristo 173ca1e215 Reverting tribble temporarily while I fix my subtle problems 2011-08-30 11:08:13 -04:00
Ryan Poplin e040ea5c0d Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-08-30 08:34:01 -04:00
Ryan Poplin 60582215be Upping the kmer length in order to see larger events 2011-08-30 08:33:49 -04:00
Mauricio Carneiro 7d79de91c5 Merge branch 'master' into rr 2011-08-30 02:50:19 -04:00
Mauricio Carneiro 0cd9438ac2 fixed soft unclipped calculation
* getRefCoordSoftUnclippedEnd was not resetting the shift when hitting insertions. Fixed.
* getReadCoordinateForReferenceCoordinateBeforeAlignmentEnd was returning the wrong read coordinate position. Fixed.
2011-08-30 02:45:29 -04:00