Mark DePristo
9b2be795a7
Initial working version of new ActiveRegionTraversal based on the LocusIteratorByState read stream
...
-- Implemented as a subclass of TraverseActiveRegions
-- Passes all unit tests
-- Will be very slow -- needs logical fixes
2013-01-11 15:17:17 -05:00
Mark DePristo
8b83f4d6c7
Near final cleanup of PileupElement
...
-- All functions documented and unit tested
-- New constructor interface
-- Clean up some uses of old / removed functionality
2013-01-11 15:17:17 -05:00
Mark DePristo
fb9eb3d4ee
PileupElement and LIBS cleanup
...
-- function to create pileup elements in AlignmentStateMachine and LIBS
-- Clean up pileup element constructors, directing users to LIBS.createPileupFromRead(), which does the right thing
2013-01-11 15:17:17 -05:00
Mark DePristo
2f2a592c8e
Contracts and documentation for AlignmentStateMachine and LocusIteratorByState
...
-- Add more unit tests for both as well
2013-01-11 15:17:17 -05:00
Mark DePristo
cc1d259cac
Implement getLengthOfImmediatelyFollowingIndel and getBasesOfImmediatelyFollowingIndel in PileupElement
...
-- Added unit tests for this behavior. Updated users of this code
2013-01-11 15:17:17 -05:00
Mark DePristo
2c38310868
Create LIBS using new AlignmentStateMachine infrastructure
...
-- Optimizations to AlignmentStateMachine
-- Properly count deletions. Added unit test for counting routines
-- AlignmentStateMachine.java is no longer recursive
-- Traversals now use new LIBS, not the old one
2013-01-11 15:17:17 -05:00
Mark DePristo
80d9b7011c
Complete rewrite of low-level machinery of LIBS, not hooked up
...
-- AlignmentStateMachine does what SAMRecordAlignmentState should really do. It's correct in that it's more accurate than the LIB_position tests themselves. This is a non-broken, correct implementation. Needs cleanup, contracts, etc.
-- This version is roughly 6x slower than the original implementation (according to the Google Caliper benchmark here). Obvious optimizations are left for a future commit
2013-01-11 15:17:16 -05:00
Mark DePristo
0ac4352614
LIBS can now (optionally) track the unique reads it uses from the underlying read iterator
...
-- This capability is essential to provide an ordered set of used reads to downstream users of LIBS, such as ART, who want an efficient way to get the reads used in LIBS
-- Vastly expanded the multi-read, multi-sample LIBS unit tests to make sure this capability is working
-- Added createReadStream to ArtificialSAMUtils that makes it relatively easy to create multi-read, multi-sample read streams for testing
2013-01-11 15:17:16 -05:00
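As an illustration of the read tracking described in 0ac4352614, here is a minimal sketch of an iterator wrapper that optionally records the unique items it hands out, in first-seen order. The class and method names (TrackingIterator, drainSeen) are hypothetical and are not taken from LIBS; the actual implementation may differ substantially.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch, not the LIBS code: wraps an iterator and, if asked,
// remembers each distinct element it returns, preserving first-seen order.
final class TrackingIterator<T> implements Iterator<T> {
    private final Iterator<T> underlying;
    private final boolean track;
    private final Set<T> seen = new LinkedHashSet<>();

    TrackingIterator(Iterator<T> underlying, boolean track) {
        this.underlying = underlying;
        this.track = track;
    }

    @Override
    public boolean hasNext() {
        return underlying.hasNext();
    }

    @Override
    public T next() {
        T item = underlying.next();
        if (track) {
            seen.add(item); // duplicates are ignored by the set
        }
        return item;
    }

    // Hands the tracked elements to the caller and resets the tracker,
    // so a downstream consumer can collect them in batches.
    List<T> drainSeen() {
        List<T> result = new ArrayList<>(seen);
        seen.clear();
        return result;
    }

    public static void main(String[] args) {
        TrackingIterator<String> it =
            new TrackingIterator<>(Arrays.asList("r1", "r2", "r1", "r3").iterator(), true);
        while (it.hasNext()) {
            it.next();
        }
        System.out.println(it.drainSeen());
    }
}
```

A LinkedHashSet is used here because it gives both uniqueness and a stable order; whether LIBS orders reads this way is an assumption of the sketch.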
Mark DePristo
b3ecfbfce8
Refactor LIBS into component parts, expand unit tests, some code cleanup
...
-- Split out all of the inner classes of LIBS into separate independent classes
-- Split / add unit tests for many of these components.
-- Radically expand unit tests for SAMRecordAlignmentState (the lowest level piece of code) making sure at least some of it works
-- No need to change unit tests or integration tests. No change in functionality.
-- Added (currently disabled) code to track all submitted reads to LIBS, but this isn't accessible or tested
2013-01-11 15:17:16 -05:00
Mark DePristo
2e5d38fd0e
Updating to latest google caliper code
2013-01-11 15:17:16 -05:00
Mark DePristo
b2990497e2
Refactor LIBS into utils.locusiterator before refactoring
2013-01-11 15:17:16 -05:00
Mauricio Carneiro
9ed922d562
Updating licenses to Eric's last commit
...
- for now we're still running the script by hand; an automated solution will be in place soon.
GSATDG-5
2013-01-11 14:33:00 -05:00
Mauricio Carneiro
bc64d4240f
Licensing update -- batch #2
...
- caught all scala files that didn't have proper package information / class names
- included all source files in archive as well
GSATDG-5
2013-01-11 13:38:11 -05:00
Mauricio Carneiro
28235f57f2
Adding package information to scala scripts that were missing it. Including archived ones.
...
GSATDG-5
2013-01-11 13:38:05 -05:00
Eric Banks
e7906713d9
Moving some random walkers back to public as requested by Mark. Mauricio, will the licenses get updated automatically?
2013-01-11 02:03:43 -05:00
Ami Levy-Moonshine
352cb831d0
Merge branch 'master' of github.com:broadinstitute/gsa-unstable
2013-01-10 21:27:06 -05:00
Ami Levy-Moonshine
fac0bce916
add RunCoveredByNSamplesSites; changes in CoveredByNSamplesSites so it can work in parallel; also, move it to diagnostics
2013-01-10 21:26:49 -05:00
Mauricio Carneiro
ea8c8573d2
Fixing ParseLicense script for scala syntax
...
- Scala allows package objects in its syntax, so the script needs to be aware of that and not add "*/" every time it sees it.
GSATDG-5
2013-01-10 18:24:24 -05:00
Mauricio Carneiro
e5913e50b2
Updating licenses for all scala files
...
GSATDG-5
2013-01-10 17:46:10 -05:00
Mauricio Carneiro
2a4ccfe6fd
Updated all Java file licenses accordingly
...
GSATDG-5
2013-01-10 17:06:41 -05:00
Joel Thibault
3e52ce5fa8
Remove DepthOfCoverage.java because it is no longer public
...
- Move Pileup.java and PrintReads.java to their new homes
2013-01-10 11:45:38 -05:00
Ryan Poplin
487fb2afb4
Bug fix for the case of overlapping assembled and partially-assembled events created by the HC. Unfortunately the symbolic allele can't be combined with the indel allele because the reference basis will change.
2013-01-09 15:30:46 -05:00
Eric Banks
4fa439d89e
Move some classes back to public because they are used in the engine. Move some test classes to protected. We should have no more public->protected dependencies now
2013-01-09 11:06:10 -05:00
Eric Banks
676e79542a
Bring CombineVariants back to public since it's used for SG. I needed to break ChromosomeCountConstants out of ChromosomeCounts to make this work.
2013-01-09 10:39:48 -05:00
Ryan Poplin
c87ad8c0ef
Bug fixes related to HC's GGA mode. Tracking just the artificial allele isn't sufficient when there are multiple GGA records that change the reference basis. Also, duplicated records screw up the tracking of merged alleles.
2013-01-09 10:00:46 -05:00
Ami Levy-Moonshine
15ca5015cd
Merge branch 'master' of github.com:broadinstitute/gsa-unstable
2013-01-08 21:53:36 -05:00
Ami Levy-Moonshine
d6071728e8
add new walker to find sites with good coverage
2013-01-08 17:10:38 -05:00
Eric Banks
264cc9e78d
Resolve protected->public dependencies for BQSR by wrapping the BQSR-specific arguments in a new class.
...
Instead of the GATK Engine creating a new BaseRecalibrator (not clean), it just keeps track of the arguments (clean).
There are still some dependency issues, but it looks like they are related to Ami's code. Need to look into it further.
2013-01-08 16:23:29 -05:00
Eric Banks
f0bd1b5ae5
Okay, all public->protected dependencies are gone except for the BQSR arguments. I'll need to think through this but should be able to make that work too.
2013-01-08 15:46:32 -05:00
Eric Banks
245fcc8bb5
Merged bug fix from Stable into Unstable
2013-01-08 12:59:15 -05:00
Eric Banks
d6146d369a
Remove all of the references to ProgramElementDoc
2013-01-08 12:58:31 -05:00
Eric Banks
b099e2b4ae
Moving integration tests to protected
2013-01-08 09:34:08 -05:00
Eric Banks
47d030a52d
Oops, move the covariates over too
2013-01-07 15:47:25 -05:00
Eric Banks
35699a8376
Move bqsr utils to protected
2013-01-07 15:41:21 -05:00
Eric Banks
5371613ad1
Tests seem to pass (can't be positive though because I ran before Tad's recent push), so I'm going to push now (this push touches so many files that I don't want to keep it around much longer).
...
Merge branch 'master' of github.com:broadinstitute/gsa-unstable
2013-01-07 15:27:43 -05:00
Ami Levy-Moonshine
8bbb9e1cc2
Merge branch 'master' of github.com:broadinstitute/gsa-unstable
2013-01-07 15:07:25 -05:00
Ami Levy-Moonshine
d4b4f95e12
move CatVariants to public
2013-01-07 15:07:16 -05:00
Eric Banks
1a4b112865
Merge branch 'master' of github.com:broadinstitute/gsa-unstable
2013-01-07 15:00:35 -05:00
Eric Banks
a0219acfaa
Collapse the PerReadAlleleLikelihoodMap classes into 1 now that Lite is gone
2013-01-07 14:55:21 -05:00
Mauricio Carneiro
d3e2352072
Moved processing pipelines to private
...
These pipelines were supposed to serve as an example for the community. They were written a long-long-long time ago and are being used today by users as the 'best practice pipeline'. Unless we decide we want to support and maintain an example best-practices pipeline, I'm moving these to private.
2013-01-07 14:49:57 -05:00
Eric Banks
35d9bd377c
Moved (nearly) all Walkers from public to protected and removed GATKLite utils
2013-01-07 14:42:40 -05:00
Eric Banks
78f7a4e300
Received permission from Mauricio to archive the DPP and PBPP PipelineTests
2013-01-07 14:03:08 -05:00
Eric Banks
b4e7b3d691
Fixed precision problem in the Bayesian calculation of Qemp: we need to cap below max integer because the MathUtils code adds 1.
...
Added unit tests for handling large number of observations.
2013-01-07 13:07:36 -05:00
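The overflow behind the Qemp fix in b4e7b3d691 can be sketched as follows. This is only an illustration of why a count must be capped one below Integer.MAX_VALUE when downstream code adds 1 to it; the names (QempCapSketch, capObservations, plusOne) are hypothetical and do not reflect the actual BQSR or MathUtils code.

```java
// Hypothetical sketch, not GATK code: shows the int overflow that a cap of
// Integer.MAX_VALUE - 1 avoids when a later step adds 1 to the count.
final class QempCapSketch {
    // Leave headroom of 1 so that "count + 1" still fits in an int.
    static final int MAX_OBSERVATIONS = Integer.MAX_VALUE - 1;

    // Clamp a (possibly very large) observation count into the safe int range.
    static int capObservations(long nObservations) {
        return (int) Math.min(nObservations, (long) MAX_OBSERVATIONS);
    }

    // Stand-in for library code that adds 1 to its argument.
    static int plusOne(int n) {
        return n + 1;
    }

    public static void main(String[] args) {
        // Without the cap, the addition wraps around to a negative number.
        System.out.println(plusOne(Integer.MAX_VALUE) < 0);                 // prints "true"
        // With the cap, the result stays positive.
        System.out.println(plusOne(capObservations(5_000_000_000L)) > 0);   // prints "true"
    }
}
```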
Tad Jordan
04e3978b04
Fixed VariantEval tests
...
-Added sorting by rows to VariantEval
2013-01-07 12:45:32 -05:00
Ryan Poplin
4f95f850b3
Bug fix in the HC's allele mapping for multi-allelic events. Using the allele alone as a key isn't sufficient because alleles change when the reference allele changes during VariantContextUtils.simpleMerge for multi-allelic events.
2013-01-07 11:05:44 -05:00
Ami Levy-Moonshine
d3c2c97fb2
Merge branch 'master' of github.com:broadinstitute/gsa-unstable
2013-01-06 23:35:47 -05:00
Ami Levy-Moonshine
c554d9db25
add TODO
2013-01-06 23:04:38 -05:00
Ami Levy-Moonshine
81eef3aa37
merge development branches of log-less HMM and FastGatherer to master
2013-01-06 23:01:58 -05:00
Eric Banks
0249e1f497
Resolving merge conflicts from VCF move
2013-01-06 14:32:31 -05:00
Eric Banks
8822b8e7c8
Moving HelpConstants out of HelpUtils so that we stop getting these ProgramElementDoc errors when com.sun.javadoc cannot load on a user's system.
2013-01-06 14:30:45 -05:00