Commit Graph

6646 Commits (38a2518668b684a820babedc8b0a23711581a0d1)

Author SHA1 Message Date
Mark DePristo 38a2518668 Merge branch 'master' into rodRefactor 2011-07-27 15:34:54 -04:00
Mark DePristo 60db6cc836 Warnings for old ROD system use.
Removed unused class GATKRODFeature
2011-07-27 12:39:12 -04:00
Mark DePristo 097828a466 ParsingEngine now maintains the list of rodBindings
No longer try to reparser objects to find the right fields
Direct support in RodBinding for getTags()
2011-07-27 11:36:53 -04:00
Mark DePristo 81f8e05bfa Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-07-26 17:35:46 -04:00
Mark DePristo f6a5e0e36a Go for global integrationtest path first, if possible. 2011-07-26 17:35:30 -04:00
Matt Hanna fec495e292 Fix a nasty little bug in the sharding system: if the last shard in contig n
overlaps exactly on disk with the first shard in contig n+1, the shards
would be merged together to avoid duplicate extraction.  Unfortunately,
the interval overlap filter couldn't handle shards spanning contigs, and
was choosing to filter out reads from contig n+1 which should have been
included.
I'm not completely sure why the BAM indexing code would ever specify that the
end of one chromosome had the same on-disk location as the start of the next
one.  I suspect that this is a indexer performance bug.
2011-07-26 15:43:20 -04:00
Mark DePristo 9dfb57168a RodBinding source is no longer assumed to be a file 2011-07-26 13:59:44 -04:00
Mark DePristo d0badd5bd6 RodBinding subclassed to VariantContextRodBinding for easy access to VariantContext providing RODs 2011-07-26 13:54:55 -04:00
Mark DePristo 7ab8b53339 Support for List<RodBinding> argument type 2011-07-26 11:37:31 -04:00
Mark DePristo 38969b9783 Prototype of RODBinding @Arguments instead of -B syntax
Initial version of RodBinding class.
Flow from walker Rodbinding @Arguments -> RMDTriplet (old system) -> GATK engine (standard).  Will need refactoring.
2011-07-26 11:09:06 -04:00
Matt Hanna 088fc39308 Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-07-25 15:54:56 -04:00
Matt Hanna 6fd6684e86 Merged bug fix from Stable into Unstable
Conflicts:
	build.xml
2011-07-25 15:21:17 -04:00
Matt Hanna 6f43284053 Switch to including our entire exception tree rather than just UserException. 2011-07-25 15:19:41 -04:00
Mark DePristo 3afcb3415d Max of 1000 records will be loaded and compared to avoid heap size problem. 2011-07-25 14:58:31 -04:00
Matt Hanna af0b5883c3 In unstable, add a reference to DocumentedGATKFeature to vcf.jar, which is now
a static dependency of UserExceptions.
2011-07-25 14:36:12 -04:00
Matt Hanna 6c49197de6 Merged bug fix from Stable into Unstable 2011-07-25 14:32:33 -04:00
Matt Hanna a00e302321 Fix formatting issue. 2011-07-25 14:31:40 -04:00
Mark DePristo 2a51543693 Actually should have been gone... 2011-07-25 13:27:42 -04:00
Mark DePristo ebfd8df06c Restoring accidentially deleted unit test 2011-07-25 13:25:30 -04:00
Mark DePristo f3049fba63 refdata directory cleanup
Removing unused files RODRecordIterator, ReferenceOrderedData, QueryableTrack, RMDTrackCreationException, GATKFeatureIterator, ReferenceOrderedDataUnitTest
Refactored dbSNP and refseq utilities to be closer to the other files implementing these features
2011-07-25 13:21:52 -04:00
Matt Hanna 8014fad6ff Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-07-25 13:20:44 -04:00
Matt Hanna 2ac490dbdf Fix improper detection of command-line arguments with missing values. 2011-07-25 13:20:00 -04:00
Mark DePristo 90947ab359 Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-07-25 12:53:56 -04:00
Mark DePristo 44bd9ae703 Restoring UninstantiableWalker, as it is not going to be possible to run ant test; ant gatkdocs without ant clean in between 2011-07-25 12:53:06 -04:00
Mark DePristo acda8eb09c Commented out test that causes new CommandLineGATK() to fail 2011-07-25 12:43:27 -04:00
Kiran V Garimella bbb8473f03 Merge branch 'master' of ssh://copper.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-07-25 10:59:00 -04:00
Mark DePristo 1a268ff1fd Refactor so that GenotypeAnnotation and InfoFieldAnnotation share common superclass VariantAnnotatorAnnotation 2011-07-25 10:55:09 -04:00
Mark DePristo 7f8e6a97ee InfoFieldAnnotation now an abstract class extended by annotations so doc system works 2011-07-25 10:47:11 -04:00
Guillermo del Angel 5db05d2873 Refactored Mills codecs so that they're in correct packages 2011-07-25 10:25:25 -04:00
Guillermo del Angel 115879b31f Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-07-25 10:13:16 -04:00
Mauricio Carneiro 4c6c16f895 Documented following the new gatkdoc framework 2011-07-25 00:25:08 -04:00
Mark DePristo 2039ce6102 Default values now displayed in arguments
DiffEngine fixed so that newInstance() would work.  Pretty quickly encountered a situation where newInstance() failed.  Debug output now written when this occurs in the log.
Logger now used instead of standard out, with INFO the default level.
2011-07-24 22:56:55 -04:00
Mark DePristo c43b5981f2 Hidden variables are hidden by default. Settable by command line option
DiffObjectsWalker test arguments removed.
Minor refactoring of GATKDoclet
2011-07-24 20:52:44 -04:00
Mark DePristo 1c1f1da349 Fixing compilation 2011-07-24 20:01:59 -04:00
Mark DePristo 9f06f6c493 Split GATKDoclet from ResourceBundleDoclet. Refactored GaTKDocWorkUnit 2011-07-24 20:00:04 -04:00
Guillermo del Angel e602166f78 First fully functional version outputting a full vcf 2011-07-24 19:38:59 -04:00
Mark DePristo 7b84347047 Main index is sorted by the template now. 2011-07-24 18:19:40 -04:00
Mark DePristo ff85687679 Merge branch 'master' into help 2011-07-24 18:14:32 -04:00
Mark DePristo 83996f7951 Enumerated types are working. 2011-07-24 18:14:21 -04:00
Mark DePristo 3c34e9fa65 Cleanup emuns and tables 2011-07-24 17:45:58 -04:00
Mark DePristo c620d96c96 Inline enum documentation is working 2011-07-24 17:22:14 -04:00
Mark DePristo 793e7d3d1d Improved header and argument details
Argument detail structure cleaned up. Only relevant pieces of information are shown now, and in a cleaner layout.
Misc. cleanup in the code.
2011-07-24 16:36:25 -04:00
Mark DePristo c6af4efcdc Implemented see also and version header 2011-07-24 16:10:17 -04:00
Mark DePristo b8db0510e0 Sytle sheet and common functions 2011-07-24 15:43:23 -04:00
Mark DePristo 5e0fe2d0f9 Support for style.css via refactored common.html included in all files 2011-07-24 15:42:39 -04:00
Mark DePristo d0ab6bf7a9 Now links to sub and superclass documentation, where possible. 2011-07-24 09:56:17 -04:00
Mark DePristo e2dabb70b8 Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable 2011-07-24 08:57:47 -04:00
Mauricio Carneiro 1ef964c92c Merge branch 'contig' 2011-07-24 02:40:42 -04:00
Mauricio Carneiro e0cb403b33 A read walker for contig statistics
I used this walker for my mtdna analysis where the goal was to see how the chromosomes were represented by the sequences in the bam files. It is very useful as a first look at a new dataset if you want to have an idea of where most of the reads fall. It reports the number of reads in each contig, percetages, enrichment as well as the expected number of reads for each contig and enrichment given the size of your dataset. I will document it accordingly with the new documentation tool. It's in public, and I'm happy to support it.
2011-07-24 02:36:17 -04:00
Mauricio Carneiro 7ffedf211c Contig comparator -- sorting contigs like Picard
This is very useful if you want to output your text files or manipulate data in the usual chromosome ordering :
 1
 2
 3
 ...
 21
 22
 X
 Y
 GL???
 ...

 Just use this comparator in any SortedSet class constructor and your data will be sorted like in the BAM file.
2011-07-24 02:33:19 -04:00