Commit Graph

111 Commits (9b5e5e06f94e602b97cf00280a4da47fa6a142dc)

Author SHA1 Message Date
depristo 9b5e5e06f9 Now supports checking that the input files exist and are good
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@117 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 16:40:54 +00:00
ebanks f3f1b47808 deal with reverse complemented reads
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@115 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 16:01:49 +00:00
asivache 9ec96414c7 git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@114 348d0f76-0448-11de-a6fe-93d51630548a 2009-03-20 15:54:29 +00:00
depristo 322f4b944f Better stress test
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@113 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 15:52:54 +00:00
asivache 3565b50ff5 main class (argument processing and traversing the reference) and implementation of all the Receiver functionality for building read piles over indels
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@112 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:18:04 +00:00
asivache 4c3b92b860 comparator for interval objects
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@111 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:15:13 +00:00
asivache f810412d75 equals(), hashCode() updated/added, also a few minor changes
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@110 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:13:07 +00:00
asivache 4badd54216 Indel also implements Interval interface but has its quirks
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@109 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:11:17 +00:00
asivache 501e92d441 an interface for an interval object and simple minimum implementation; note: in contrast to arachne, this is closed interval
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@108 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:09:56 +00:00
asivache 29d2d460f3 a trivial interface and even more trivial implementations that do nothing (ignore the data they receive)
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@107 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:08:15 +00:00
depristo b83c8319c7 Crushed subtle and potentially insidous bug in seeking within the fasta; a beer for anyone who can tell me the situation where this might arise...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@106 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 00:07:06 +00:00
depristo 34ee48fd82 Fixing output printing issues in the code, as well as adding more safety checks
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@105 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 23:02:49 +00:00
hanna 6fdd622160 Describe how GATK finds walkers. Change the example to avoid copying the class file into the walkers directory.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@104 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 22:41:12 +00:00
hanna 104e2811ec Configure the plugin directory.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@103 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 22:12:25 +00:00
andrewk 6bcdac5c62 Restructured AlleleFrequency classes into 3 classes: AlleleFrequencyWalker, AlleleFrequencyMetricsWalker, AlleleFrequencyEstimate. AlleleFrequencyMetricsWalker class now calls mapper function of AlleleFrequencyWalker and works with the result. AlleleFrequencyEstimate is now a separate class instead of a subclass of AlleleFrequencyWalker.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@102 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 22:06:01 +00:00
hanna 41fec1565c Hello, world! for GATK.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@101 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 21:46:22 +00:00
aaron 7bc45b68aa Added dependences on two libraries: the Colt package, which is a collection of high performance computing libraries from CERN; and Log4j, which will be our new logging platform.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@100 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 16:16:31 +00:00
andrewk 5fa99f430e One line format is useable and two levels of debug output are available (debug = 1: one line format, debug = 2: table of sampled probs for each locus). Class AlleleFrequencyMetrics computes %dbSNP and frequency of SNPs.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@99 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 15:05:05 +00:00
depristo f1034f3dfd Stress Test utility for pushing the GATK to its limits. Takes a list of sam files and runs Analyses on them all, optionally in the queue
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@98 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 03:15:00 +00:00
hanna 4242dba295 Remove endless iterator.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@97 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 23:53:40 +00:00
hanna 225ea64bd9 Moved extra walkers at Mark's request.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@96 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 23:52:08 +00:00
hanna ffb6f8f5da Move the basic gatk framework into the core subtree.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@95 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 23:39:00 +00:00
asivache 69316f1873 removed unused import statement
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@94 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:56:15 +00:00
asivache 875272e5c5 moved counted object to utils
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@93 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:54:04 +00:00
asivache e09af2ef70 changed variable declaration from concrete class to interface
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@92 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:50:47 +00:00
asivache 708ada3e99 an accessory for CountedObject: builds a comparator for CountedObject<T> given a comparator for T; compares the underlying objects T themselves, *not* the associated counters
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@91 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:45:54 +00:00
asivache 37101045af a simple wrapper class; less overhead than keeping a separate Integer counter object and going through object reallocation and/or autoboxing on each counter increment
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@90 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:44:30 +00:00
ebanks 45d2a9acd8 Added walker to print out a histogram of where mismatches occur in alignments
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@89 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 19:46:42 +00:00
hanna 1096bbd4d9 Moved build.xml, ivy.xml and settings to root of Sting repository.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@88 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 19:13:19 +00:00
hanna d46ee96269 Added support for loose Walker class files in walkers directory.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@87 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 17:32:24 +00:00
ebanks fe9e52c47e allow on fly sorting AND validation
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@86 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 15:50:17 +00:00
hanna bb94c853f8 Added WalkerManager -- a class that dynamically loads available walkers
from the jar file.  For now, added placeholder Walker interface so that
WalkerManager could work with classes of type Walker rather than classes 
of type Object.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@85 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 23:22:37 +00:00
ebanks d9fa04f65c Fixed logic
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@84 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 22:20:03 +00:00
ebanks 1aa3958644 Added ability to sort reads on the fly
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@83 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 20:29:09 +00:00
asivache 0362cb9e59 added Utils.filterInPlace() - purges elements directly from the passed collection object without creating new list for results
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@82 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 19:06:40 +00:00
hanna 58aa2aab43 Rough draft of patch to use bam indices when available.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@81 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 16:39:03 +00:00
asivache 151c37591e removed unnecessary import that produced a warning. where did it come from in the first place??
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@80 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 15:46:27 +00:00
depristo 478425b3d8 Better error messages
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@79 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 15:37:02 +00:00
depristo 0fd55d91d2 Fixed bug in unsafe mode
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@78 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 15:28:04 +00:00
ebanks c74bd871b1 added module for aligned reads
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@77 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 14:08:54 +00:00
ebanks 28cc670a92 Walker to print out a histogram of aligned reads per mismatches allowed
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@76 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 14:05:29 +00:00
depristo 9ae551e858 Lots of error checking added, fixed bugs associated with reading files out of order, added support for U (unsafe) flag for processing reads
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@75 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 23:22:04 +00:00
asivache 36b8b34490 Main tool that builds the clusters (multiple alignments) - so far; to be heavily refactored; most methods should find their proper homes in other classes
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@74 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 22:03:31 +00:00
asivache b9ffcdf047 matrix as the name suggests; utilizes special property (zeros at diagonal and below) to use less memory at the expense of slower access; this one is built directly on primitive data type (double) so it should not have any overhead associated with java classes
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@73 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 22:01:53 +00:00
asivache a17ed3cbf1 this class really computes (and keeps) a gapless pairwise alignment between the two sequences, ILT-style
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@72 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:59:26 +00:00
asivache 4972b03059 a class that keeps a pile of reads and can perform some simple computations on them; does not perform multiple alignments (so far) - external tools do the job
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@71 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:58:05 +00:00
asivache 6d481c64e7 just a square matrix of arbitrary stuff; the stuff must be full fledged Java type, however, not a primitive type. Hooray Java!
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@70 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:56:45 +00:00
asivache c68e0cc1fe Walks along the sequence and emits a sequence of subsequent, encoded Kmers (uses short int, so currently it's up to K=8)
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@68 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:54:45 +00:00
hanna 34d9af4702 Remove orphaned modules directory.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@67 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:53:05 +00:00
asivache 1e89dbfcb1 Sequence bundled with its Kmer-based lookup index (same thing as old lookup table)
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@66 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:52:57 +00:00