depristo
9b5e5e06f9
Now supports checking that the input files exist and are good
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@117 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 16:40:54 +00:00
ebanks
f3f1b47808
deal with reverse complemented reads
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@115 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 16:01:49 +00:00
asivache
9ec96414c7
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@114 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 15:54:29 +00:00
depristo
322f4b944f
Better stress test
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@113 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 15:52:54 +00:00
asivache
3565b50ff5
main class (argument processing and traversing the reference) and implementation of all the Receiver functionality for building read piles over indels
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@112 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:18:04 +00:00
asivache
4c3b92b860
comparator for interval objects
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@111 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:15:13 +00:00
asivache
f810412d75
equals(), hashCode() updated/added, also a few minor changes
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@110 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:13:07 +00:00
asivache
4badd54216
Indel also implements Interval interface but has its quirks
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@109 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:11:17 +00:00
asivache
501e92d441
an interface for an interval object and simple minimum implementation; note: in contrast to arachne, this is closed interval
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@108 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:09:56 +00:00
asivache
29d2d460f3
a trivial interface and even more trivial implementations that do nothing (ignore the data they receive)
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@107 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:08:15 +00:00
depristo
b83c8319c7
Crushed subtle and potentially insidous bug in seeking within the fasta; a beer for anyone who can tell me the situation where this might arise...
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@106 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 00:07:06 +00:00
depristo
34ee48fd82
Fixing output printing issues in the code, as well as adding more safety checks
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@105 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 23:02:49 +00:00
hanna
6fdd622160
Describe how GATK finds walkers. Change the example to avoid copying the class file into the walkers directory.
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@104 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 22:41:12 +00:00
hanna
104e2811ec
Configure the plugin directory.
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@103 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 22:12:25 +00:00
andrewk
6bcdac5c62
Restructured AlleleFrequency classes into 3 classes: AlleleFrequencyWalker, AlleleFrequencyMetricsWalker, AlleleFrequencyEstimate. AlleleFrequencyMetricsWalker class now calls mapper function of AlleleFrequencyWalker and works with the result. AlleleFrequencyEstimate is now a separate class instead of a subclass of AlleleFrequencyWalker.
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@102 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 22:06:01 +00:00
hanna
41fec1565c
Hello, world! for GATK.
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@101 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 21:46:22 +00:00
aaron
7bc45b68aa
Added dependences on two libraries: the Colt package, which is a collection of high performance computing libraries from CERN; and Log4j, which will be our new logging platform.
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@100 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 16:16:31 +00:00
andrewk
5fa99f430e
One line format is useable and two levels of debug output are available (debug = 1: one line format, debug = 2: table of sampled probs for each locus). Class AlleleFrequencyMetrics computes %dbSNP and frequency of SNPs.
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@99 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 15:05:05 +00:00
depristo
f1034f3dfd
Stress Test utility for pushing the GATK to its limits. Takes a list of sam files and runs Analyses on them all, optionally in the queue
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@98 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 03:15:00 +00:00
hanna
4242dba295
Remove endless iterator.
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@97 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 23:53:40 +00:00
hanna
225ea64bd9
Moved extra walkers at Mark's request.
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@96 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 23:52:08 +00:00
hanna
ffb6f8f5da
Move the basic gatk framework into the core subtree.
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@95 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 23:39:00 +00:00
asivache
69316f1873
removed unused import statement
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@94 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:56:15 +00:00
asivache
875272e5c5
moved counted object to utils
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@93 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:54:04 +00:00
asivache
e09af2ef70
changed variable declaration from concrete class to interface
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@92 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:50:47 +00:00
asivache
708ada3e99
an accessory for CountedObject: builds a comparator for CountedObject<T> given a comparator for T; compares the underlying objects T themselves, *not* the associated counters
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@91 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:45:54 +00:00
asivache
37101045af
a simple wrapper class; less overhead than keeping a separate Integer counter object and going through object reallocation and/or autoboxing on each counter increment
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@90 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:44:30 +00:00
ebanks
45d2a9acd8
Added walker to print out a histogram of where mismatches occur in alignments
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@89 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 19:46:42 +00:00
hanna
1096bbd4d9
Moved build.xml, ivy.xml and settings to root of Sting repository.
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@88 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 19:13:19 +00:00
hanna
d46ee96269
Added support for loose Walker class files in walkers directory.
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@87 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 17:32:24 +00:00
ebanks
fe9e52c47e
allow on fly sorting AND validation
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@86 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 15:50:17 +00:00
hanna
bb94c853f8
Added WalkerManager -- a class that dynamically loads available walkers
...
from the jar file. For now, added placeholder Walker interface so that
WalkerManager could work with classes of type Walker rather than classes
of type Object.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@85 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 23:22:37 +00:00
ebanks
d9fa04f65c
Fixed logic
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@84 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 22:20:03 +00:00
ebanks
1aa3958644
Added ability to sort reads on the fly
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@83 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 20:29:09 +00:00
asivache
0362cb9e59
added Utils.filterInPlace() - purges elements directly from the passed collection object without creating new list for results
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@82 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 19:06:40 +00:00
hanna
58aa2aab43
Rough draft of patch to use bam indices when available.
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@81 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 16:39:03 +00:00
asivache
151c37591e
removed unnecessary import that produced a warning. where did it come from in the first place??
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@80 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 15:46:27 +00:00
depristo
478425b3d8
Better error messages
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@79 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 15:37:02 +00:00
depristo
0fd55d91d2
Fixed bug in unsafe mode
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@78 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 15:28:04 +00:00
ebanks
c74bd871b1
added module for aligned reads
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@77 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 14:08:54 +00:00
ebanks
28cc670a92
Walker to print out a histogram of aligned reads per mismatches allowed
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@76 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 14:05:29 +00:00
depristo
9ae551e858
Lots of error checking added, fixed bugs associated with reading files out of order, added support for U (unsafe) flag for processing reads
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@75 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 23:22:04 +00:00
asivache
36b8b34490
Main tool that builds the clusters (multiple alignments) - so far; to be heavily refactored; most methods should find their proper homes in other classes
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@74 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 22:03:31 +00:00
asivache
b9ffcdf047
matrix as the name suggests; utilizes special property (zeros at diagonal and below) to use less memory at the expense of slower access; this one is built directly on primitive data type (double) so it should not have any overhead associated with java classes
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@73 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 22:01:53 +00:00
asivache
a17ed3cbf1
this class really computes (and keeps) a gapless pairwise alignment between the two sequences, ILT-style
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@72 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:59:26 +00:00
asivache
4972b03059
a class that keeps a pile of reads and can perform some simple computations on them; does not perform multiple alignments (so far) - external tools do the job
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@71 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:58:05 +00:00
asivache
6d481c64e7
just a square matrix of arbitrary stuff; the stuff must be full fledged Java type, however, not a primitive type. Hooray Java!
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@70 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:56:45 +00:00
asivache
c68e0cc1fe
Walks along the sequence and emits a sequence of subsequent, encoded Kmers (uses short int, so currently it's up to K=8)
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@68 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:54:45 +00:00
hanna
34d9af4702
Remove orphaned modules directory.
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@67 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:53:05 +00:00
asivache
1e89dbfcb1
Sequence bundled with its Kmer-based lookup index (same thing as old lookup table)
...
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@66 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:52:57 +00:00