Commit Graph

66 Commits (3e350006e084e71d954c1640c95af4deb7d9a29e)

Author SHA1 Message Date
kiran 3e350006e0 Added a directory to house some Illumina output parsers. Hopefully this will be merged back into Picard at some point.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@126 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-21 17:55:56 +00:00
asivache 497eea2e5c minor changes and shuffling code around; also, now when realigned piles are printed they are sorted by start position
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@125 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-21 17:43:49 +00:00
hanna 1fcf4c0cbf Update picard to work with new samtools.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@123 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 21:51:26 +00:00
jmaguire 5dca560c3c A bunch of refactoring, and more on the way.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@122 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 21:31:07 +00:00
hanna b806a9cf68 Updated for new version of samtools, which returns a sequence dictionary
rather than a simple list of sequences.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@121 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 20:38:24 +00:00
ebanks c5433a3120 dumps out base qualities per position for use in making boxplots
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@119 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 17:01:18 +00:00
jmaguire 1161c261ac made all data members public.
switched logOddsVarRef to LOD.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@118 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 16:44:17 +00:00
ebanks f3f1b47808 deal with reverse complemented reads
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@115 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 16:01:49 +00:00
asivache 9ec96414c7 git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@114 348d0f76-0448-11de-a6fe-93d51630548a 2009-03-20 15:54:29 +00:00
asivache 3565b50ff5 main class (argument processing and traversing the reference) and implementation of all the Receiver functionality for building read piles over indels
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@112 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:18:04 +00:00
asivache 4c3b92b860 comparator for interval objects
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@111 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:15:13 +00:00
asivache f810412d75 equals(), hashCode() updated/added, also a few minor changes
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@110 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:13:07 +00:00
asivache 4badd54216 Indel also implements Interval interface but has its quirks
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@109 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:11:17 +00:00
asivache 501e92d441 an interface for an interval object and simple minimum implementation; note: in contrast to arachne, this is closed interval
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@108 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:09:56 +00:00
asivache 29d2d460f3 a trivial interface and even more trivial implementations that do nothing (ignore the data they receive)
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@107 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-20 05:08:15 +00:00
andrewk 6bcdac5c62 Restructured AlleleFrequency classes into 3 classes: AlleleFrequencyWalker, AlleleFrequencyMetricsWalker, AlleleFrequencyEstimate. AlleleFrequencyMetricsWalker class now calls mapper function of AlleleFrequencyWalker and works with the result. AlleleFrequencyEstimate is now a separate class instead of a subclass of AlleleFrequencyWalker.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@102 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 22:06:01 +00:00
andrewk 5fa99f430e One line format is useable and two levels of debug output are available (debug = 1: one line format, debug = 2: table of sampled probs for each locus). Class AlleleFrequencyMetrics computes %dbSNP and frequency of SNPs.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@99 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-19 15:05:05 +00:00
hanna 4242dba295 Remove endless iterator.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@97 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 23:53:40 +00:00
hanna 225ea64bd9 Moved extra walkers at Mark's request.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@96 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 23:52:08 +00:00
hanna ffb6f8f5da Move the basic gatk framework into the core subtree.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@95 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 23:39:00 +00:00
asivache 69316f1873 removed unused import statement
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@94 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:56:15 +00:00
asivache 875272e5c5 moved counted object to utils
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@93 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:54:04 +00:00
asivache e09af2ef70 changed variable declaration from concrete class to interface
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@92 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:50:47 +00:00
asivache 708ada3e99 an accessory for CountedObject: builds a comparator for CountedObject<T> given a comparator for T; compares the underlying objects T themselves, *not* the associated counters
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@91 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:45:54 +00:00
asivache 37101045af a simple wrapper class; less overhead than keeping a separate Integer counter object and going through object reallocation and/or autoboxing on each counter increment
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@90 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 21:44:30 +00:00
ebanks 45d2a9acd8 Added walker to print out a histogram of where mismatches occur in alignments
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@89 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 19:46:42 +00:00
hanna d46ee96269 Added support for loose Walker class files in walkers directory.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@87 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 17:32:24 +00:00
ebanks fe9e52c47e allow on fly sorting AND validation
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@86 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-18 15:50:17 +00:00
hanna bb94c853f8 Added WalkerManager -- a class that dynamically loads available walkers
from the jar file.  For now, added placeholder Walker interface so that
WalkerManager could work with classes of type Walker rather than classes 
of type Object.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@85 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 23:22:37 +00:00
ebanks d9fa04f65c Fixed logic
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@84 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 22:20:03 +00:00
ebanks 1aa3958644 Added ability to sort reads on the fly
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@83 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 20:29:09 +00:00
asivache 0362cb9e59 added Utils.filterInPlace() - purges elements directly from the passed collection object without creating new list for results
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@82 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 19:06:40 +00:00
hanna 58aa2aab43 Rough draft of patch to use bam indices when available.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@81 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 16:39:03 +00:00
asivache 151c37591e removed unnecessary import that produced a warning. where did it come from in the first place??
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@80 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 15:46:27 +00:00
depristo 478425b3d8 Better error messages
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@79 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 15:37:02 +00:00
depristo 0fd55d91d2 Fixed bug in unsafe mode
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@78 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 15:28:04 +00:00
ebanks c74bd871b1 added module for aligned reads
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@77 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 14:08:54 +00:00
ebanks 28cc670a92 Walker to print out a histogram of aligned reads per mismatches allowed
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@76 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-17 14:05:29 +00:00
depristo 9ae551e858 Lots of error checking added, fixed bugs associated with reading files out of order, added support for U (unsafe) flag for processing reads
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@75 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 23:22:04 +00:00
asivache 36b8b34490 Main tool that builds the clusters (multiple alignments) - so far; to be heavily refactored; most methods should find their proper homes in other classes
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@74 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 22:03:31 +00:00
asivache b9ffcdf047 matrix as the name suggests; utilizes special property (zeros at diagonal and below) to use less memory at the expense of slower access; this one is built directly on primitive data type (double) so it should not have any overhead associated with java classes
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@73 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 22:01:53 +00:00
asivache a17ed3cbf1 this class really computes (and keeps) a gapless pairwise alignment between the two sequences, ILT-style
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@72 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:59:26 +00:00
asivache 4972b03059 a class that keeps a pile of reads and can perform some simple computations on them; does not perform multiple alignments (so far) - external tools do the job
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@71 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:58:05 +00:00
asivache 6d481c64e7 just a square matrix of arbitrary stuff; the stuff must be full fledged Java type, however, not a primitive type. Hooray Java!
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@70 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:56:45 +00:00
asivache c68e0cc1fe Walks along the sequence and emits a sequence of subsequent, encoded Kmers (uses short int, so currently it's up to K=8)
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@68 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:54:45 +00:00
hanna 34d9af4702 Remove orphaned modules directory.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@67 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:53:05 +00:00
asivache 1e89dbfcb1 Sequence bundled with its Kmer-based lookup index (same thing as old lookup table)
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@66 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 21:52:57 +00:00
andrewk 685fc8bd61 Partial implementation of single sample allele calling
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@64 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 19:30:42 +00:00
depristo 8ccbcc4101 Mismatch counter
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@62 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 15:51:48 +00:00
depristo 3b5003bd11 Added support for accessing the reference in read traversal
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@60 348d0f76-0448-11de-a6fe-93d51630548a
2009-03-16 14:46:19 +00:00