Commit Graph

70 Commits (06fc5eecf80df9360894e971c8513faefdc8086e)

Author SHA1 Message Date
aaron e27951ab39 re-updating the VCF code to handle spaces in sample names
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3528 348d0f76-0448-11de-a6fe-93d51630548a
2010-06-10 20:18:34 +00:00
aaron ad98512f6c adding changes so that we look at the headers already loaded by the engine for samples and other VCF utils, and not create readers for each file to get them (this caused Tribble to regenerate indices if the index file can't be written to disk).
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3518 348d0f76-0448-11de-a6fe-93d51630548a
2010-06-09 17:21:12 +00:00
aaron 6febd0291d rev tribble to include some dbsnp clean-up and fixes
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3510 348d0f76-0448-11de-a6fe-93d51630548a
2010-06-09 03:08:31 +00:00
aaron 6d5556939d updating Tribble with a couple of important Tabix fixes, and updating the variant eval integration tests to run each test with both plain vcf and gzipped tabix (added the tabix version to the validation directory), using the same md5sum.



git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3509 348d0f76-0448-11de-a6fe-93d51630548a
2010-06-09 01:47:04 +00:00
hanna c1ecf75dd5 Update to the latest rev of the picard sharding patch. Includes updates reflecting
the imminent move of IlluminaUtil into picard public.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3493 348d0f76-0448-11de-a6fe-93d51630548a
2010-06-06 20:33:21 +00:00
aaron 0b03e28b60 updating the tribble library to include the reference dictionary reading / writing. We now check the dictionaries of any tracks that have them against the reference (all new tribble tracks and out-of-date tracks will have this). Also renamed some classes to be more reflective of their function.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3485 348d0f76-0448-11de-a6fe-93d51630548a
2010-06-04 06:34:26 +00:00
ebanks ffeb3fd80d Thanks to Guillermo, I found a bug in the Unified Genotyper output: GL was posteriors instead of likelihoods. Not a huge deal because the
priors were flat, but fixed nonetheless.
Also, needed to update Tribble.
Minor updates to the Beagle input maker.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3461 348d0f76-0448-11de-a6fe-93d51630548a
2010-05-28 19:28:26 +00:00
aaron 98350da177 rev tribble
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3456 348d0f76-0448-11de-a6fe-93d51630548a
2010-05-28 14:52:39 +00:00
aaron f7c9f131ea revisioning tribble to version 85, which includes tabix and bug fixes
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3428 348d0f76-0448-11de-a6fe-93d51630548a
2010-05-25 04:28:32 +00:00
kshakir e9ee55d7dd A cleaned up functioning early, early access version of Queue for others to play with and provide feedback about next steps.
Current version only has syntactic sugar for accessing the graph via rules ex. "bam" -> "bam.bai", "samtools index ${bam}" and DOES NOT have sugar for constructing your own graph.
Usage info on the internal wiki at https://iwww.broadinstitute.org/gsa/wiki/index.php/Queue


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3420 348d0f76-0448-11de-a6fe-93d51630548a
2010-05-23 20:21:09 +00:00
aaron 7cfb9ff3dc updates for Tribble 82, fixes for Ryan's case where multiple processes would attempt to read/write to the same index, and a couple other Tribble-centric bug fixes.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3382 348d0f76-0448-11de-a6fe-93d51630548a
2010-05-18 19:34:45 +00:00
aaron 2c55ac1374 fixes for parallel processing problems with Tribble, a small bug in the resource pool, and some more documentation.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3349 348d0f76-0448-11de-a6fe-93d51630548a
2010-05-12 06:13:26 +00:00
hanna 76efa757f0 Switched over to reviewed version of Picard patch. In process, did some optimization to the IntervalSharder
which improved startup time 5-10x when dynamically merging many BAMs.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3331 348d0f76-0448-11de-a6fe-93d51630548a
2010-05-08 14:12:22 +00:00
aaron 7d2df3f511 example windowed ROD walker for Kristian, and updates to Tribble
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3325 348d0f76-0448-11de-a6fe-93d51630548a
2010-05-07 17:12:50 +00:00
aaron 78409dca0d turned off the progress output from tribble when making an index, and fixing a case where the index file isn't writable so we instead make the index in memory.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3312 348d0f76-0448-11de-a6fe-93d51630548a
2010-05-06 16:36:58 +00:00
aaron d91b27aca1 updating Tribble with VCF changes from Eric
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3310 348d0f76-0448-11de-a6fe-93d51630548a
2010-05-06 04:03:21 +00:00
aaron 7467ec2fd6 updating the reflections library; Matt found a problem where the reflections library doesn't sort out non-java objects from the classpath (affects only OS X so far). I'll push back the changes to
the reflections library people.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3307 348d0f76-0448-11de-a6fe-93d51630548a
2010-05-06 02:08:41 +00:00
aaron 97dd04cbf0 updating Tribble ahead of the big VCF commit
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3297 348d0f76-0448-11de-a6fe-93d51630548a
2010-05-05 05:17:54 +00:00
aaron 447081583a rev tribble with updated version
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3287 348d0f76-0448-11de-a6fe-93d51630548a
2010-05-03 04:07:28 +00:00
aaron b648e89096 updating Tribble with a bunch of bug and performance fixes found while performance testing GeliText in the GATK
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3267 348d0f76-0448-11de-a6fe-93d51630548a
2010-04-28 18:45:10 +00:00
aaron 64c5f287c5 fixes for edge-cases when using reflections to find classes outside of the main jar. Will push as a patch to reflections
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3264 348d0f76-0448-11de-a6fe-93d51630548a
2010-04-27 17:46:46 +00:00
aaron c647153b10 Adding Jama for Ryan.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3262 348d0f76-0448-11de-a6fe-93d51630548a
2010-04-27 14:30:36 +00:00
aaron f6468f9143 a fix for a bug we've worked around in the reflections package: previously it didn't find classes that weren't in the main jar. Fixed in this version.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3261 348d0f76-0448-11de-a6fe-93d51630548a
2010-04-27 04:49:49 +00:00
hanna 4bb8984f80 Updating picard -- switching to Alec's more robust fix for gzip decompression issue
and updating serialization components.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3154 348d0f76-0448-11de-a6fe-93d51630548a
2010-04-12 19:56:45 +00:00
hanna b60197ae10 Another round of cleanup and simplification in Picard -- Picard's unit tests
are now passing for my branch.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3100 348d0f76-0448-11de-a6fe-93d51630548a
2010-03-31 01:02:59 +00:00
hanna 400684542c Revisions to take into account finalization of Picard patch: naming changes, better definition
of public interfaces.  This won't be the last Picard patch, but it should be the last big one.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3096 348d0f76-0448-11de-a6fe-93d51630548a
2010-03-30 19:28:14 +00:00
hanna 85037ab13f Fix for Kiran's sharding issue (Invalid GZIP header). General cleanup of
Picard patch, including move of some of the Picard private classes we use to Picard public.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3087 348d0f76-0448-11de-a6fe-93d51630548a
2010-03-29 03:21:27 +00:00
hanna 46c14ec63f New, much less memory intensive implementation of BAM file sharding. Streams indices together with the expectation
that bins will be present in the bin sparse array, which avoids the problem of having to hold the sparse bin array
stored in every BAM file index in memory at the same time.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3075 348d0f76-0448-11de-a6fe-93d51630548a
2010-03-25 17:41:22 +00:00
hanna 1f451e17e5 Changing preloaded index to only "preload" reference sequences on demand.
Results in drastic lowering of startup cost when multiple BAM files are 
merged.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3066 348d0f76-0448-11de-a6fe-93d51630548a
2010-03-23 22:02:28 +00:00
hanna 884a577013 Phase 2 of Picard patch refactoring: kill off SAMFileReader2/BAMFileReader2, merging the changes back into the base classes.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3065 348d0f76-0448-11de-a6fe-93d51630548a
2010-03-23 16:48:11 +00:00
hanna 169d0c6e8f Up the svn revision number in an attempt to force an update, again due to an
artifact of the way we build picard-private-parts.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3051 348d0f76-0448-11de-a6fe-93d51630548a
2010-03-19 22:39:56 +00:00
hanna c0eb5c27ea Lower memory support for merged sharding. Merged sharding is still not available.
WARNING: If you update frequently, you might have to rm -rf ~/.ant/cache -- this is an unfortunate side effect of the way we distribute picard-private.jar.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3050 348d0f76-0448-11de-a6fe-93d51630548a
2010-03-19 22:03:47 +00:00
hanna 96662d8d1b Moving from GATK dependencies on isolated classes checked into the GATK
codebase to a dependency on a jar file compiled from my private picard branch.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@3034 348d0f76-0448-11de-a6fe-93d51630548a
2010-03-18 17:43:42 +00:00
aaron 8fd3351971 adding a stripped down Tribble library for the start of integration
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2851 348d0f76-0448-11de-a6fe-93d51630548a
2010-02-17 21:29:25 +00:00
alecw 3feead019d New Picard and SAM-JDK with automatic sequence name truncation
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2557 348d0f76-0448-11de-a6fe-93d51630548a
2010-01-11 20:49:13 +00:00
alecw b5e5e27225 New versions of picard-private, sam and picard jars for TileCovariate and regeneration of NM tag
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2456 348d0f76-0448-11de-a6fe-93d51630548a
2009-12-28 22:18:55 +00:00
hanna d1d6b79673 Update picard public and private to match new SAM update.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2438 348d0f76-0448-11de-a6fe-93d51630548a
2009-12-23 22:34:12 +00:00
alecw 159778416c In TableRecalibrationWalker, update UQ tag if it was present in the original SAMRecord. This required a new sam.jar, which caused some other files to need to be changed.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2435 348d0f76-0448-11de-a6fe-93d51630548a
2009-12-23 21:42:36 +00:00
alecw 84921b18ed Push version number of picard-private-parts
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2163 348d0f76-0448-11de-a6fe-93d51630548a
2009-11-25 19:28:57 +00:00
alecw 5f2801e015 Push version number of picard-private-parts
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2162 348d0f76-0448-11de-a6fe-93d51630548a
2009-11-25 19:28:42 +00:00
alecw d563f4bd2c Add IlluminaUtil to picard-private.jar
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2161 348d0f76-0448-11de-a6fe-93d51630548a
2009-11-25 19:25:21 +00:00
hanna 1fcd28bba9 Update xml descriptor with proper version number.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2090 348d0f76-0448-11de-a6fe-93d51630548a
2009-11-19 17:32:22 +00:00
alecw 4082f4677e Picard-private with classes for reading Picard dbSNP binary file
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2086 348d0f76-0448-11de-a6fe-93d51630548a
2009-11-19 17:24:28 +00:00
alecw 7463904428 Picard-private with classes for reading Picard dbSNP binary file
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2085 348d0f76-0448-11de-a6fe-93d51630548a
2009-11-19 17:23:37 +00:00
alecw d1325909f7 Picard-public 1.08
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2084 348d0f76-0448-11de-a6fe-93d51630548a
2009-11-19 17:22:56 +00:00
alecw db793ac870 Picard-public 1.08
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2083 348d0f76-0448-11de-a6fe-93d51630548a
2009-11-19 17:21:00 +00:00
hanna 8eff1cc436 Extract and include only the Tim Fennell-approved parts of picard private.
Hopefully this is a temporary solution and these classes will be migrated
into picard-public.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2041 348d0f76-0448-11de-a6fe-93d51630548a
2009-11-13 19:42:33 +00:00
hanna 8145ed4672 Take 2, updating picard with bug fix for bam files containing no reads.
Just stomped on the existing md5s because that's what Eric told me to do.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2029 348d0f76-0448-11de-a6fe-93d51630548a
2009-11-12 22:52:08 +00:00
hanna 8406325247 New Picard is breaking one of the integration tests.
Revert until we find out whether the cause is legit.


git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2017 348d0f76-0448-11de-a6fe-93d51630548a
2009-11-11 03:59:32 +00:00
hanna bae4d3f7ea Updated Picard with fix for Doug Voet. Thanks Alec.
git-svn-id: file:///humgen/gsa-scr1/gsa-engineering/svn_contents/trunk@2015 348d0f76-0448-11de-a6fe-93d51630548a
2009-11-11 02:01:08 +00:00