Commit Graph

8769 Commits (aa3638ecb3db2bfcaf80fd37c45f63887824a261)

Author SHA1 Message Date
Christopher Hartl aa3638ecb3 Merge branch 'master' of ssh://chartl@ni.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2012-02-03 13:42:09 -05:00
Mark DePristo 53e6d66601 analyzeRunReports pushes full stack trace to DB for reporting
-- Minor parsing changes to handle this.
2012-02-03 12:32:33 -05:00
Mark DePristo 79da5ca2ae Parses major and minor GATK versions now. Added unit tests 2012-02-03 12:32:31 -05:00
Eric Banks 3abfbcbcf2 Generalized the TDT for multi-allelic events 2012-02-03 12:23:21 -05:00
Eric Banks f596377e73 Oops. Forgot that some samples are new and the data directory doesn't already exist. Now try to create it if it's not already present. 2012-02-02 12:39:49 -05:00
Christopher Hartl 27ea6426a4 Small script to chunk up a VCF into equal-sized chunks 2012-02-02 12:29:03 -05:00
Christopher Hartl 0111505ea9 Terrible. Swapping the paternal and sample ids. 2012-02-02 11:41:16 -05:00
Christopher Hartl 0c562756eb Add a memory limit so this thing doesn't get killed on the farm 2012-02-02 10:30:09 -05:00
Ryan Poplin 1f50f6970b Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2012-02-02 10:17:13 -05:00
Ryan Poplin 4ed06801a7 Updating HaplotypeCaller's HMM calc to use GOP as a function of the read instead of a function of the haplotype in preparation for IQSR 2012-02-02 10:17:04 -05:00
Eric Banks d1a1888d2f Merge branch 'master' of ssh://gsa1.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2012-02-02 09:58:22 -05:00
Eric Banks 48220700eb Adding lftp to the supported download protocols for the 1000G sync. Initial results look very good! Bringing down chrom11 and chrom20 files now. 2012-02-02 09:58:03 -05:00
Christopher Hartl 45bf2562cc . 2012-02-02 09:11:17 -05:00
Christopher Hartl f8c5406084 Add the ability to extract samples 2012-02-02 09:06:39 -05:00
Matt Hanna 8adfc79123 Merged bug fix from Stable into Unstable 2012-02-01 16:07:41 -05:00
Matt Hanna 30b937d2af Fix bug discovered in FGTP branch in which BlockInputStream returns -1 in cases where some data could be read,
but not all the data requested by the caller.
2012-02-01 16:06:22 -05:00
Ryan Poplin 2109122cdd merging branches 2012-02-01 14:37:24 -05:00
Ryan Poplin bc6abc94e9 Bug fix for check of isMateUnmapped. Requires check of mate is mapped. 2012-02-01 14:32:19 -05:00
Ryan Poplin a46a29501b Marking unused code in HC for removal. 2012-02-01 13:22:17 -05:00
Christopher Hartl b567ed8793 Use the right reference path :( 2012-02-01 12:35:18 -05:00
Christopher Hartl 87a63d54d6 fix the script! 2012-02-01 12:05:29 -05:00
Mauricio Carneiro 45da892ecc Better exceptions to catch malformed reads
* throw exceptions in LocusIteratorByState when hitting reads starting or ending with deletions
2012-02-01 11:56:19 -05:00
Christopher Hartl 810996cfca Introducing: VariantsToPed, the world's most annoying walker! And also a busted QScript to run it that I need Khalid's help debugging ( frownie face ). Note that VariantsToPed and PlinkSeq generate the same binary file (up to strand flips...thanks PlinkSeq), so I know it's working properly. Hooray! 2012-02-01 10:39:03 -05:00
Christopher Hartl 25d943f706 Merge branch 'master' of ssh://chartl@ni.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2012-02-01 10:32:11 -05:00
Ryan Poplin e8528bc526 updating HaplotypeCaller integration tests 2012-02-01 09:43:19 -05:00
Ryan Poplin dc23265640 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2012-02-01 09:22:43 -05:00
Menachem Fromer 579627568e Limit to 3 ALT alleles 2012-01-31 23:39:39 -05:00
Mauricio Carneiro 08c7c07f25 Added the option of not compressing read names to ReduceReads
* When scatter/gathering, name compression cannot guarantee uniqueness. If uniqueness is important, it is recommended to turn compression off for scatter/gathering ReduceReads.
2012-01-31 17:14:57 -05:00
Ryan Poplin 056b24ccd6 Resolving merge conflicts with LocusIteratorByState 2012-01-31 16:13:32 -05:00
Ryan Poplin febc634557 Changing PileupElement's isSoftClipped to isNextToSoftClip since soft clipped bases aren't actually added to pileups, oops. Removing the intrinsic clustered variants filter from the HaplotypeCaller 2012-01-31 16:06:14 -05:00
Matt Hanna 7f70612beb Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable 2012-01-31 11:59:25 -05:00
Matt Hanna a630db1703 Oops...HierarchicalMicroScheduler was transforming any exception from the walker level into a ReviewedStingException.
Thanks to Ryan for pointing this out.
2012-01-31 11:58:21 -05:00
Mauricio Carneiro a7f5d26326 No more synthetic reads starting/ending with deletions
bug reported by Kristian Cibulskis that we were generating filtered data synthetic reads with leading deletions. Added integration test.
2012-01-31 11:41:36 -05:00
Mark DePristo 2f2f039c37 Better flow for byNegTrainingFraction 2012-01-31 10:49:46 -05:00
Mark DePristo d8a4d78854 Bugfix for exceptions with unknown source whose error was not being shown in tableau 2012-01-31 10:49:06 -05:00
Christopher Hartl faba3dd530 Merge branch 'master' of ssh://chartl@ni.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2012-01-31 10:25:29 -05:00
Mauricio Carneiro 17dbe9a95d A few cleanups in the LocusIteratorByState
* No more N's in the extended event pileups
   * Only add to the pileup MQ0 counter if the read actually goes into the pileup
2012-01-31 09:40:51 -05:00
Menachem Fromer e7ace8efc4 Fix NullPointerException caused in cases with too many ALT alleles 2012-01-30 21:00:16 -05:00
Ryan Poplin f9162ea705 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2012-01-30 19:45:19 -05:00
Ryan Poplin abb91cf26b Increasing the size of the active regions that are produced by the active probability integrator, more context is needed to call more complex events 2012-01-30 15:36:12 -05:00
Mauricio Carneiro d5d4fa8a88 Fixed discordance bug reported by Brad Chapman
discordance now reports discordance between genotypes as well (just like concordance)
2012-01-30 09:50:45 -05:00
Menachem Fromer f1e07f169e Only apply filters if there are filters to apply 2012-01-30 02:22:13 -05:00
Menachem Fromer d1aa5204d7 Merge branch 'master' of ssh://copper.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2012-01-29 23:39:34 -05:00
Menachem Fromer 3186f0f1b0 Try more memory and fewer ALT alleles so that we don't run out of memory 2012-01-29 23:38:32 -05:00
Mark DePristo 3164c8dee5 S3 upload now directly creates the XML report in memory and puts that in S3
-- This is a partial fix for the problem with uploading S3 logs reported by Mauricio.  There the problem is that the java.io.tmpdir is not accessible (network just hangs).  Because of that the s3 upload fails because the underlying system uses tmpdir for caching, etc.  As far as I can tell there's no way around this bug -- you cannot overload the java.io.tmpdir programmatically and even if I could what value would we use?  The only solution seems to me is to detect that tmpdir is hanging (how?!) and fail with a meaningful error.
2012-01-29 15:14:58 -05:00
Menachem Fromer 0e17cbbce9 Merged bug fix from Stable into Unstable 2012-01-27 16:03:16 -05:00
Menachem Fromer a9671b73ca Fix to permit proper handling of mapping qualities between 128 to 255 (which get converted to byte values of -128 to -1) 2012-01-27 16:01:30 -05:00
Ryan Poplin f7ac1f4a69 Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable 2012-01-27 15:12:55 -05:00
Ryan Poplin fc08235ff3 Bug fix in active region traversal, locusView.getNext() skips over pileups with zero coverage but still need to count them in the active probability integrator 2012-01-27 15:12:37 -05:00
Mauricio Carneiro 052a4bdb9c Turning off PHONE HOME option in the MDCP
* MDCP is for internal use and there is no need to report to the Amazon cloud.
   * Reporting to ASW_S3 is not allowing jobs to finish, this is probably a bug.
2012-01-27 11:13:30 -05:00