gatk-3.8

Commit Graph

Author	SHA1	Message	Date
Mark DePristo	dd75ad9f49	95% PedReader -- Passes significiant unit tests -- Implicit sample creation for mom / dad when you create single samples -- Continuing cleanup of Sample and SampleDataSource	2011-09-30 18:03:34 -04:00
Mark DePristo	84160bd83f	Reorganization of Sample -- Moved Gender and Afflication to separate public enums -- PedReader 90% implemented -- Improve interface cleanup to XReadLines and UserException	2011-09-30 15:50:54 -04:00
Mark DePristo	c1cf6bc45a	PEDReader should be in samples	2011-09-30 14:22:19 -04:00
Mark DePristo	56f10b40a8	Fixing test bugs for WindowMaker that required empty sample list	2011-09-30 14:18:27 -04:00
Mark DePristo	810e8ad011	Removed getXByReaders() function from the engine -- These could be simplied in their downstream uses -- Or they could be replaced with a generic getSAMFileHeaders() function and then apply the getSamples(header) as desired downstream	2011-09-30 10:43:51 -04:00
Mark DePristo	178ba24c27	Move getSamplesForSamFile to SampleUtils -- A nearly identical piece of code already lived in SampleUtils. Now there are two functions, one taking a regular header and another grabbing the merged header from the GATK engine itself. Much cleaner	2011-09-30 10:28:18 -04:00
Mark DePristo	30d23942b1	Renamed ReadBackedPileup getXSampleName() functions to getXSample -- now that we don't have Sample objects floating around we don't have to have all of the Name extensions on our functions	2011-09-30 10:02:57 -04:00
Mark DePristo	3289a325fc	Removed final use of Sample in RBP	2011-09-30 09:57:39 -04:00
Mark DePristo	a69a4dda2f	SamplesDB no longer has null sample -- Updated getSamples().size() == 2 test in CallableLociWalker that really ensured there was one sample in the system	2011-09-30 09:56:23 -04:00
Mark DePristo	e055a78f6e	LIBS now requires at least one sample be present -- UnitTest provides a "null" sample for matching the reads without read groups	2011-09-30 09:49:35 -04:00
Mark DePristo	9860a2c989	Merge branch 'master' into ped	2011-09-30 09:28:18 -04:00
Mark DePristo	a881d6f145	Now only generates the poly VCF with select variants if the file doesn't exist	2011-09-30 08:42:09 -04:00
Mark DePristo	d901fed617	Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-09-30 08:41:44 -04:00
Mauricio Carneiro	cabacf028d	Intermediate commit to fix interval skipping may need additional testing.	2011-09-29 18:45:12 -04:00
Mark DePristo	b71b51751e	Bug fix for UnitTest -- Provide the null sample to the LIBS, as this seems to be required for correctly passing this unit test -- Will be fixed in a future update	2011-09-29 17:30:01 -04:00
Mark DePristo	1765fbeb6b	Merge branch 'master' into ped	2011-09-29 17:18:51 -04:00
Mark DePristo	98ecaf8aa0	Support for ReducedReads with reduced counts and average quals -- ReadUtils and UnitTest updated to support new byte[] style -- Removed unnecessary read transformer in PairHMM	2011-09-29 17:18:39 -04:00
Mauricio Carneiro	9508220157	fixed hard clipping both ends inside deletion If both ends of the interval falls within a deletion in the read then hardClipBothEnds would cut the right tail first including the entire deletion, then fail to cut the left tail because there would not be any bases there anymore. Fixed.	2011-09-29 15:36:49 -04:00
Mark DePristo	9458f01409	Test cleanup of Sample object	2011-09-29 15:13:05 -04:00
Mark DePristo	625ffb6a07	LocusIteratorByState and ReadBackedPileups no long use Sample	2011-09-29 14:52:11 -04:00
Mark DePristo	b3a2371925	Merge branch 'master' into ped	2011-09-29 14:32:17 -04:00
Mark DePristo	68761a6e28	Removed sample from header	2011-09-29 14:13:05 -04:00
Mauricio Carneiro	a5e75cd14c	Outputting both consensus base qualities and counts The base qualities of a consensus reads are now the average quality of the bases forming the consensus base (most common base) and the consensus quality tag now carry an array with the counts of each base in the consensus. This should increase file size but improve calling sensitivity/specificity.	2011-09-29 12:54:41 -04:00
Mauricio Carneiro	d62f2f33bc	Added indel specific context size parameter Parameter was added to the framework but implementing the functionality is pending.	2011-09-29 12:54:41 -04:00
Mark DePristo	505416b6c0	Merge branch 'master' into ped	2011-09-29 12:22:39 -04:00
Mauricio Carneiro	21c4abdd36	Disabling all SlidingReadUnitTests	2011-09-29 12:20:35 -04:00
Mauricio Carneiro	4086fa768f	Disabling all ReadClipperUnitTests	2011-09-29 12:20:35 -04:00
Mark DePristo	9536845e35	Cleaning up unused code in MV	2011-09-29 12:20:07 -04:00
Mark DePristo	5043d76c3d	Removing more bad uses of SampleDataSource creation	2011-09-29 12:16:34 -04:00
Mark DePristo	5c9227cf5e	Further cleanup of Sample database -- Removing more and more unnecessary code -- Partial removal of type safe Sample usage. On the road to SampleDB only	2011-09-29 11:50:05 -04:00
Khalid Shakir	6dec932ca9	Merge branch 'master' of ssh://gsa3.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-09-29 11:47:13 -04:00
Khalid Shakir	c08468eb9d	A couple of updates while trying to get desired R 2.13 compactPDF support. preQC: - For R 2.13 when parsing fingerprints explicitly coercing the text before parsing - Added LOD geom_line() at +/-3 based on Tim's presentation at PM meeting (ppt to go to pipeline wiki asap) - PF_INDEL_RATE of zero replaced with NA - NA's are not "violations" auto filter samples since 0+NA = NA, and subset test only looks for 0 violations - Restored plots for MEAN_READ_LENGTH, BAD_CYCLES, and MEDIAN_INSERT_SIZE by explicitly print()'ing the created plots postQC: - Fixed R 2.13 font scaling by moving size out of aes, except when using highlighting - TODO: Don't know how to scale by aes for highlighting and use a smaller overall font size outside aes	2011-09-29 11:21:50 -04:00
Mark DePristo	2a0cd556d3	Further cleanup of Sample -- Cleaned up interface functions in GAE -- Added Walker.getSampleDB() function which is an easier option for tools to get the samples db	2011-09-29 10:34:51 -04:00
Mark DePristo	e76f381628	Moved sample package from DataSources to gatk, and renamed it samples -- All associated changes to the codebase are just header updates	2011-09-29 09:57:15 -04:00
Mark DePristo	e197dcd1f3	Pre-cleanup commit of Sample and SampleDataSource -- SampleDataSource has all reader functionality disabled	2011-09-29 09:44:18 -04:00
Mark DePristo	4d31673cc5	No longer supporting YAML file allows us to delete 75% of the sample's codebase	2011-09-29 09:43:31 -04:00
Mauricio Carneiro	fc86cd6fd8	Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/carneiro/gatk/RR into rr	2011-09-29 00:12:15 -04:00
Roger Zurawicki	4fd5630f6a	Added ReadClipper Unit Test * Includes tests that include HardClip to Read and Reference Coords. * Changed ReadUtils.HardClipByReferenceCoordinates from private to protected to allow for testing	2011-09-28 23:13:50 -04:00
Mauricio Carneiro	f49a12de6b	Updating latest changes from the repository to reduce reads repo	2011-09-28 22:31:57 -04:00
Matt Hanna	9272ed03b5	Merged bug fix from Stable into Unstable	2011-09-28 21:26:43 -04:00
Matt Hanna	0acaf2df65	Fix an embarrassing issue where a specific configuration of minimal coverage over small intervals could cause reads to be dropped from the pileup. Nothing to see here...	2011-09-28 21:23:01 -04:00
Roger Zurawicki	07b0a75d96	Added SlidingRead Unit Test Includes test clipStart and trimToVariableRegion	2011-09-28 21:22:57 -04:00
Khalid Shakir	c5f1a4325f	Updated preQC: - full 8.5x11 - concating multiple initiatives / bait_sets - Using NA instead of python None when WR dates are unavailable - In new aggregations where the sample may have per library metrics, only using the sample level metrics, i.e. library is null Updated postQC: - Renamed some variables to assist with traceback() - Fixed crashes on batches with two alleles or two samples such as Seminara_MC_1_09222011 or Engle_MC_2_09222011 - Added dependency tracking to PostCallingQC.scala so that the R script does try to run before the evals are complete Other minor cleanup. Tried to use R 2.13 compactPDF but a few issues to work out with fingerprint boxplots in preQC and geom_text font size in postQC.	2011-09-28 20:23:30 -04:00
Mauricio Carneiro	edf852d47d	Adding lists to ReduceReads script script can handle single file or list of files separately now. Always scatter/gathering.	2011-09-28 18:40:30 -04:00
Mauricio Carneiro	64e7b3000c	Fix read spans deletion through the entire interval if the read has a deletion that spans the entire length of the interval, it should not be added to mapped reads.	2011-09-28 18:40:30 -04:00
Mauricio Carneiro	a93ece07e3	ScatterGatherable reduce reads script Get your reduce read in a matter of seconds...	2011-09-28 18:40:30 -04:00
Guillermo del Angel	c8d3a720f9	Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-09-28 18:17:34 -04:00
Guillermo del Angel	7e3cb45093	Further performance optim in banded hmm, about 60% speed improvement over current implementation now	2011-09-28 16:27:28 -04:00
Ryan Poplin	1b1ca80df2	Merge branch 'master' of ssh://nickel.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-09-28 16:17:39 -04:00
Ryan Poplin	3b73dc89fe	Making several esoteric arguments in the BQSR @Hidden. Adding basic support for Complete Genomics machine cycle.	2011-09-28 16:17:31 -04:00

1 2 3 4 5 ...

7663 Commits (dd75ad9f49ed3a07dded0cc4eab7318cd60fda1e) All Branches Search

7663 Commits (dd75ad9f49ed3a07dded0cc4eab7318cd60fda1e)

All Branches