gatk-3.8

Commit Graph

Author	SHA1	Message	Date
Mark DePristo	ffdfdcde3f	Updating MD5s -- Interval test now uses RG containing BAM -- DoC sample name ordering has changed.	2011-10-04 15:54:45 -07:00
Mark DePristo	a45d985818	TODO method stubs	2011-10-04 15:54:09 -07:00
Mark DePristo	463eab7604	All MD5 mismatches for test are shown -- Now for tests like DoC, with 20 output md5s, you see all of the differences before failing.	2011-10-04 15:53:52 -07:00
Mark DePristo	e1d6c7a50a	Updating MD5 that have changed due to sample ordering differences	2011-10-04 09:33:23 -07:00
Mark DePristo	343a7b6b2f	Updating UG integration tests for arbitrary impact of sample order changes on downsampling	2011-10-04 08:14:00 -07:00
Mark DePristo	fee89e47ff	Only throws an error when there are no samples but there are reads -- Handles the case when you are running a ROD traversal and yet the LIBS is still used to return null everywhere.	2011-10-04 06:50:54 -07:00
Mark DePristo	f552aede42	Only provide the sample names in the BAM file for efficiency	2011-10-04 06:50:12 -07:00
Mark DePristo	a27641e1fc	Cleaned up imports	2011-10-04 06:28:36 -07:00
Mark DePristo	b20689ff55	No longer supports extraProperties -- the underlying data structure is still present, but until I decide what to do for the extensible system I've completely disabled the subsystem -- Added code to merge Samples, so that a mostly full record can be merged with a consistent empty record. If the two records are inconsistent, an error is thrown -- addSample() in Sample.class now invokes mergeSample() when appropriate -- Validation types are now only STRICT or SILENT -- Validation code implemented in SampleDBBuilder -- Extensive unit tests for SampleDBBuilder	2011-10-03 19:20:33 -07:00
Mark DePristo	867a7476c1	Systematic unit tests for the sample object	2011-10-03 19:09:02 -07:00
Mark DePristo	2e3dc52088	Minor function renaming	2011-10-03 14:41:13 -07:00
Mark DePristo	dd71884b0c	On path to SampleDB engine integration -- PedReader tag parser -- Separation of SampleDBBuilder from SampleDB (now immutable) -- Removed old sample engine arguments	2011-10-03 12:08:07 -07:00
Mark DePristo	8ee0f91904	Remove residual processing tracker arguments	2011-10-03 09:50:01 -07:00
Mark DePristo	89ac50e86e	SampleDataSource -> SampleDB	2011-10-03 09:33:30 -07:00
Mark DePristo	93fba06cb5	Support for whitespace only lines	2011-10-03 09:30:10 -07:00
Mark DePristo	0604ce55d1	PedReader support for ; separated lines, not only newline	2011-10-03 09:19:58 -07:00
Mark DePristo	52f670c8b8	100% version of PedReader -- Passes all unit tests -- Added unit tests for missing fields	2011-10-03 06:12:58 -07:00
Mark DePristo	dd75ad9f49	95% PedReader -- Passes significiant unit tests -- Implicit sample creation for mom / dad when you create single samples -- Continuing cleanup of Sample and SampleDataSource	2011-09-30 18:03:34 -04:00
Mark DePristo	84160bd83f	Reorganization of Sample -- Moved Gender and Afflication to separate public enums -- PedReader 90% implemented -- Improve interface cleanup to XReadLines and UserException	2011-09-30 15:50:54 -04:00
Mark DePristo	c1cf6bc45a	PEDReader should be in samples	2011-09-30 14:22:19 -04:00
Mark DePristo	56f10b40a8	Fixing test bugs for WindowMaker that required empty sample list	2011-09-30 14:18:27 -04:00
Mark DePristo	810e8ad011	Removed getXByReaders() function from the engine -- These could be simplied in their downstream uses -- Or they could be replaced with a generic getSAMFileHeaders() function and then apply the getSamples(header) as desired downstream	2011-09-30 10:43:51 -04:00
Mark DePristo	178ba24c27	Move getSamplesForSamFile to SampleUtils -- A nearly identical piece of code already lived in SampleUtils. Now there are two functions, one taking a regular header and another grabbing the merged header from the GATK engine itself. Much cleaner	2011-09-30 10:28:18 -04:00
Mark DePristo	30d23942b1	Renamed ReadBackedPileup getXSampleName() functions to getXSample -- now that we don't have Sample objects floating around we don't have to have all of the Name extensions on our functions	2011-09-30 10:02:57 -04:00
Mark DePristo	3289a325fc	Removed final use of Sample in RBP	2011-09-30 09:57:39 -04:00
Mark DePristo	a69a4dda2f	SamplesDB no longer has null sample -- Updated getSamples().size() == 2 test in CallableLociWalker that really ensured there was one sample in the system	2011-09-30 09:56:23 -04:00
Mark DePristo	e055a78f6e	LIBS now requires at least one sample be present -- UnitTest provides a "null" sample for matching the reads without read groups	2011-09-30 09:49:35 -04:00
Mark DePristo	9860a2c989	Merge branch 'master' into ped	2011-09-30 09:28:18 -04:00
Mark DePristo	a881d6f145	Now only generates the poly VCF with select variants if the file doesn't exist	2011-09-30 08:42:09 -04:00
Mark DePristo	d901fed617	Merge branch 'master' of ssh://gsa1/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-09-30 08:41:44 -04:00
Mauricio Carneiro	cabacf028d	Intermediate commit to fix interval skipping may need additional testing.	2011-09-29 18:45:12 -04:00
Mark DePristo	b71b51751e	Bug fix for UnitTest -- Provide the null sample to the LIBS, as this seems to be required for correctly passing this unit test -- Will be fixed in a future update	2011-09-29 17:30:01 -04:00
Mark DePristo	1765fbeb6b	Merge branch 'master' into ped	2011-09-29 17:18:51 -04:00
Mark DePristo	98ecaf8aa0	Support for ReducedReads with reduced counts and average quals -- ReadUtils and UnitTest updated to support new byte[] style -- Removed unnecessary read transformer in PairHMM	2011-09-29 17:18:39 -04:00
Mauricio Carneiro	9508220157	fixed hard clipping both ends inside deletion If both ends of the interval falls within a deletion in the read then hardClipBothEnds would cut the right tail first including the entire deletion, then fail to cut the left tail because there would not be any bases there anymore. Fixed.	2011-09-29 15:36:49 -04:00
Mark DePristo	9458f01409	Test cleanup of Sample object	2011-09-29 15:13:05 -04:00
Mark DePristo	625ffb6a07	LocusIteratorByState and ReadBackedPileups no long use Sample	2011-09-29 14:52:11 -04:00
Mark DePristo	b3a2371925	Merge branch 'master' into ped	2011-09-29 14:32:17 -04:00
Mark DePristo	68761a6e28	Removed sample from header	2011-09-29 14:13:05 -04:00
Mauricio Carneiro	a5e75cd14c	Outputting both consensus base qualities and counts The base qualities of a consensus reads are now the average quality of the bases forming the consensus base (most common base) and the consensus quality tag now carry an array with the counts of each base in the consensus. This should increase file size but improve calling sensitivity/specificity.	2011-09-29 12:54:41 -04:00
Mauricio Carneiro	d62f2f33bc	Added indel specific context size parameter Parameter was added to the framework but implementing the functionality is pending.	2011-09-29 12:54:41 -04:00
Mark DePristo	505416b6c0	Merge branch 'master' into ped	2011-09-29 12:22:39 -04:00
Mauricio Carneiro	21c4abdd36	Disabling all SlidingReadUnitTests	2011-09-29 12:20:35 -04:00
Mauricio Carneiro	4086fa768f	Disabling all ReadClipperUnitTests	2011-09-29 12:20:35 -04:00
Mark DePristo	9536845e35	Cleaning up unused code in MV	2011-09-29 12:20:07 -04:00
Mark DePristo	5043d76c3d	Removing more bad uses of SampleDataSource creation	2011-09-29 12:16:34 -04:00
Mark DePristo	5c9227cf5e	Further cleanup of Sample database -- Removing more and more unnecessary code -- Partial removal of type safe Sample usage. On the road to SampleDB only	2011-09-29 11:50:05 -04:00
Khalid Shakir	6dec932ca9	Merge branch 'master' of ssh://gsa3.broadinstitute.org/humgen/gsa-scr1/gsa-engineering/git/unstable	2011-09-29 11:47:13 -04:00
Khalid Shakir	c08468eb9d	A couple of updates while trying to get desired R 2.13 compactPDF support. preQC: - For R 2.13 when parsing fingerprints explicitly coercing the text before parsing - Added LOD geom_line() at +/-3 based on Tim's presentation at PM meeting (ppt to go to pipeline wiki asap) - PF_INDEL_RATE of zero replaced with NA - NA's are not "violations" auto filter samples since 0+NA = NA, and subset test only looks for 0 violations - Restored plots for MEAN_READ_LENGTH, BAD_CYCLES, and MEDIAN_INSERT_SIZE by explicitly print()'ing the created plots postQC: - Fixed R 2.13 font scaling by moving size out of aes, except when using highlighting - TODO: Don't know how to scale by aes for highlighting and use a smaller overall font size outside aes	2011-09-29 11:21:50 -04:00
Mark DePristo	2a0cd556d3	Further cleanup of Sample -- Cleaned up interface functions in GAE -- Added Walker.getSampleDB() function which is an easier option for tools to get the samples db	2011-09-29 10:34:51 -04:00

1 2 3 4 5 ...

7680 Commits (ffdfdcde3ff3340d822693cb27efc8f7b6aaeeb4) All Branches Search

7680 Commits (ffdfdcde3ff3340d822693cb27efc8f7b6aaeeb4)

All Branches