gatk-3.8/public/java/test/org/broadinstitute/sting/gatk
David Roazen 605a5ac2e3 GATK engine: add ability to do on-the-fly BAM file sample renaming at runtime
-User must provide a mapping file via new --sample_rename_mapping_file argument.
 Mapping file must contain a mapping from absolute bam file path to new sample name
 (format is described in the docs for the argument).

-Requires that each bam file listed in the mapping file contain only one sample
 in its header (it may contain multiple read groups for that sample, however).
 The engine enforces this, and throws a UserException if on-the-fly renaming is
 requested for a multi-sample bam.

-Not all bam files for a traversal need to be listed in the mapping file.

-On-the-fly renaming is done as the VERY first step after creating the SAMFileReaders
 in SAMDataSource (before the headers are even merged), to prevent possible consistency
 issues.

-Renaming is done ONCE at traversal start, each time a SAMReaders resource is created
 in the SAMResourcePool; this effectively means once per -nt thread.

-Comprehensive unit/integration tests
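
The mapping and single-sample rules listed above can be sketched roughly as follows. This is a minimal illustration, not the engine's code: the class and method names are hypothetical, the two-column whitespace-separated layout is an assumption (the authoritative format is in the docs for --sample_rename_mapping_file), and standard Java exceptions stand in for the engine's UserException.

```java
import java.nio.file.Paths;
import java.util.Arrays;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Optional;
import java.util.Set;

// Hypothetical sketch; none of these names come from the GATK code base.
final class SampleRenameMapSketch {

    // Parses lines of the ASSUMED form "<absolute bam path> <new sample name>".
    static Map<String, String> parse(List<String> lines) {
        Map<String, String> mapping = new LinkedHashMap<>();
        for (String raw : lines) {
            String line = raw.trim();
            if (line.isEmpty()) {
                continue; // skip blank lines
            }
            String[] fields = line.split("\\s+");
            if (fields.length != 2) {
                throw new IllegalArgumentException("Expected 2 fields: " + raw);
            }
            if (!Paths.get(fields[0]).isAbsolute()) {
                throw new IllegalArgumentException("Path is not absolute: " + fields[0]);
            }
            if (mapping.put(fields[0], fields[1]) != null) {
                throw new IllegalArgumentException("Bam listed twice: " + fields[0]);
            }
        }
        return mapping;
    }

    // Returns the new sample name for a bam, or empty if the bam is not in the
    // mapping (unlisted bams are left alone). Rejects multi-sample headers.
    static Optional<String> newSampleNameFor(Map<String, String> mapping,
                                             String absoluteBamPath,
                                             Set<String> samplesInHeader) {
        String newName = mapping.get(absoluteBamPath);
        if (newName == null) {
            return Optional.empty();
        }
        if (samplesInHeader.size() != 1) {
            // The engine throws a UserException here; this sketch uses a stand-in.
            throw new IllegalStateException(
                "On-the-fly renaming requested for multi-sample bam: " + absoluteBamPath);
        }
        return Optional.of(newName);
    }

    public static void main(String[] args) {
        Map<String, String> mapping =
            parse(Arrays.asList("/data/first.bam RenamedSample"));
        System.out.println(newSampleNameFor(
            mapping, "/data/first.bam", Collections.singleton("OriginalSample")));
        System.out.println(newSampleNameFor(
            mapping, "/data/other.bam", Collections.singleton("OriginalSample")));
    }
}
```

The lookup is keyed on the path string exactly as written in the mapping file, which is the behaviour the known issue with symlinks follows from.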

Known issues:
-If the mapping file gives the absolute path to a bam, and that same bam is passed to -I
 through a path containing a SYMLINK, the renaming won't work. The absolute paths will
 look different to the engine because the symlink is present in one path and not in
 the other.
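
The symlink issue can be reproduced outside the engine with plain java.nio calls. The sketch below is only an illustration of why two absolute paths to the same file compare unequal; the class and method names are hypothetical, it assumes a filesystem that permits creating symlinks, and it does not describe what the engine does or any fix applied in this commit.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical sketch; not part of the GATK code base.
final class SymlinkPathSketch {

    // Compares paths the way an absolute-path lookup would: symlinks are NOT resolved.
    static boolean sameAbsolutePath(Path a, Path b) {
        return a.toAbsolutePath().normalize().equals(b.toAbsolutePath().normalize());
    }

    // Compares paths after resolving symlinks with Path.toRealPath().
    static boolean sameRealPath(Path a, Path b) {
        try {
            return a.toRealPath().equals(b.toRealPath());
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Creates a temporary file plus a symlink to it and returns
    // { sameAbsolutePath, sameRealPath } for that pair.
    static boolean[] demo() {
        try {
            Path dir = Files.createTempDirectory("rename-sketch");
            Path bam = Files.createFile(dir.resolve("sample.bam"));
            Path link = Files.createSymbolicLink(dir.resolve("link.bam"), bam);
            try {
                return new boolean[] { sameAbsolutePath(bam, link), sameRealPath(bam, link) };
            } finally {
                Files.delete(link); // removes the link itself, not its target
                Files.delete(bam);
                Files.delete(dir);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        boolean[] result = demo();
        System.out.println("equal as absolute paths: " + result[0]);
        System.out.println("equal as real paths:     " + result[1]);
    }
}
```

Until the engine handles this case, writing the mapping file with the same path form that is passed to -I avoids the mismatch.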

GSA-974 #resolve
2013-07-18 15:48:42 -04:00
datasources GATK engine: add ability to do on-the-fly BAM file sample renaming at runtime 2013-07-18 15:48:42 -04:00
downsampling Exclude reduced reads from elimination during downsampling 2013-06-11 16:16:26 -04:00
executive Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
filters This commit addresses JIRA issue GSA-948: Prevent users from doing the wrong thing with RNA-Seq data and the GATK. 2013-06-10 10:44:42 -04:00
iterators SAMDataSource: always consolidate cigar strings into canonical form 2013-06-19 13:29:01 -04:00
refdata Detect stuck lock-acquisition calls, and disable file locking for tests 2013-04-24 22:49:02 -04:00
report Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
samples Trivial update to ceutrio.ped file to make it really the CEU trio sample names 2013-05-14 17:08:13 -04:00
traversals GATK engine: add ability to do on-the-fly BAM file sample renaming at runtime 2013-07-18 15:48:42 -04:00
walkers SAMDataSource: always consolidate cigar strings into canonical form 2013-06-19 13:29:01 -04:00
CommandLineGATKUnitTest.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
EngineFeaturesIntegrationTest.java GATK engine: add ability to do on-the-fly BAM file sample renaming at runtime 2013-07-18 15:48:42 -04:00
GenomeAnalysisEngineUnitTest.java GATK engine: add ability to do on-the-fly BAM file sample renaming at runtime 2013-07-18 15:48:42 -04:00
MaxRuntimeIntegrationTest.java Subshard timeouts in the GATK 2013-05-15 07:00:39 -04:00
ReadMetricsUnitTest.java GATK engine: add ability to do on-the-fly BAM file sample renaming at runtime 2013-07-18 15:48:42 -04:00
WalkerManagerUnitTest.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00