gatk-3.8/public/java/test/org/broadinstitute/sting/utils/sam
Ami Levy-Moonshine 6da53aea09 Write a new tool for splitting reads that have N elements in their CIGAR string.
For example, this tool can be used for processing bowtie RNA-seq data.
Each read with k N-cigar elements is split into k+1 reads. The split is done by hard clipping the rest of the bases.
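
As an illustration of the k to k+1 rule above, here is a minimal sketch that works on CIGAR strings only. It is not the walker's actual code: the class name NCigarSplitSketch and the method split are hypothetical, and the sketch assumes the input CIGAR has no hard clips of its own and no leading, trailing, or adjacent N elements. A real implementation would also have to adjust alignment starts, bases, and qualities.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, for illustration only (not part of GATK).
class NCigarSplitSketch {
    private static final Pattern ELEMENT = Pattern.compile("(\\d+)([MIDNSHP=X])");

    // True for CIGAR operators that consume read bases.
    private static boolean consumesReadBases(char op) {
        return op == 'M' || op == 'I' || op == 'S' || op == '=' || op == 'X';
    }

    // Splits a CIGAR at every N element. Each output CIGAR keeps one section
    // and replaces the read bases of all other sections with hard clips (H).
    static List<String> split(String cigar) {
        List<StringBuilder> sections = new ArrayList<>();
        List<Integer> readLengths = new ArrayList<>();
        sections.add(new StringBuilder());
        readLengths.add(0);

        Matcher m = ELEMENT.matcher(cigar);
        while (m.find()) {
            int len = Integer.parseInt(m.group(1));
            char op = m.group(2).charAt(0);
            if (op == 'N') {
                // An N element closes the current section and opens a new one.
                sections.add(new StringBuilder());
                readLengths.add(0);
                continue;
            }
            int last = sections.size() - 1;
            sections.get(last).append(len).append(op);
            if (consumesReadBases(op)) {
                readLengths.set(last, readLengths.get(last) + len);
            }
        }

        int total = 0;
        for (int n : readLengths) {
            total += n;
        }

        List<String> result = new ArrayList<>();
        int before = 0; // read bases belonging to earlier sections
        for (int i = 0; i < sections.size(); i++) {
            int after = total - before - readLengths.get(i);
            StringBuilder sb = new StringBuilder();
            if (before > 0) sb.append(before).append('H');
            sb.append(sections.get(i));
            if (after > 0) sb.append(after).append('H');
            result.add(sb.toString());
            before += readLengths.get(i);
        }
        return result;
    }
}
```

Under these assumptions, NCigarSplitSketch.split("3M2N4M") yields the two CIGARs 3M4H and 3H4M: one N element, two output reads.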

To do this, a few changes were made to some other clipping methods:
- make a significant change in ClippingOp.hardClip() that prevents the splitting of a read with cigar: 1M2I1N1M3I.
- change getReadCoordinateForReferenceCoordinate in ReadUtils to recognize Ns

create unit tests for that walker:
- change ReadClipperTestUtils to be more general in order to use its code and avoid code duplication
- move some useful methods from ReadClipperTestUtils to CigarUtils

create an integration test for that class

small change in a comment in FullProcessingPipeline

last commit:

Address review comments:
- move to protected under walkers/rnaseq
- change the read splitting methods to be more readable and more efficient
- change (minor changes) some methods in ReadClipper to allow the changes in split reads
- add (minor change) one method to CigarUtils to allow the changes in split reads
- change ReadUtils.getReadCoordinateForReferenceCoordinate to include possible N in the cigar
- address the rest of the review comments (minor changes)

- fix ReadUtilsUnitTest.testReadWithNs according to the default behaviour of getReadCoordinateForReferenceCoordinate (when the reference index falls into a deletion, return the read index of the base before the deletion).
- add another test to ReadUtilsUnitTest.testReadWithNs
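
The default behaviour described above can be sketched as follows. This is only an illustration, not the ReadUtils implementation: the class RefToReadCoordinateSketch and the method readIndexFor are hypothetical, and treating an N element the same way as a deletion is an assumption made here. The sketch takes a 1-based alignment start for the first reference-consuming element and returns a 0-based read index, or -1 when the coordinate is not covered.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, for illustration only (not part of GATK).
class RefToReadCoordinateSketch {
    private static final Pattern ELEMENT = Pattern.compile("(\\d+)([MIDNSHP=X])");

    static int readIndexFor(int alignmentStart, String cigar, int refCoord) {
        int refPos = alignmentStart; // reference coordinate at the start of the next element
        int readIdx = 0;             // 0-based index of the next read base

        Matcher m = ELEMENT.matcher(cigar);
        while (m.find()) {
            int len = Integer.parseInt(m.group(1));
            char op = m.group(2).charAt(0);
            switch (op) {
                case 'M': case '=': case 'X':
                    // Aligned bases: direct offset into the read.
                    if (refCoord >= refPos && refCoord < refPos + len) {
                        return readIdx + (refCoord - refPos);
                    }
                    refPos += len;
                    readIdx += len;
                    break;
                case 'D': case 'N':
                    // No read base exists here: fall back to the base before the gap.
                    // (Assumption: N is handled like D. Yields -1 if the gap comes first.)
                    if (refCoord >= refPos && refCoord < refPos + len) {
                        return readIdx - 1;
                    }
                    refPos += len;
                    break;
                case 'I': case 'S':
                    // Consumes read bases only.
                    readIdx += len;
                    break;
                default:
                    // H and P consume neither read bases nor reference positions.
                    break;
            }
        }
        return -1; // coordinate not covered by the alignment
    }
}
```

For a read starting at reference position 10 with cigar 3M2N4M, position 14 lies inside the N element, so the sketch returns index 2, the last base before the gap; position 15 maps to index 3.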

- Allow the user to print the split positions (not working properly currently)
2014-01-01 22:21:36 -05:00
AlignmentUtilsUnitTest.java Reworking of the dangling tails merging code. 2013-06-11 12:53:04 -04:00
ArtificialBAMBuilderUnitTest.java Remove org.apache.commons.collections.IteratorUtils dependency from the test suite 2013-08-21 19:44:02 -04:00
ArtificialPatternedSAMIteratorUnitTest.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
ArtificialSAMFileWriterUnitTest.java PrintReads writes a header when used with -BQSR 2013-02-14 22:19:14 -05:00
ArtificialSAMQueryIteratorUnitTest.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
ArtificialSAMUtilsUnitTest.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
ArtificialSingleSampleReadStreamUnitTest.java Updated all JAVA file licenses accordingly 2013-01-10 17:06:41 -05:00
GATKSAMRecordUnitTest.java Update expected test output for Java 7 2013-05-01 16:18:01 -04:00
MisencodedBaseQualityUnitTest.java Fix improper dependencies in QScripts used by pipeline tests, and attempt to fix the flawed MisencodedBaseQualityUnitTest 2013-02-27 04:45:53 -05:00
ReadUtilsUnitTest.java Write a new tool for splitting reads that have N elements in their CIGAR string. 2014-01-01 22:21:36 -05:00