gatk3的最后一个经典版本3.8
 
 
 
 
Go to file
Guillermo del Angel 55d5f2194c Read Error Corrector for haplotype assembly
Principle is simple: when coverage is deep enough, any single-base read error will look like a rare k-mer but correct sequence will be supported by many reads to correct sequences will look like common k-mers. So, algorithm has 3 main steps:
1. K-mer graph buildup.
For each read in an active region, a map from k-mers to the number of times they have been seen is built.
2. Building correction map.
All "rare" k-mers that are sparse (by default, seen only once), get mapped to k-mers that are good (by default, seen at least 20 times but this is a CL argument), and that lie within a given Hamming distance (by default, =1). This map can be empty (i.e. k-mers can be uncorrectable).
3. Correction proposal
For each constituent k-mer of each read, if this k-mer is rare and maps to a good k-mer, get differing base positions in k-mer and add these to a list of corrections for each base in each read. Then, correct read at positions where correction proposal is unanimous and non-empty.

The algorithm defaults are chosen to be very stringent and conservative in the correction: we only try to correct singleton k-mers, we only look for good k-mers lying at Hamming distance = 1 from them, and we only correct a base in read if all correction proposals are congruent.

By default, algorithm is disabled but can be enabled in HaplotypeCaller via the -readErrorCorrect CL option. However, at this point it's about 3x-10x more expensive so it needs to be optimized if it's to be used.
2013-06-11 12:26:24 -04:00
licensing Removing the symlink from the private license file 2013-02-09 12:57:44 -05:00
protected Read Error Corrector for haplotype assembly 2013-06-11 12:26:24 -04:00
public Implement ActiveRegionTraversal RefMetaDataTracker for map call; HaplotypeCaller now annotates ID from dbSNP 2013-06-10 16:20:31 -04:00
settings Rev picard, sam-jdk, tribble, and variant to version 1.91.1453 2013-05-07 14:21:15 -04:00
.gitignore Detect stuck lock-acquisition calls, and disable file locking for tests 2013-04-24 22:49:02 -04:00
build.xml Restore scala compilation by default in build.xml 2013-05-31 11:28:29 -04:00
intellij_example.tar.bz2 Removed the intellij files from the root and made an example package for new users. This allows users to start at the same page and then change it as they see fit without interfering with the repo (thanks guillermo!) 2012-09-27 11:04:56 -04:00
ivy.xml Brought all of ReduceReads to fastutils 2013-02-23 22:53:23 -05:00