Heng Li
19cb7cd7ed
r388: cleanup mem_process_seqs() interface
...
Print output outside the function and allow the caller to supply the insert size distribution.
2013-04-26 12:31:18 -04:00
Heng Li
53bb846407
r373: optionally disable mate rescue
2013-04-09 16:13:55 -04:00
Heng Li
b0a76884e8
r340: feature freeze; updated the manpage
...
I will stop adding new features to bwa and prepare for the next release. I will
briefly evaluate the variant calling accuracy before the release.
2013-03-07 11:51:23 -05:00
Heng Li
3e3236dfc4
r337: mem - always read an even number of reads
...
In the old code, we could read an odd number of reads from an interleaved fastq.
2013-03-07 11:00:15 -05:00
Heng Li
5fbd454682
r332: added output threshold
...
Otherwise there are far too many short hits
2013-03-05 22:49:38 -05:00
Heng Li
07921659cf
move mem_fill_scmat() to bwa.{h,c}
2013-03-05 09:38:12 -05:00
Heng Li
efd9769b07
r324: a little code cleanup
...
The changes after r317 aim to improve the performance and accuracy for very
long query alignment. The short-read alignment should not be affected. The
changes include:
1) Z-dropoff. This is a variant of blast's X-dropoff. I originally thought this
heuristic only improved speed, but now I realize it also reduces poor
alignments flanked by long good alignments. The difference from blast's
X-dropoff is that Z-dropoff allows big gaps, but X-dropoff does not.
2) Band width doubling. When the band width is too small, we will get a poor
alignment in the middle. Sometimes such alignments cannot be fully excluded
with Z-dropoff. Band width doubling is an alternative heuristic. It is based
on the observation that the existence of a high score close to the band
boundary possibly implies inadequate band width. When we see such a signal,
we double the band width.
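
The two heuristics above can be sketched roughly as follows. This is an
illustration only, not the code from this commit: the function names, the way
the diagonal shift is discounted by the gap extension penalty, and the
three-quarters-of-band threshold are all assumptions.

```c
#include <assert.h>
#include <stdlib.h>

/* X-dropoff (blast style): stop once the current row's best score m
 * has fallen more than xdrop below the best score seen so far. */
static int xdrop_should_stop(int max, int m, int xdrop)
{
    return max - m > xdrop;
}

/* Z-dropoff (hypothetical form): same idea, but the part of the drop
 * explained by a gap is forgiven. (max_i, max_j) is where the best score
 * was seen and (i, j) is the current row's best cell. The diagonal shift
 * between the two cells approximates the gap length, and it is discounted
 * at gap_ext per base before comparing with zdrop. */
static int zdrop_should_stop(int max, int max_i, int max_j,
                             int m, int i, int j,
                             int gap_ext, int zdrop)
{
    int diag_shift = abs((i - max_i) - (j - max_j));
    return max - m - diag_shift * gap_ext > zdrop;
}

/* Band width doubling (hypothetical form): max_off is the largest distance
 * from the main diagonal at which a best score was observed. If it lies
 * within the outer quarter of the band, assume the band was too narrow
 * and return a doubled width; otherwise keep the current width. */
static int next_band_width(int w, int max_off)
{
    assert(w > 0);
    if (max_off >= (w >> 1) + (w >> 2))
        return w * 2;
    return w;
}
```

With a long gap between the best cell and the current cell, the Z-dropoff test
keeps extending where the X-dropoff test would already have stopped. A caller
would rerun the extension with the width returned by next_band_width() until
the width stops changing or a retry limit is reached.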
2013-03-05 00:57:16 -05:00
Heng Li
e0991d6a45
r323: added Z-dropoff, a variant of blast's X-drop
2013-03-05 00:34:33 -05:00
Heng Li
4bb0bdddca
r306: introduce clipping penalty
...
More clipping leads to more severe reference bias. We should not clip the
alignment unless necessary.
2013-02-27 21:13:39 -05:00
Heng Li
e620f0ff4e
r302: updated the manpage
2013-02-27 13:16:22 -05:00
Heng Li
98787f0ae0
r295: generate NM
2013-02-26 13:36:01 -05:00
Heng Li
9957e04590
r278: don't perform too many mate-sw
2013-02-25 11:56:02 -05:00
Heng Li
5092211d75
controllable scoring matrix
2013-02-25 11:24:21 -05:00
Heng Li
5ead86acd3
optionally mark split hit as secondary
2013-02-25 11:18:35 -05:00
Heng Li
4dc982a3c7
support interleaved fastq
2013-02-25 00:13:32 -05:00
Heng Li
0b4a40dc25
updated revision number; to merge into master
2013-02-24 13:34:20 -05:00
Heng Li
85775c3384
output multiple hits
2013-02-24 13:23:43 -05:00
Heng Li
6bdccf2a8a
added a bit of documentation
2013-02-24 13:09:29 -05:00
Heng Li
6e7903e9f3
added kopen support
2013-02-23 17:09:23 -05:00
Heng Li
b4c38bcc1c
append fasta/q comment
2013-02-23 16:57:34 -05:00
Heng Li
ee4540c394
support read group in bwa-mem
2013-02-23 16:41:44 -05:00
Heng Li
67543f19a1
code refactoring
2013-02-23 15:55:55 -05:00
Heng Li
3c330d5049
for another round of code cleanup
2013-02-23 15:12:26 -05:00
Heng Li
d460f2ec9e
bugfix in multi-threaded bwa-mem
2013-02-23 14:48:54 -05:00
Heng Li
f122fad562
minor code clean up
...
bwtio.c is merged to bwt.c
2013-02-22 17:09:40 -05:00
Heng Li
54da54ffd4
extend more seeds (and thus slower...)
2013-02-21 12:52:00 -05:00
Heng Li
66585b7982
code backup
2013-02-18 16:33:06 -05:00
Heng Li
95d18449b3
merge bseq.{h,c} to utils.{h,c}
...
I do not like many small files.
2013-02-12 10:36:15 -05:00
Heng Li
987d4b4205
fixed a stupid bug in fastq reading
2013-02-11 11:27:35 -05:00
Heng Li
59eaf650ac
code backup
2013-02-11 10:59:38 -05:00
Heng Li
cb55617f50
added a new line
2013-02-08 22:12:18 -05:00
Heng Li
95a79afe71
command-line prompt
2013-02-08 22:11:44 -05:00
Heng Li
39607065e0
allow more seeds to be seen (thus slower..)
2013-02-08 16:56:28 -05:00
Heng Li
cd6bd524d4
discard internal seeds shorter than half
2013-02-07 19:50:37 -05:00
Heng Li
ff3fea115c
write soft clip; added debugging code
2013-02-07 16:27:11 -05:00
Heng Li
1fd51fc3f7
code backup
2013-02-07 14:36:18 -05:00
Heng Li
5dc398cdef
start to write CLI
2013-02-07 13:13:43 -05:00
Heng Li
5a0b32bfd2
updated to the latest kseq.h
2013-02-06 14:38:40 -05:00
Heng Li
a9292d674d
a bit of code cleanup
2013-02-06 13:59:32 -05:00
Heng Li
a61288c768
separate CIGAR generation
2013-02-05 21:49:19 -05:00
Heng Li
1e16f3e701
calling ksw_global(); ksw_extend() is buggy!
2013-02-05 17:13:12 -05:00
Heng Li
d6a73c9171
chain filtering apparently working
2013-02-05 00:17:20 -05:00
Heng Li
9d0cdb2d3c
unfinished chain filter
2013-02-04 17:23:06 -05:00
Heng Li
788e9d1e3d
fixed a couple of leaks; buggy atm
2013-02-04 15:40:26 -05:00
Heng Li
ba18db1a9f
sw extension works for the simplest case
2013-02-04 12:37:38 -05:00
Heng Li
d25a87cc50
code backup
2013-02-02 15:14:24 -05:00
Heng Li
00e5302219
routine to get subsequence from 2-bit pac
2013-02-01 16:39:50 -05:00
Heng Li
f8f3b7577a
code cleanup; added a missing file
2013-02-01 14:38:44 -05:00
Heng Li
620ad6e5b9
reseed long SMEMs
2013-02-01 14:20:38 -05:00
Heng Li
8977737460
basic chaining working
...
Definitely suboptimal in a lot of corner cases...
2013-01-31 16:26:05 -05:00