Commit Graph

30 Commits (7d085962a26cab160a07a42a461378f575ff011a)

Author SHA1 Message Date
zzh 7d085962a2 开始改成sbwa那种batch模式 2024-03-07 18:23:21 +08:00
zzh 463f7da138 将smem1函数用fmt结构实现了,结果基本正确 2024-02-07 22:08:51 +08:00
zzh 293f3bb80e 添加了fmt_idx文件,开始改进seed过程 2023-12-25 11:11:19 +08:00
Heng Li d422bdbed9 debug flag to measure memory 2021-02-22 23:26:03 -05:00
Heng Li 02a9add042 added MIT license to some non-GPL source files 2020-07-01 23:02:01 -04:00
Heng Li 9a828344ec moved indexing to libbwa 2016-02-13 18:48:46 -05:00
Heng Li b5f6ed3020 r1005: insert arbitrary header lines 2014-11-19 10:59:05 -05:00
Heng Li 80e4ecfa79 r998: smart pairing; allow mixture of SE/PE reads 2014-11-18 14:30:22 -05:00
Heng Li 6804f17d38 replaced Sys V shm with POSIX shm 2014-10-15 23:06:03 -04:00
Heng Li bfd5e1840f shm works on small files, but not large ones
I don't know why. SHMMAX, SHMALL and SHMMNI are large enough.
2014-10-15 15:44:06 -04:00
Heng Li 6a0952948d shared memory 2014-10-15 14:44:08 -04:00
Heng Li c5e859b49f r898: read the index into a single memory block
Prepare for shared memory. Not used now.
2014-10-15 12:27:45 -04:00
Heng Li 8638cfadc8 dev-472: get rid of bwa_fix_xref()
This function causes all kinds of problems when the reference genome consists
of many short reads/contigs/chromsomes. Some of the problems are nearly
unfixable at the point where bwa_fix_xref() gets called. This commit attempts
to fix the problem at the root. It disallows chains spanning multiple contigs
and never retrieves sequences bridging two adjacent contigs. Thus all the
chaining, extension, SW and global alignments are confined to on contig only.

This commit brings many changes. I have tested it on a couple examples
including Peter Field's PacBio example. It works well so far.
2014-04-10 20:54:27 -04:00
Heng Li 9ce50a4e5e dev-450: support diff ins/del penalties. NO TEST!! 2014-03-28 14:54:06 -04:00
Heng Li 07921659cf move mem_fill_scmat() to bwa.{h,c} 2013-03-05 09:38:12 -05:00
Heng Li 98787f0ae0 r295: generate NM 2013-02-26 13:36:01 -05:00
Heng Li e70c7c2a71 r284: amend cross-reference hit
I really hate this: complex and twisted logic for a nasty scenario that almost
never happens to short reads - but it may become serious when the reference
genome consists of many contigs.

On toy examples, the code seems to work. Don't know if it really works...
2013-02-26 00:03:49 -05:00
Heng Li 6bdccf2a8a added a bit documentation 2013-02-24 13:09:29 -05:00
Heng Li ee4540c394 support read group in bwa-mem 2013-02-23 16:41:44 -05:00
Heng Li cfa7165036 cleanup index loading code 2013-02-23 16:10:48 -05:00
Heng Li 67543f19a1 code refactoring 2013-02-23 15:55:55 -05:00
Heng Li e613195e17 moved some common code to bwa.{c,h} 2013-02-23 15:30:46 -05:00
Heng Li e5ab59db53 Multiple changes:
1. Removed bwa.{h,c}. I am not going to finish them anyway.
2. Updated to the latest khash.h, which should be faster.
3. Define 64-bit vector and 128-bit integer/vector in utils.h.
2013-02-12 09:50:28 -05:00
Heng Li 9c486fa41e updated APIs abit 2012-04-08 00:55:52 -04:00
Heng Li 3b5a9e5595 simplified bwa_se() interface 2012-04-08 00:12:34 -04:00
Heng Li ca93a71e6e complete single-end alignment 2012-04-08 00:02:06 -04:00
Heng Li 080726cb47 preliminary doc 2012-04-07 22:50:07 -04:00
Heng Li 1cef219667 compute CIGAR; rev seq not working 2012-04-07 22:00:03 -04:00
Heng Li 66154ff5d2 towards refined gapped; unfinished 2012-04-07 01:25:39 -04:00
Heng Li a1abfe9977 API: aln seems working 2012-04-07 00:23:01 -04:00