Reimplement the current alignment::Aligner without breaking API #379

TianyiShi2001 · 2020-10-03T09:58:03Z

Faster global, local, semiglobal alignment with less space usage. Related to #354 and #355 (review).

All tests are unchanged and passed. No breaking changes. The API is completely unchanged.

Old(a1b1143):

test bench_aligner_wc_global     ... bench: 549,914,573 ns/iter (+/- 66,950,334)
test bench_aligner_wc_local      ... bench: 648,835,834 ns/iter (+/- 93,166,340)
test bench_aligner_wc_semiglobal ... bench: 622,027,993 ns/iter (+/- 43,517,887)

New:

test bench_aligner_wc_global     ... bench: 117,967,967 ns/iter (+/- 11,637,569)
test bench_aligner_wc_local      ... bench: 220,659,055 ns/iter (+/- 18,904,199)
test bench_aligner_wc_semiglobal ... bench: 152,071,703 ns/iter (+/- 11,597,219)

I didn't implement the "custom" alignment bacause it seemed not very useful. I haven't seen this type of alignment other than in SeqAn and rust-bio. Maybe this will be discussed in another issue.

- fix two typos - add internal links - fix format (schematics in code blocks; proper new lines)

repitition of line 559: self.S[k].extend(repeat(MIN_SCORE).take(m + 1));

now passes all tests!

a commit for comparing the benchmarks of the old aligner and the new one. To be restored later

All tests are unchanged and passed.

#379

TianyiShi2001 · 2020-10-03T10:06:17Z

TODO

update docs
deprecation warning of with_capacity methods
bypass clippy warning with justification

already checked when creating Scoring

TianyiShi2001 · 2020-10-03T10:32:10Z

This will partially fix #377

TianyiShi2001 · 2020-10-03T10:44:12Z

I want to deprecate with_capacity because the time taken to initialize a few vectors is almost negligible when compared to the alignment computation as a whole.

In addition, if the aligner first aligns large sequences and allocates a big capacity, and then aligns short sequences, there will be a significant waste of memory

rust-bio/rust-bio#379

maxall41 · 2024-05-23T02:54:21Z

This is really great work! Has any progress been made on this?

TianyiShi2001 and others added 20 commits September 6, 2020 09:17

improve docs of bio::alignment::pairwise

752ab77

- fix two typos - add internal links - fix format (schematics in code blocks; proper new lines)

improve docs for bio::alignment::pairwise

7c4eb6f

ignore non-node

28b6428

idomatic assignment

ca790a2

idiomatic

4f2e943

remove repetition

9c174b5

repitition of line 559: self.S[k].extend(repeat(MIN_SCORE).take(m + 1));

Improve algorithm of alignment::pairwise

efb6390

fix the bug

4f51e40

now passes all tests!

benchmark

a96e3b6

a commit for comparing the benchmarks of the old aligner and the new one. To be restored later

restore

d43bac0

Merge branch 'ts-improve-docs' into aligner-restructure

b956e36

update docs

e7dad84

add reference

b5f9424

Merge branch 'master' into aligner-restructure

464b45a

Create fast.rs

64eb1a5

Merge branch 'seq-aln' into fast-alignment

8e77748

Create fast.rs

bf9e89d

reimplement the fast but O(nm) space alignment algorithm

5e2c730

All tests are unchanged and passed.

Merge branch 'aligner-restructure' into fast-alignment

e569e4f

removing mut

1b9aece

TianyiShi2001 mentioned this pull request Oct 3, 2020

improve speed & space-efficiency of current alignment::pairwise::Aligner #354

Closed

TianyiShi2001 added a commit that referenced this pull request Oct 3, 2020

remove pairwise::fast; open a separate PR

879e8ac

#379

TianyiShi2001 mentioned this pull request Oct 3, 2020

Reimplement Pairwise Alignment Algorithms #355

Draft

TianyiShi2001 added the WIP label Oct 3, 2020

TianyiShi2001 added 4 commits October 3, 2020 11:08

Delete fast.rs

daf2a1b

deprecation warning

ca9fed8

remove repeated asserts

2e04e26

already checked when creating Scoring

bypass clippy with justification

abf29ad

using Aligner::new instead of Aligner::with_capacity

941b5b3

TianyiShi2001 added 3 commits October 3, 2020 12:06

reorganise docs

1178df8

remove use of deprecated fn

5a0b16b

fix clippy

ff27825

ChallengeDev210 pushed a commit to ChallengeDev210/rust-bio that referenced this pull request Aug 1, 2022

remove pairwise::fast; open a separate PR

d2e4cb4

rust-bio/rust-bio#379

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Reimplement the current alignment::Aligner without breaking API #379

Reimplement the current alignment::Aligner without breaking API #379

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reimplement the current alignment::Aligner without breaking API #379

Are you sure you want to change the base?

Reimplement the current alignment::Aligner without breaking API #379

Uh oh!

Conversation

Uh oh!

Uh oh!

TODO

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!