Open
Description
K-mer indexing currently uses Python to loop over each sequence, extract every k-mer, and add its index position to a dict. See find_match_pos_diag().
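For reference, the dict-based approach described above can be sketched roughly as follows. This is a minimal illustration and not the actual body of find_match_pos_diag(); the function name index_kmers and its signature are assumptions made for the example.

```python
from collections import defaultdict


def index_kmers(seq: str, k: int) -> dict[str, list[int]]:
    """Map each k-mer in seq to the list of its 0-based start positions.

    Hypothetical helper for illustration only; not the project's real API.
    """
    if k <= 0:
        raise ValueError("k must be a positive integer")
    index: defaultdict[str, list[int]] = defaultdict(list)
    # One string slice and one dict update per sequence position.
    for i in range(len(seq) - k + 1):
        index[seq[i:i + k]].append(i)
    return dict(index)
```

Every position in the sequence costs one slice and one dict update inside the Python interpreter, so the run time grows with sequence length and carries per-iteration interpreter overhead.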
This approach is not practical for chromosome-scale sequences because of the long compute time. The k-mer indexing and position lookup could be sped up by rewriting this function in Rust (e.g. using the FM-index from Rust-Bio).
This would require adding Maturin as a build dependency to support a mixed Rust/Python project.
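As a rough sketch of what the build change might look like, a pyproject.toml for a mixed Rust/Python layout could declare Maturin as the build backend. The package and module names below are hypothetical placeholders, and the fragment assumes the Rust code is exposed through PyO3 bindings, which this issue does not specify.

```toml
[build-system]
requires = ["maturin>=1.0,<2.0"]
build-backend = "maturin"

[tool.maturin]
# Hypothetical names; adjust to the actual package layout.
python-source = "python"                  # directory holding the pure-Python package
module-name = "mypackage._kmer_index"     # where the compiled Rust extension is placed
features = ["pyo3/extension-module"]      # assumes PyO3 is used for the bindings
```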