Skip to content

Speed up kmer indexing  #20

Open
Open
@Adamtaranto

Description

@Adamtaranto

Kmer indexing currently uses python to loop over sequences extract each kmer and add index position to a dict. See find_match_pos_diag()

This is not practical for chromosome scale sequences due to long compute time. The kmer indexing and location lookup task could be sped up by writing this function in Rust (i.e. using Rust-Bio FM-Index).

Would require adding Maturin as a build dep for mixed rust / python project.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions