Skip to content

memory mapping #33

@mcolpus

Description

@mcolpus

Thanks for the great software. I just wanted to check if memory mapping (like with Kraken2) works with Sylph, or if it doesn't make sense to here. The index here is much smaller but still some 14G.

I've tried putting the database into /dev/shm and that didn't seem to affect the running time.
The longest step was obtaining sketches which took 20 seconds either way. But looking in htop it seems that the limit step is loading into RAM.

ubuntu@mcolpus-main:~/pipelines/gatekeeper_pipeline$ sylph profile /mnt/block_data/sylph_dbs/gtdb-r220-c200-dbv1.syldb -t 20 -1 reads_1.fastq.gz -2 reads_2.fastq.gz -o out
2024-12-09T11:08:17.811Z INFO  [sylph::contain] Obtaining sketches...
2024-12-09T11:08:37.584Z INFO  [sylph::contain] Finished obtaining genome sketches.
2024-12-09T11:08:41.640Z INFO  [sylph::contain] reads_1.fastq.gz taxonomic profiling; reassigning k-mers for 1 genomes...
2024-12-09T11:08:41.685Z INFO  [sylph::contain] reads_1.fastq.gz has 1 genomes passing profiling threshold. 
2024-12-09T11:08:41.685Z INFO  [sylph::contain] Finished paired sample reads_1.fastq.gz.
2024-12-09T11:08:41.685Z INFO  [sylph::contain] sylph finished.

ubuntu@mcolpus-main:~/pipelines/gatekeeper_pipeline$ sylph profile /dev/shm/gtdb-r220-c200-dbv1.syldb -t 20 -1 reads_1.fastq.gz -2 reads_2.fastq.gz -o out
2024-12-09T11:09:02.694Z INFO  [sylph::contain] Obtaining sketches...
2024-12-09T11:09:23.187Z INFO  [sylph::contain] Finished obtaining genome sketches.
2024-12-09T11:09:27.228Z INFO  [sylph::contain] reads_1.fastq.gz taxonomic profiling; reassigning k-mers for 1 genomes...
2024-12-09T11:09:27.273Z INFO  [sylph::contain] reads_1.fastq.gz has 1 genomes passing profiling threshold. 
2024-12-09T11:09:27.273Z INFO  [sylph::contain] Finished paired sample reads_1.fastq.gz.
2024-12-09T11:09:27.273Z INFO  [sylph::contain] sylph finished.

If I change to using -t 1 then it still takes 20 seconds to obtain sketches, but subsequent steps take longer as expected.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions