Skip to content

script: options for shuffling the order in which sketches are searched in a database #3801

@ctb

Description

@ctb

for straight up sourmash zipfiles, it turns out to be surprisingly hard to change the order in which sketches are loaded.

some options for @bettafische -

  • update the actual order of things in the SOURMASH-MANIFEST.csv inside the actual sig zip file; requires copying the entire sig zip file 😭.
  • use a file list - a list of paths to .sig, .sig.gz, or .sig.zip files containing individual sketches - and shuffle that. May be slow/memory intensive.

it would also be fairly straightforward to add an option to sourmash to randomize the order of manifests, but it would be a bit of a hack... could be good for testing/evaluation but shouldn't be in the released version of sourmash.

Metadata

Metadata

Assignees

No one assigned

    Labels

    codeherein lies code

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions