Open
Labels: design, documentation, structure
Description
The toolbox should be presented specifically as taking the output of distributed-downloader
and performing all of the processing steps needed to create the TreeOfLife dataset.
- Rename repo: TreeOfLife-toolbox
- Rename under `src`: `TOL-toolbox`
- Reorganize under `TOL-toolbox` to have submodules based on function (e.g., `filtering`, `data-transfer`) with sub-submodules (e.g., `research-filtering`)
- Add from TreeOfLife-dev as submodules in `src/TOL-toolbox`:
  - webdataset creation
  - metadata transfer
  - lookup table creator
- Add from TreeOfLife-dev at root-level `scripts/` directory all lookup table exclude creators (many of these are here):
  - PDQ hash process
  - Image type separator (museum, camera trap, and citizen science categories)
  - Museum specimen processing: label vs specimen filter
  - Camera trap processing: megadetector & reducer
  - Citizen science image processing
  - Face detection
- Add from TreeOfLife-dev at root-level `notebooks/` directory all lookup table exclude creators:
  - clustering and other EDA determinations
- Add root-level `data/` folder with embeddings for museum support sets (relates to Add processing tools for exclusion sets #22).
This will be done following the merge of PRs #1 through #15.
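
To make the target layout concrete, here is a minimal sketch that scaffolds the proposed directories in a temporary location. The directory names are taken from the list above. Nesting `research-filtering` under `filtering` is an assumption, and the `LAYOUT` constant and `scaffold` helper are hypothetical names used only for illustration, not part of the toolbox.

```python
import tempfile
from pathlib import Path

# Proposed layout, transcribed from the issue. Placing research-filtering
# under filtering is an assumption; the issue only names it as an example
# of a sub-submodule.
LAYOUT = [
    "src/TOL-toolbox/filtering/research-filtering",
    "src/TOL-toolbox/data-transfer",
    "scripts",
    "notebooks",
    "data",
]


def scaffold(root: Path) -> list[Path]:
    """Create each directory in LAYOUT under root and return the created paths."""
    created = []
    for rel in LAYOUT:
        path = root / rel
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created


if __name__ == "__main__":
    # Build the skeleton in a throwaway directory and print the resulting tree.
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp) / "TreeOfLife-toolbox"
        scaffold(root)
        for p in sorted(root.rglob("*")):
            print(p.relative_to(root).as_posix())
```

The sketch creates directories only. Which files land in `scripts/`, `notebooks/`, and `data/` depends on what is brought over from TreeOfLife-dev.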
hlapp