question: I am interested to contribute, is there a public issue backlog? #18

This project seems quite helpful overall. Honestly, I'm most interested in the clustering; we are fairly happy with our deduplication system as is. It looks like, for the clustering to work in its current form, you need enough memory to hold all of your vectors at once before you can run the algorithm.

Most of our customer vector datasets are >80GB in size, so we would need some way to cluster them in a paginated fashion, one page of vectors at a time. It would be cool to contribute that, but I wanted to check whether there is already an issue for it, or for something adjacent.
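To make the "paginated" idea concrete, here is a rough sketch of what I have in mind. It is not based on this project's code: I am assuming the vectors sit in a flat float32 binary file, and I am using mini-batch k-means as a stand-in, since I don't know which clustering algorithm the project runs internally. The names `iter_pages` and `minibatch_kmeans` are hypothetical.

```python
import numpy as np


def iter_pages(path, dim, page_rows, dtype=np.float32):
    """Yield consecutive (rows, dim) pages from a flat binary file of vectors.

    Assumption: the file is a raw dump of `dtype` values, row-major,
    with `dim` values per vector.
    """
    # memmap maps the file without loading it; only sliced pages hit RAM.
    data = np.memmap(path, dtype=dtype, mode="r").reshape(-1, dim)
    for start in range(0, data.shape[0], page_rows):
        # np.array copies just this page into memory.
        yield np.array(data[start:start + page_rows])


def minibatch_kmeans(pages, k, seed=0):
    """Single pass of mini-batch k-means over an iterable of pages.

    Each center is kept as the running mean of the points assigned to it
    so far, so memory use is bounded by one page plus the k centers.
    Assumption: the first page holds at least k vectors.
    """
    rng = np.random.default_rng(seed)
    centers = None
    counts = np.zeros(k, dtype=np.int64)
    for page in pages:
        if centers is None:
            # Initialise centers from k distinct rows of the first page.
            idx = rng.choice(page.shape[0], size=k, replace=False)
            centers = page[idx].astype(np.float64)
        # Squared distances via ||x||^2 - 2 x.c + ||c||^2, shape (rows, k).
        d2 = (
            (page.astype(np.float64) ** 2).sum(axis=1)[:, None]
            - 2.0 * (page @ centers.T)
            + (centers ** 2).sum(axis=1)[None, :]
        )
        labels = d2.argmin(axis=1)
        for j in np.unique(labels):
            members = page[labels == j]
            counts[j] += members.shape[0]
            # Running-mean update: weight the page mean by its share of
            # all points this center has seen so far.
            lr = members.shape[0] / counts[j]
            centers[j] += lr * (members.mean(axis=0) - centers[j])
    return centers
```

Usage would look something like `minibatch_kmeans(iter_pages("vectors.f32", dim=768, page_rows=100_000), k=256)`, where the file name and sizes are placeholders. A single pass like this only approximates full k-means, so in practice you would probably want several passes or a smarter initialisation, but peak memory stays at roughly one page regardless of dataset size.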
