Avoid identical computation in self tanimoto similarity

The ´self_tanimoto_similarity´ function equates matrix_a to itself, and then it calls the `tanimoto_similarity_sparse`. Calculating norm_2 is repeated in this case which is unnecessarily costly for large arrays. See https://github.com/basf/MolPipeline/blob/8190785d7ec0ad19db019938be7b6ef050f22635/molpipeline/utils/kernel.py#L29

We can add a simple check for identity of the two matrices to avoid redundant computation.

Thanks to Afnan for bringing this to our attention!



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Avoid identical computation in self tanimoto similarity #117

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Avoid identical computation in self tanimoto similarity #117

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions