search: weight names according to TF-IDF #597

Open
adamdecaf opened this issue Feb 15, 2025 · 0 comments
Labels
enhancement (New feature or request), match results (Questions, concerns, bugs around match percentages)

Comments

Member

adamdecaf commented Feb 15, 2025

We should apply TF-IDF (a match on a less common term counts for more) across the indexed corpus, so that name matches are ranked by how distinctive their terms are within the index.

This should reduce false positives for common names (which often require more matching fields anyway) and raise match percentages for less common names (less common within the index, at least).
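A minimal sketch of the idea, not the project's implementation. It assumes whitespace tokenization, exact term matching (no fuzzy comparison), and one common smoothed IDF variant. The names `tokenize`, `build_idf`, and `weighted_match`, along with the sample index, are hypothetical and only for illustration.

```python
import math
from collections import Counter


def tokenize(name: str) -> list[str]:
    """Lowercase and split on whitespace (a deliberately simple tokenizer)."""
    return name.lower().split()


def build_idf(names: list[str]) -> dict[str, float]:
    """Smoothed inverse document frequency for each term in the index."""
    n = len(names)
    doc_freq = Counter()
    for name in names:
        # Count each term once per name, regardless of repeats within it.
        doc_freq.update(set(tokenize(name)))
    return {term: math.log((1 + n) / (1 + df)) + 1.0 for term, df in doc_freq.items()}


def weighted_match(query: str, candidate: str, idf: dict[str, float]) -> float:
    """Fraction of the query's IDF weight covered by terms shared with candidate."""
    # Assumption: terms never seen in the index are treated as maximally rare.
    default = max(idf.values(), default=1.0)
    q_terms = set(tokenize(query))
    c_terms = set(tokenize(candidate))
    total = sum(idf.get(t, default) for t in q_terms)
    if total == 0:
        return 0.0
    matched = sum(idf.get(t, default) for t in q_terms & c_terms)
    return matched / total


# Hypothetical index: "john" is common, "zebediah" is rare.
index = ["john smith", "john doe", "john adams", "zebediah smith"]
idf = build_idf(index)

# Both pairs share exactly one of two terms, so an unweighted score would be 0.5 for each.
common = weighted_match("john smith", "john doe", idf)
rare = weighted_match("zebediah smith", "zebediah doe", idf)
```

With this sample index, the pair that shares only the common term "john" scores below 0.5, while the pair that shares the rare term "zebediah" scores above it. That is the intended effect: fewer false positives on common names and stronger matches on distinctive ones.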

adamdecaf added the enhancement and match results labels Feb 15, 2025
adamdecaf added this to the v1.1 milestone Feb 15, 2025
adamdecaf removed this from the v1.1 milestone Mar 7, 2025