This project aims to cluster feedback texts using advanced natural language processing techniques. It utilizes Sentence Transformers for generating meaningful embeddings of feedback sentences, followed by KMeans clustering to group similar feedback together.
- Sentence Embeddings: Utilizes state-of-the-art Sentence Transformers to convert feedback texts into numerical representations.
- Unsupervised Clustering: Applies KMeans clustering to organize feedback into distinct groups based on similarity.
- Visualization: Includes visual tools such as the Elbow Method and Silhouette Score to determine optimal cluster numbers.
- Clone the repository:
- Install dependencies:
- Prepare your feedback data in a CSV format (
data.csv
). - Run
notebook.ipynb
to generate clusters and visualize results: - View the output clusters and explore insights from the feedback data.
- Python 3.6+
- Dependencies listed in
requirements.txt
Contributions are welcome! Please fork the repository and submit pull requests for new features, improvements, or bug fixes.