Unsupervised Classification of Documents

# Description
Using some kind of clustering algorithm to predict a class per document. Classes may be genre, topic, usefulness, etc. Finding the closest cluster per document relies on a distance metric.

# Objectives
1. Implement different clustering algorithms to classify documents into an arbitrary set of classes. Text similarity would be a good starting point as the distance metric utilized.
2. Use zero-shot learning (ZSL) to classify documents from a group of pre-determined classes. *HuggingFace* has a pipeline for that. Checkout the comments in [here](https://discuss.huggingface.co/t/new-pipeline-for-zero-shot-text-classification/681).


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Unsupervised Classification of Documents #7

Description

Objectives

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Unsupervised Classification of Documents #7

Description

Description

Objectives

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions