🤖 NLU Chatbot Project (Work in Progress)

This is a Natural Language Understanding (NLU) project written in Python, where I aim to build a chatbot that can intelligently respond to customer inquiries based on their utterances.

Note: This project is still in development! I’m currently exploring two different NLP libraries: spaCy and Stanza. The code isn’t finalized yet, and I’m experimenting and learning as I go.

Project Goal

The goal of this chatbot is to understand customer utterances from a dataset and provide appropriate responses. The bot doesn’t use predefined intents — instead, it learns patterns and clusters them based on semantic similarity.

What I'm Doing (Step-by-Step)

Preprocessing customer utterances from a dataset
Tokenizing the text using both spaCy and Stanza
Creating custom stopword lists
Mapping certain words (e.g., synonyms or brand-specific terms) to standard forms
Lemmatizing the tokens for better generalization
Vectorizing utterances using Sentence-BERT (SBERT)
Using KMeans Clustering to group similar utterances
Assigning appropriate chatbot responses based on clusters

Approaches

I'm testing two NLP pipelines:

chatbot_spacy.py – Based on spaCy
chatbot_stanza.py – Based on Stanza

Each script is a work-in-progress and may evolve over time.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
data		data
notebooks		notebooks
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🤖 NLU Chatbot Project (Work in Progress)

Project Goal

What I'm Doing (Step-by-Step)

Approaches

About

Uh oh!

Releases

Packages

Uh oh!

Languages

katerinaharana/chatbot

Folders and files

Latest commit

History

Repository files navigation

🤖 NLU Chatbot Project (Work in Progress)

Project Goal

What I'm Doing (Step-by-Step)

Approaches

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages