Hidden Markov Model for Speech Recognition

This project is a compilation of assignment code for the CS 304 course at Duke Kunshan University, taught by Prof. Li.

The project uses data from TI Digits, an old dataset that contains several thousand digit-sequence recordings from dozens of speakers. Because the data was recorded before C was even standardized, the conversion software included on the TI Digits CD-ROM may not work as expected. Instead, FFmpeg can be used to convert the recordings to a modern format.
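The conversion step could be scripted roughly as follows. This is a minimal sketch, not the project's actual script: the function names `build_ffmpeg_command` and `convert_tree`, the `*.wav` glob pattern, and the choice of 16-bit PCM WAV as the output format are all assumptions made for illustration. It also assumes `ffmpeg` is on the `PATH` and that your FFmpeg build can decode the original recordings.

```python
import subprocess
from pathlib import Path
from typing import List, Optional


def build_ffmpeg_command(src: Path, dst: Path,
                         sample_rate: Optional[int] = None) -> List[str]:
    """Build an ffmpeg command that re-encodes one recording as 16-bit PCM WAV.

    Hypothetical helper: the output format is an assumption, not a project setting.
    """
    cmd = ["ffmpeg", "-y", "-loglevel", "error", "-i", str(src)]
    if sample_rate is not None:
        # Resample only when explicitly requested; otherwise keep the source rate.
        cmd += ["-ar", str(sample_rate)]
    cmd += ["-c:a", "pcm_s16le", str(dst)]
    return cmd


def convert_tree(src_root: Path, dst_root: Path, pattern: str = "*.wav") -> int:
    """Convert every file matching `pattern` under src_root, mirroring the layout."""
    converted = 0
    for src in sorted(src_root.rglob(pattern)):
        dst = dst_root / src.relative_to(src_root)
        dst.parent.mkdir(parents=True, exist_ok=True)
        # check=True raises CalledProcessError if ffmpeg fails on a file.
        subprocess.run(build_ffmpeg_command(src, dst), check=True)
        converted += 1
    return converted
```

Calling `convert_tree(Path("tidigits_raw"), Path("tidigits_wav"))` (both directory names are placeholders) would write converted copies into a separate tree and leave the originals untouched. Adjust `pattern` if the files on your copy of the dataset use a different extension or case.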

Results

In its final form, an unrestricted HMM trained on continuous speech sequences, the recognizer reaches an accuracy of 85% on the TI Digits test set. Accuracy here is measured per sequence: a prediction counts as correct only if it matches the true digit sequence exactly. An interactive script is also provided for testing.
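To make the metric concrete, exact-match sequence accuracy can be computed as below. This is an illustrative sketch, not the project's evaluation code: the function name `sequence_accuracy` and the representation of each utterance as a list of digit labels are assumptions.

```python
from typing import Sequence


def sequence_accuracy(predicted: Sequence[Sequence[str]],
                      reference: Sequence[Sequence[str]]) -> float:
    """Fraction of utterances whose predicted digit sequence equals the reference exactly."""
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same number of utterances")
    if not reference:
        raise ValueError("cannot compute accuracy on an empty test set")
    # A single wrong, missing, or extra digit makes the whole utterance incorrect.
    correct = sum(1 for p, r in zip(predicted, reference) if list(p) == list(r))
    return correct / len(reference)
```

Under this definition a prediction of `["1", "2", "3"]` against a reference of `["1", "2", "4"]` scores zero for that utterance, so the metric is stricter than per-digit accuracy.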


Repository: loeeeee/SpeechRecognition-HiddenMarkovModel
