GitHub - barretts/mid-enigma-wordgen: generating pronouncable words that don't exist already

https://web.archive.org/web/20190529191811/https://www.wolfram.com/language/gallery/generate-random-pronounceable-words/ What is a Bigram and a Trigram?

A bigram is a sequence of two consecutive elements (e.g., letters, words) in a given dataset. A trigram is a sequence of three consecutive elements.

In the context of Markov models for word generation, we use bigrams and trigrams to predict the next letter based on the previous one (bigram) or previous two (trigram). Example

Given the word "hello", we can extract:

Bigrams: "he", "el", "ll", "lo"

Trigrams: "hel", "ell", "llo"

Model	Definition	Pros	Cons
Bigram	Predicts next letter based on the last one letter.	✅ Simpler, needs less data. ✅ Faster word generation.	❌ Less structure, more randomness. ❌ Might generate unrealistic letter combinations.
Trigram	Predicts next letter based on the last two letters.	✅ More context-aware. ✅ Generates more realistic, structured words.	❌ Requires more training data. ❌ Slightly slower word generation.

When to Use Which?

Use a Bigram Model if:

    You need a lightweight, fast method for generating words.

    You don't mind if some words are a bit random.

    You have a small dataset to train on.

Use a Trigram Model if:

    You want more natural, structured, pronounceable words.

    You have enough training data to capture realistic letter sequences.

    You’re okay with slightly slower generation for better quality.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

barretts/mid-enigma-wordgen

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages