A multilingual command line sentence tokenizer in Golang
Updated Feb 28, 2024 - Go
KSS: Korean String processing Suite
This repository contains the Uwuifier package. It is written in TypeScript for Deno, compiled to JavaScript for NPM, tested with Jest, and published on NPM and https://deno.land.
A Word Level Transformer layer based on PyTorch and 🤗 Transformers.
Generate Sentences From Meaningless Words.
Conjugates, downloads audio files, brings up detailed word and kanji information, creates tests and more. Useful for quickly making Anki cards and searching definitions of words.
Finally, some decent sample sentences
🔱 explore textual possibilities like never before
Contributed markdownlint rule for limiting sentences per line. 📐
Generate grammatical sentences https://1j01.itch.io/nonsensical
A high-performance wrapper around Intl.Segmenter for efficient text segmentation. This class resolves memory handling issues seen with large strings and "maximum call stack exceeded" exceptions that occur when strings exceed 40-50k characters. Enhances performance by 50-500x. Only ~70 loc (with comments) and no dependencies.
Romanian Word Embeddings. Here you can find pre-trained corpora of word embeddings. Current methods: CBOW, Skip-Gram, Fast-Text (from Gensim library). The .vec and .model files are available for download (all in one archive).
Over-engineered string template engine with a simple interface, focused on versatility and user control.
More than 45,000 sentences in total: trivia, would you rather, never have I ever, and truth or dare.
Public Domain Words and Texts for Conlangs
Collaborative list of questions to trigger interesting conversations, thinking ... and, obviously, avoid small talk.