This repository documents how to convert Greek text to and from Betacode, a necessary step for applying Morpheus morphological tagging to the Greek words in the N1904-TF dataset of the New Testament.
- Create NT wordlist in Betacode.ipynb: Create the wordlist from the N1904 corpus.
- Mapping unicode to betacode.ipynb: The conversion from Unicode to Betacode.
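The wordlist step can be illustrated with a minimal standard-library sketch. It assumes the corpus words are already available as plain Python strings (in the notebooks they come from the N1904-TF dataset); the function name `build_wordlist` and the sample words are hypothetical and only serve the illustration. Words are normalized to NFC first, so that visually identical forms encoded differently (for example tonos versus oxia) are counted as one entry.

```python
import unicodedata
from collections import Counter


def build_wordlist(words):
    """Return sorted (word, frequency) pairs for the unique word forms.

    Hypothetical helper: `words` is any iterable of Greek word strings.
    """
    counts = Counter(
        unicodedata.normalize("NFC", w.strip())  # unify equivalent encodings
        for w in words
        if w.strip()                             # skip empty entries
    )
    return sorted(counts.items())


# Illustrative sample only, not taken from the actual wordlist.
sample = ["ἐν", "ἀρχῇ", "ἦν", "ὁ", "λόγος", "καὶ", "ὁ", "λόγος"]
for word, freq in build_wordlist(sample):
    print(word, freq)
```

The resulting unique forms are what would then be passed through the Unicode to Betacode conversion.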
The engine behind these notebooks is beta-code-py, which is available under the MIT license. The notebooks are therefore made available under the same license.
Transcoding between Betacode and Unicode can easily be done in the following manner:
import beta_code

# Unicode Greek to Betacode
beta_code.greek_to_beta_code(u'χαῖρε ὦ κόσμε')
# => 'xai=re w)= ko/sme'

# Betacode back to Unicode Greek
beta_code.beta_code_to_greek('xai=re w)= ko/sme')
# => 'χαῖρε ὦ κόσμε'
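To give an idea of what such a transcoding involves, here is a simplified, standard-library-only sketch. It is not how beta-code-py is implemented; it is an illustration under stated assumptions: the text is decomposed with Unicode NFD, lowercase base letters are mapped to their Betacode letters, and combining diacritics are mapped to Betacode symbols in the order in which NFD yields them. The function name `to_beta_sketch` is hypothetical. Capitals and punctuation are not handled and pass through unchanged.

```python
import unicodedata

# Lowercase Greek base letters -> Betacode letters (final sigma also maps to 's').
LETTERS = dict(zip("αβγδεζηθικλμνξοπρσςτυφχψω",
                   "abgdezhqiklmncoprsstufxyw"))

# Combining diacritics (as produced by NFD) -> Betacode symbols.
MARKS = {
    "\u0313": ")",   # smooth breathing
    "\u0314": "(",   # rough breathing
    "\u0301": "/",   # acute
    "\u0300": "\\",  # grave
    "\u0342": "=",   # circumflex (perispomeni)
    "\u0345": "|",   # iota subscript
    "\u0308": "+",   # diaeresis
}


def to_beta_sketch(text):
    """Simplified Unicode -> Betacode conversion (lowercase Greek only)."""
    out = []
    for ch in unicodedata.normalize("NFD", text):
        if ch in MARKS:
            out.append(MARKS[ch])
        elif ch in LETTERS:
            out.append(LETTERS[ch])
        else:
            out.append(ch)  # spaces, capitals, punctuation: left as-is
    return "".join(out)


print(to_beta_sketch("χαῖρε ὦ κόσμε"))
```

For real work the beta-code-py functions shown above remain the better choice, since they also cover capitals, punctuation and the reverse direction.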
An online conversion tool was developed by Zachary Fletcher and published in his repository zfletch/beta-code-converter-js. The tool is available for use online at apps.perseids.org/beta-code.
A new feature, betacode, has been created that can be loaded as an additional module together with the N1904-TF Text-Fabric dataset, a syntactically annotated Greek New Testament.