This codebase has been developed to power the following research paper:
Palma, C. (2023). Encrypted epigraphy: The case of a mysterious inscription in the Neapolitan church of Santa Maria La Nova. Proceedings of the 6th International Conference on Historical Cryptology HistoCrypt 2023, 139–147. Published on May 30, 2023.
If you are using it for publication purposes, please use the provided citation.
The main tool currently available for deciphering historical ciphers is CrypTool 2. It offers a wide choice of encryption and decryption algorithms in a polished, interactive user interface. Although the provided clear-text languages are updated at every release, they are fixed and not customizable. The major added value of this contribution is a workflow for generating N-grams files from customized historical corpora. A notable source for downloading historical corpora is HistCorp.
The choice can be supported by calculating Friedman's Index of Coincidence on the encrypted text. The corpus with the most similar Index of Coincidence is a good candidate for further attacks; the codebase provides a function that performs this calculation.
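That function is not reproduced here; the following is a minimal sketch of such a calculation (the function name `index_of_coincidence` is illustrative, not necessarily the repository's):

```python
from collections import Counter

def index_of_coincidence(text: str) -> float:
    """Friedman's Index of Coincidence over the letters of a text."""
    letters = [c for c in text.upper() if c.isalpha()]
    n = len(letters)
    if n < 2:
        return 0.0
    counts = Counter(letters)
    return sum(f * (f - 1) for f in counts.values()) / (n * (n - 1))

# Compare the ciphertext's IC with the IC of each candidate corpus and
# keep the corpus whose value is closest.
# ic_cipher = index_of_coincidence(ciphertext)
# ic_corpus = index_of_coincidence(open("corpus.txt", encoding="utf-8").read())
```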
Another possibility would be the AZdecrypt integrated function: "Languages" -> select "File" -> "Batch n-grams (substitution)" -> open "Languages.azd" under "Languages".
To include the customized languages/corpora in the chi-squared test, the "languages.azd" file should be placed in the AZdecrypt\N-grams\N-gram normalization\ folder and edited to include them. Look at the other normalization files in that folder to see how your data should look.
After selecting a historical corpus, it is necessary to pre-process it. The present codebase also provides functions to achieve this (cleaning, removing spaces, removing special characters, etc.) in the module "NLP.py". A further function then transforms the pre-processed corpus into an N-grams file suitable for AZdecrypt.
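For orientation, a minimal pre-processing sketch is shown below; the function name and the exact cleaning steps are illustrative, while the actual implementations live in "NLP.py":

```python
import re
import unicodedata

def clean_corpus(text: str) -> str:
    """Illustrative pre-processing: normalize, uppercase, and keep letters only
    (drops digits, punctuation, spaces and other special characters)."""
    text = unicodedata.normalize("NFC", text)
    text = text.upper()
    # The character class is kept deliberately simple here; adjust it to the
    # alphabet of the corpus at hand.
    return re.sub(r"[^A-ZÀ-ÖØ-Þ]", "", text)
```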
The output file lists each N-gram immediately followed by its log value, a number between 0 and 255 obtained by normalizing the N-gram counts.
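The exact normalization used by the codebase is not reproduced here; the sketch below makes the common assumption that the log of each N-gram count is scaled linearly to 0–255, and the output layout (N-gram immediately followed by a zero-padded three-digit value) simply follows the description above:

```python
import math
from collections import Counter

def write_ngram_file(clean_text: str, n: int, path: str) -> None:
    """Count N-grams in an already cleaned text and write one line per N-gram,
    immediately followed by a value scaled to 0..255 (assumed scheme)."""
    counts = Counter(clean_text[i:i + n] for i in range(len(clean_text) - n + 1))
    if not counts:
        return
    max_log = math.log(max(counts.values()))
    with open(path, "w", encoding="ascii") as out:  # AZdecrypt expects ASCII
        for gram, count in sorted(counts.items()):
            value = round(255 * math.log(count) / max_log) if max_log else 0
            out.write(f"{gram}{value:03d}\n")
```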
All N-grams followed by "000" can be removed. The GUI library used for AZdecrypt does not support Unicode; hence, only languages that can be represented in ASCII are visually supported. A workaround for this problem is substituting Unicode with ASCII and then providing an ASCII-to-Unicode mapping table in the N-gram .ini file. The .ini format is used for simple text files containing initialization parameters. In AZdecrypt, such a file accompanies every N-gram file in the "Ngrams" folder. Its appearance for Persian, a language whose script cannot be represented in ASCII, is:
```
N-gram size=b5
N-gram factor=90.11
Entropy weight=1
Alphabet=#<*)576%4$,3:-+?1;0(2&"!8'/.>9=
Temperature=700
```
In the first line, the "b" stands for "binary"; it should be deleted for all non-binary-formatted N-grams files. The Alphabet line must contain all characters present in the related N-grams file. The Temperature variable refers to the probability of accepting a modification with a lower fitness; it continuously decreases, emulating the process of annealing in metallurgy, hence the name. The strategy adopted in my study to avoid unsupported characters is transliterating the corpus into Latin characters before generating the N-grams file. This is achieved by the functions contained in the file "Replace.py", which are currently available for Greek, Coptic and Cyrillic. Other alphabets can easily be mapped following the model used for the existing "Replace" functions.
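For orientation only, a minimal sketch of such a mapping for Greek is given below; the table and function name are illustrative, and the actual equivalents chosen in "Replace.py" may differ:

```python
# Illustrative (not the repository's exact table): map each Greek capital
# letter to one ASCII character so the resulting N-grams file stays ASCII-only.
GREEK_TO_LATIN = {
    "Α": "A", "Β": "B", "Γ": "G", "Δ": "D", "Ε": "E", "Ζ": "Z",
    "Η": "H", "Θ": "Q", "Ι": "I", "Κ": "K", "Λ": "L", "Μ": "M",
    "Ν": "N", "Ξ": "X", "Ο": "O", "Π": "P", "Ρ": "R", "Σ": "S",
    "Τ": "T", "Υ": "Y", "Φ": "F", "Χ": "C", "Ψ": "J", "Ω": "W",
}

def replace_greek(text: str) -> str:
    """Transliterate an uppercase Greek corpus into ASCII Latin letters."""
    return "".join(GREEK_TO_LATIN.get(ch, ch) for ch in text)
```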
As you can see from the overview above, the AZdecrypt environment is quite intuitive to use as well. After pasting the encrypted text into the window on the left and selecting a decryption method from the menu, select a candidate language from:
File -> Load N-Grams,
then click "Solve".
Enjoy decryption!