-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Description
Write a function that does the following:
- Take a language and a wordform (you may need to map from ISO code to language name; CLiCS might have a mapping table, otherwise use the one from Glottolog).
- Visit the Wiktionary page for the wordform and go to the section for the appropriate language.
- Go to the "Etymology" subsection (or if there are multiple etymology subsections, loop over them).
- Extract the first sentence, of the form "From
<language name>
<ancestral form>
". - Take the
<language name>
<ancestral form>
pairing, and use those as input for another function call. (I.e. use tail recursion.) - Keep chasing etymologies until you’ve hit a dead end. Along the way, save each etymological link to a table (with the fields "Language", "Wordform", "Source language", and "Source wordform").
Once we have a function that does the above, we can just loop it over all the words in NorthEuraLex.
Metadata
Metadata
Assignees
Labels
No labels