Corpus Tools for creating an OCFL repository for Eastern Central Arrernte dictionary recordings.
- Node.js (v22 or newer needed for oni-ocfl)
- npm
- Visual Studio Code
Clone the repository and install dependencies:
git clone https://github.com/Language-Research-Technology/corpus-tools-ec-arrernte.git
cd corpus-tools-ec-arrernte
npm install
You can run the main corpus tool using the provided Makefile:
make
You can override the incldued makefile to include your data location such as:
#!/usr/bin/env bash
make BASE_DATA_DIR=/opt/storage/ECArrernte \
TEMPLATE_DIR=/opt/storage/ECArrernte \
REPO_OUT_DIR=/opt/storage/oni/ocfl \
REPO_SCRATCH_DIR=/opt/storage/oni/scratch-ocfl \
BASE_TMP_DIR=./storage/temp \
NAMESPACE=ec-arrernte-dictionary-recordings \
CORPUS_NAME=ec-arrernte-dictionary-recordings \
DATA_DIR="/opt/storage/ECArrernte/DICTIONARY RECORDINGS TRANSCRIBED ETC/" \
REPO_NAME=LDaCA \
DEBUG=true
Or directly with Node.js:
node index.js -s ec-arrernte-dictionary-recordings \
-t "./template" \
-c ec-arrernte-dictionary-recordings -n LDaCA \
-r "./ocfl-repo" -x "/data/override" \
-d "/data/override" \
-D true \
-p "temp" -z "scratch"
make clean
Run all tests using Mocha:
npm test
Or run a specific test file:
npx mocha test/elan.spec.js
- Open the project folder in VS Code.
- Go to the "Run and Debug" panel (Ctrl+Shift+D).
- Select a debug configuration, e.g. Debug ELAN Tests.
- Set breakpoints in your test or source files.
- Click the green "Start Debugging" button.
The .vscode/launch.json
includes a configuration for debugging tests:
index.js
— Main entry pointlib/
— Library modulestest/
— Mocha test filesMakeFile
— Makefile for common tasks
For issues or questions, please open an issue